Commons:Bots/Requests/DPLA bot (2)
Operator: Dominic
Bot's tasks for which permission is being sought:
This bot is already approved for uploading, and it has performed over 2 million uploads in the last 2 years. This request is for additional approval to extend the bot's scope to adding and updating SDC statements and image metadata so that they stay in sync with data sources. The bot is modeled on METbot.
This bot will edit only DPLA uploads to make changes to the SDC/metadata. It will either add initial data for new uploads (including the backlog of past uploaded files) or make changes to existing SDC statements and/or file metadata if the data source has changed over time.
All SDC statements will be added in the format specified by community consensus at Commons:Structured data/Modeling and on our project-specific page, Commons:Digital Public Library of America/Modeling.
I have made a number of test edits, adding only copyright-related statements to start off, whose format is well-known by the community and used by other bots.
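As a rough illustration of the "add initial data" case described above, the following Python sketch builds statement JSON and keeps only the statements a file does not already have. It is not the bot's actual code: the helper names (`item_claim`, `string_claim`, `missing_claims`, `build_edit_params`) are hypothetical, the choice of copyright status (P6216) set to public domain (Q19652) is just an example value, and the edit token and the HTTP request itself are left out. The parameter names follow the MediaWiki `wbeditentity` API module.

```python
import json

COPYRIGHT_STATUS = "P6216"  # Wikidata property "copyright status"
PUBLIC_DOMAIN = "Q19652"    # Wikidata item "public domain" (example value only)
DPLA_ID = "P760"            # DPLA ID, as named in this request


def item_claim(prop, qid):
    """Build a statement whose value is a Wikidata item (hypothetical helper)."""
    return {
        "mainsnak": {
            "snaktype": "value",
            "property": prop,
            "datavalue": {
                "value": {"entity-type": "item", "numeric-id": int(qid[1:]), "id": qid},
                "type": "wikibase-entityid",
            },
        },
        "type": "statement",
        "rank": "normal",
    }


def string_claim(prop, text):
    """Build a statement whose value is a plain string (hypothetical helper)."""
    return {
        "mainsnak": {
            "snaktype": "value",
            "property": prop,
            "datavalue": {"value": text, "type": "string"},
        },
        "type": "statement",
        "rank": "normal",
    }


def claim_key(claim):
    """Reduce a statement to a (property, value) pair for comparison."""
    snak = claim["mainsnak"]
    value = snak.get("datavalue", {}).get("value")
    if isinstance(value, dict):  # item values are compared by their Q-id
        value = value.get("id")
    return (snak["property"], value)


def missing_claims(existing, desired):
    """Return the desired statements not already present on the file.

    `existing` is assumed to map property IDs to lists of statements, as in
    the "statements" section of a MediaInfo entity. An empty value (None, {}
    or []) is treated as "no statements yet".
    """
    existing = existing or {}
    present = {claim_key(c) for claims in existing.values() for c in claims}
    return [c for c in desired if claim_key(c) not in present]


def build_edit_params(page_id, claims, summary):
    """Assemble wbeditentity parameters; the caller must still add a CSRF token."""
    return {
        "action": "wbeditentity",
        "id": f"M{page_id}",  # MediaInfo ID is "M" followed by the file's page ID
        "data": json.dumps({"claims": claims}),
        "summary": summary,
        "bot": 1,
        "format": "json",
    }
```

This sketch only covers adding statements that are absent. Updating a statement whose value has changed at the data source would additionally need the existing statement's ID, which is not handled here.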
Automatic or manually assisted: Automatic/unsupervised
Edit type (e.g. Continuous, daily, one time run): May run continuously for long periods, with breaks.
Maximum edit rate (e.g. edits per minute): Max allowed per rate limit
Bot flag requested: (Y/N): Y (already flagged)
Programming language(s): Python, using a combination of PWB and direct MediaWiki API calls.
The code that produced the test edits above can be viewed here. The exact code will evolve as other types of statements are modeled. Dominic (talk) 21:26, 1 July 2021 (UTC)
- Discussion
- Could you please make a test run for other types of statements? It would be reasonable to add the year of publication to the copyright statement. --EugeneZelenko (talk) 14:30, 2 July 2021 (UTC)
- @EugeneZelenko: Date fields may be more complicated to model, because DPLA has one "date" while Wikidata has several different possible properties for date. That one I'd like to discuss more with the community first. However, I went and added DPLA ID (P760), if you want to see other data types, e.g. [1]. Dominic (talk) 17:55, 6 July 2021 (UTC)
- Looks OK to me, but please take care with the property's constraints. --EugeneZelenko (talk) 14:20, 7 July 2021 (UTC)
- Indeed, in this case I am just assuming the property's data types will be changed if we want to apply it to Commons files. Dominic (talk) 13:43, 12 July 2021 (UTC)
- @EugeneZelenko: Not seeing any objections, maybe we can wrap this up? Dominic (talk) 20:03, 2 August 2021 (UTC)
If there are no objections, I think the task should be approved. --EugeneZelenko (talk) 14:06, 3 August 2021 (UTC)