Commons:Bots/Requests/DrilBot

From Wikimedia Commons, the free media repository
Jump to navigation Jump to search

Operator: Drilnoth (talk)

Bot's tasks for which permission is being sought: Aiding in the license migration. DrilBot is currently doing this on the English Wikipedia and it will add the migration parameter in the following instances: If the image is tagged as both GFDL and cc-by-3.0 or cc-by-sa-3.0, then migration=redundant. If the image seems to indicate that it is public domain or fair use in addition to GFDL, then migration=needs-review. If the image is tagged with {{GFDL-self}}, and it does not contain a URL, then migration=relicense. The bot can't currently check the upload dates of images in order to tag normal {{GFDL}} and similar tags safely based on upload date, but self-created images should almost all be relicensed (I'm about to request that AWB be enhanced with date-checking functionality). This date issue also means that after August 1 it will not be able to relicense images, just tag them redundant/needs-review.

The RegEx needed to make the bot work is kind of complicated, so it won't really be possible to program it with all the license templates (just the major ones), but this shouldn't be a problem because it will just skip pages with license tags it might not recognize, not change the page incorrectly. I'll get the list of files from various sources, including categories and lists of what pages transclude certain templates.

Automatic or Manually Assisted: Automatic.

Edit type (e.g. Continuous, daily, one time run): When I'm around to debug the bot if someone finds an error.

Maximum edit rate (eg edits per minute): On en.wp, the bot's been running at about 10 edits per minute. Depending on where it is in the various lists of images that may need migration, it varies between about 4 and 14 seconds (it just saves about 1 second after finding a file that it will change). I can slow this down if required.

Bot flag requested: (Y/N): Y

Programming Language(s): AutoWikiBrowser. It uses the advanced find-and-replace features of AWB to do all this, and has worked without any problems to my knowledge on the English Wikipedia.

Drilnoth (talk) 19:46, 15 July 2009 (UTC)[reply]

Discussion

  • Please don't edit images using {{License migration is redundant}}.
  • Fair use? We don't do fair use at Commons
  • What's the regex?
  • Oh, btw, beware of the original upload log (don't alter it)

Please note that User:BotMultichill, User:BotMultichillT and User:Abibot are currently filtering out the "easy" images in Category:License migration candidates. Multichill (talk) 19:55, 15 July 2009 (UTC)[reply]

  • Hmm... I hadn't known about those bots (the license migration task force page and talk page don't seem to say anything about them, or at least I didn't notice them, and the size of Category:License migration completed compared to en:Category:Wikipedia license migration completed made me think that bots weren't running there, so I thought it had all been manual thus far). Anyway, here are my responses to your other comments; if you think that the bot would be useful after seeing the answers, I'll be happy to run it, but if you think it would be redundant that's okay (sorry for not doing more research!)
    • 1: Can do. Things like {{Self}} and {{GFDL-self}} still need some work... e.g., if it is tagged only as GFDL (no cc) without a migration parameter, then it can probably be relicensed, and if a cc template is on the same image but for some reason outside of {{Self}} or {{GFDL-self}}, then the migration is redundant.
    • 2: Well, that's already programmed into the RegEx and might as well be used... if something is mentioned as fair use at Commons then something is really wrong. Here it would more apply to the public domain & GFDL combination.
    • 3: en:Wikipedia:Bots/Requests for approval/DrilBot 3 has the basic find/replace RegEx that I am using on en.wp (in the collapse box). I've made a number of modifications since then in order to fix some bugs, and on Commons I'd add a few more replacements for other license templates and remove things such as the {{Self}} updating which Commons doesn't need because of {{License migration is redundant}}.
    • 4: I think that I can do that. If worse comes to worse, just skip pages with "Original upload log" on them, but I'm pretty sure that that could be coded without too much trouble. --Drilnoth (talk) 20:14, 15 July 2009 (UTC)[reply]

 Support LiCom needs all the help it can get. Huib talk 19:59, 15 July 2009 (UTC)[reply]

Account looks not created. Please also do a test run. --EugeneZelenko (talk) 14:40, 16 July 2009 (UTC)[reply]

  • Account creation with the unified login was being goofy... I thought it was created with the unified login, but it seemed that it wasn't, but I went to create it with the unified login, and it looked like it was created, and now it's not. :/ Ah, well. What are the Commons policies on test runs then? E.g., any timeframe or edit limit? --Drilnoth (talk) 01:55, 17 July 2009 (UTC)[reply]
    • Just don't go to fast and don't break it. The current bots are doing a number of easy replacements. The command is

replace.py -lang:commons -family:commons -regex -nocase "\{\{self(2)?\|GFDL\|cc-by-2.5\}\}" "{{Self|GFDL|Cc-by-sa-3.0-migrated|Cc-by-2.5}}" "\{\{self(2)?\|GFDL\|cc-by-sa-2.5,2.0,1.0\}\}" "{{self|GFDL|Cc-by-sa-3.0-migrated|Cc-by-sa-2.5,2.0,1.0}}" "\{\{GFDL-self\}\}" "{{Self|GFDL|Cc-by-sa-3.0-migrated}}" "\{\{Self(2)?\|GFDL\}\}" "{{Self|GFDL|Cc-by-sa-3.0-migrated}}"

"\{\{self(2)?\|GFDL\|FAL\}\}" "{{self|GFDL|Cc-by-sa-3.0-migrated|FAL}}" -summary:"Changing image for [[Commons:License Migration Task Force/Migration|the big license migration]]" -excepttext:"Original upload log" another bot is working on {{GFDL-user}}. These bots will greatly reduce the number of images in Category:License migration candidates (before these bots started running we had almost 800.000 images in there). It would be nice if you could focus on the more difficult cases like for example images transfered from other wiki's (the images are skipped by me). Multichill (talk) 14:20, 17 July 2009 (UTC)[reply]

Sounds good; I'll see if I can code that. Could an admin add the bot to Commons:AutoWikiBrowser/CheckPage so that I can start the trial (I hope to in the next few days; I've been busy recently). --Drilnoth (talk) 03:05, 18 July 2009 (UTC)[reply]

Account is not created yet. I think this request should be suspended until the account is created.--Kwj2772 (msg) 13:02, 18 July 2009 (UTC)[reply]

I agree, create the account and we'll continue. Multichill (talk) 13:14, 18 July 2009 (UTC)[reply]
Okay... the account should be created. It was weird... usually when I go to a new wiki, my main account is already logged in, but with the DrilBot account it wasn't. Huh. The account should be created now. --Drilnoth (talk) 02:41, 20 July 2009 (UTC)[reply]

Actually, I don't think that I'll have the time to code the more complicated replacements (especially with AWB and the date problems), so I'm going to withdraw this request. The current bots seem to be fixing most things without too much difficulty anyway. --Drilnoth (talk) 02:50, 20 July 2009 (UTC)[reply]