The following discussion is an archived debate. Please do not modify it. Subsequent comments should be made in a new section. The result of the discussion was Approved.

Operator: -- lucasbfr talk

Automatic or Manually Assisted: Automatic, after review of the list of affected images

Programming Language(s): C# (DotNetWikiBot + API)

Function Summary: Browse Category:Images on Wikimedia Commons to discover instances where the same image is on WP and commons under different names. Replace the Wikipedia version by the Commons version.

Edit period(s) (e.g. Continuous, daily, one time run): Once per week? (Manual start)

Already has a bot flag (Y/N): N

Function Details: The idea is to tackle the backlog at Category:Images on Wikimedia Commons. The images that sit there are said to be on both the local and remote repositories. Checking and deleting duplicates is a trivial matter. However, the tedious part is when both images don't share the same name and the local version is used. This bot would browse the category once a week, check whether there is a duplicate on commons (SHA1 hash + size), check the image is tagged with a free license (to avoid losing Non Free images because they were incorrectly tagged) and is not featured, check the commons image is not shadowed by an other Wikipedia image, and if everything is correct replace the Wikipedia image by its Commons counterpart in all namespaces.

If no Commons version can be found, it will remove the image from the category.

I have made some tests and the detection procedure looks accurate. -- lucasbfr talk 09:37, 6 January 2009 (UTC)[reply]

Discussion[edit]

With this task, does your bot have any mechanisms to check if the version of the image on commons is an image that should not be on commons? i.e. the copy on Wikipedia is fair use or under an incompatible license. Because if a fair use image ends up on commons under a different name, the bot could potentially replace the fair use Wikipedia one (permitted by policy) to the same image on commons (obviously not allowed). Foxy Loxy Pounce! 01:17, 7 January 2009 (UTC)[reply]

I debated this when I designed it, and I think it wouldn't be of much use in the end: The bot will only proceed images that are under a free license on en.wp, if it doesn't find one of the following templates, it marks the image as non free in his report:
{ "gfdl", "cc-by", "Multilicense replacing placeholder", "((PD", "NoRightsReserved", "GPL", "((Attribution", "CopyrightedFreeUse", "No rights reserved" }
The bot also checks the image is not under any deletion process here (PUI, ifd, fair-use)
I can add the same check on Commons, but if the image is tagged as free on en.wp, it will most probably be tagged the same there too :/
The list of images that will be unlinked will be reviewed as a gallery before proceeding, so any blatantly incorrectly tagged image won't be processed. -- lucasbfr talk 08:29, 7 January 2009 (UTC)[reply]
My concerns have been addressed. Foxy Loxy Pounce! 14:07, 7 January 2009 (UTC)[reply]
Approved for trial (20 edits). Please provide a link to the relevant contributions and/or diffs when the trial is complete. RichardΩ612 Ɣ ɸ 18:38, 8 January 2009 (UTC)[reply]
 Done, see the contributions.
It delinked the 4 pictures at User:Lucasbfr/CommonsImages (Ok), noticed a problem with 1 picture (there's a local picture with the same name that shadows Commons' copy) User:Lucasbfr/CommonsImages (Warn) and refused to act on 2 pictures (they weren't to be found on Commons) User:Lucasbfr/CommonsImages (Errors). There were no non-free image on that (very small) subset. -- lucasbfr talk 11:12, 10 January 2009 (UTC)[reply]
Note that there was a small glitch on the 10:55 edit, that was fixed immediately. -- lucasbfr talk 11:13, 10 January 2009 (UTC)[reply]

 Approved. BJTalk 11:24, 10 January 2009 (UTC)[reply]

The above discussion is preserved as an archive of the debate. Please do not modify it. Subsequent comments should be made in a new section.