This is a FAQ regarding EarwigBot's copyright violation detector.

Last updated: Earwig (talk) 19:35, 25 May 2014 (UTC)

Copyright

Main page: Wikipedia:FAQ/Copyright

What are copyright violations?
A copyright violation is any text or file used on Wikipedia without proper permission from the copyright owner.
I am the author of the content. What can I do?
You can choose to release the content under a suitable license. If you fill out a declaration of consent and email it to permissions-en@wikimedia.org, a volunteer will make the necessary changes to the page.
The content is in the public domain or under a suitable license. What can I do?
TODO

AFC

How does the bot find articles to search?
The bot checks every AFC submission (i.e. any page whose title starts with "Wikipedia:Articles for creation" or "Wikipedia talk:Articles for creation") immediately after the page is created, restored, or moved into the AFC space. Pages are never checked more than once.
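The selection rule described above can be sketched in a few lines of Python. This is only an illustration, not the bot's actual code: the names `AFC_PREFIXES`, `is_afc_submission`, and `CheckTracker` are hypothetical, and tracking pages by a numeric page ID is an assumption made so that a moved page is still recognized as already checked.

```python
# Hypothetical sketch of the page-selection rule; not EarwigBot's real code.
AFC_PREFIXES = (
    "Wikipedia:Articles for creation",
    "Wikipedia talk:Articles for creation",
)


def is_afc_submission(title: str) -> bool:
    """Return True if the page title lies in the AFC space."""
    return title.startswith(AFC_PREFIXES)


class CheckTracker:
    """Remembers which pages were already checked (assumed keyed by page ID)."""

    def __init__(self):
        self._checked_ids = set()

    def should_check(self, page_id: int, title: str) -> bool:
        # Skip pages outside the AFC space and pages checked before.
        if not is_afc_submission(title) or page_id in self._checked_ids:
            return False
        self._checked_ids.add(page_id)
        return True
```

For example, `CheckTracker().should_check(1, "Wikipedia talk:Articles for creation/Example")` returns `True` the first time and `False` on any later call with the same page ID.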
How can I make it check a specific page?
You can use the web interface at tools:~earwig/copyvios. A report for each page tagged by the bot is also available by clicking on the "comparison" link from ((AfC suspected copyvio)).
What do I do if the bot correctly identified a violation?
Copyvios are deletable under CSD G12, so you may tag the page with ((db-copyvio)). You can also blank the submission's content, add ((afc cleared)), and decline it using the "cv" reason. If a violation exists but is not serious enough to warrant deleting everything, you can remove the copied parts and the bot's ((AfC suspected copyvio)) template, then review the submission normally.
What do I do if the bot incorrectly identified a violation?
Remove the ((AfC suspected copyvio)) template to avoid confusing other users, and please let me know if the mistake is something that should be fixed. If the suspected violation is actually a mirror or public domain site, consider asking an admin (or me, on my talk page) to add the URL to User:EarwigBot/Copyvios/Exclusions.

Technical

How does it detect violations?
TODO
Basically, it searches the web for chunks of the page's text via Yahoo! search (using a Yahoo! BOSS API key paid for by the WMF).
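As a rough illustration of the "chunks of text" idea, the sketch below splits a page's text into fixed-size word groups and turns each into a quoted phrase query. It is a minimal sketch under stated assumptions, not the bot's real algorithm: the function name `make_queries`, the chunk size, and the query limit are all hypothetical, and the call to the search engine itself is deliberately left out.

```python
# Hypothetical sketch: build exact-phrase search queries from page text.
def make_queries(text: str, words_per_chunk: int = 8, max_queries: int = 3) -> list:
    """Split text into word chunks and return quoted phrase queries."""
    words = text.split()
    chunks = [
        words[i:i + words_per_chunk]
        for i in range(0, len(words), words_per_chunk)
    ]
    # Drop a short trailing chunk; very short phrases match too many pages.
    full_chunks = [c for c in chunks if len(c) == words_per_chunk]
    # Quoting asks the search engine for an exact phrase match.
    return ['"' + " ".join(c) + '"' for c in full_chunks[:max_queries]]
```

Each returned string would then be submitted to a search engine, and the result URLs compared against the submission's text.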
How can I make it ignore a specific mirror/public domain site?
Ask an admin (or me, on my talk page) to add the URL to User:EarwigBot/Copyvios/Exclusions.
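To show how an exclusion list of this kind could be applied, here is a minimal sketch that tests whether a search result falls under an excluded URL. The entry format (a host with an optional path, such as `example.com/mirror`) and the names `normalize` and `is_excluded` are assumptions for illustration; the real format of User:EarwigBot/Copyvios/Exclusions and the bot's matching rules may differ.

```python
# Hypothetical sketch of matching a URL against an exclusion list.
from urllib.parse import urlparse


def normalize(url: str) -> str:
    """Reduce a URL to 'host/path' without scheme, 'www.', or trailing slash."""
    parsed = urlparse(url if "://" in url else "//" + url)
    host = (parsed.hostname or "").lower()
    if host.startswith("www."):
        host = host[4:]
    return host + parsed.path.rstrip("/")


def is_excluded(url: str, exclusions: list) -> bool:
    """Return True if url equals or lies beneath any excluded entry."""
    target = normalize(url)
    for entry in exclusions:
        prefix = normalize(entry)
        if target == prefix or target.startswith(prefix + "/"):
            return True
    return False
```

With the list `["example.com/mirror"]`, the URL `http://www.example.com/mirror/Page` would be skipped, while `https://example.com/mirrors` would still be checked.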