@Sennecaster, MER-C, and Moneytrees: regarding the discussion at URFA/2020 of WP Cyclone FAs, could you please check my work on one FA I already marked “Satisfactory”? I have run everything I know to run, and looked at everything I know to look at, and cannot find any of the problems mentioned. As far as I know, I checked all articles and sub-articles and found no copying within wikipedia without attribution, and I ran Earwig on the last version of the article as it was built (before it underwent copyediting and significant changes at FAC) versus the archive.org versions of the sources. As well as a general Earwig. I come up with nothing, and going diff by diff, it appears that Cyclonebiskit built the article without any of the mentioned problems, and did not copy either to or from other articles. If I missed something, or if there is a technique I need to use, I need to know before I continue looking at the other hurricane articles. Also, if my work is correct, would it be an indication that further checks on Cyclonebiskit’s articles aren’t needed?
For example, I ran Earwig on the last version of Effects/Louisiana before it was submitted to FAC compared to the archive.org version of NCDC source: https://copyvios.toolforge.org/?lang=en&project=wikipedia&title=&oldid=294205893&use_engine=0&use_links=0&turnitin=0&action=compare&url=https%3A%2F%2Fweb.archive.org%2Fweb%2F20120205161759%2Fhttp%3A%2F%2Fwww4.ncdc.noaa.gov%2Fcgi-win%2Fwwcgi.dll%3Fwwevent%7EShowEvent%7E323928
If you are able to let me know if I’m on the right track, I can continue checking other Hurricane FAs on the WP:URFA/2020 list. SandyGeorgia (Talk) 05:47, 1 December 2021 (UTC)
On the season article,
SandyGeorgia (Talk) 06:29, 1 December 2021 (UTC)
Here I found what may be two different things on a Featured article, so I need to stop and check. I believe there was NOAA public domain text copied in to the first iteration of the article, but there are also some other things that show up.[1] This would be a good example for showing me what to do next on cases like this. SandyGeorgia (Talk) 02:46, 2 December 2021 (UTC)
At FAR, I found minor copying within pre-FAC, but considerable copying within during FAR. Don't know what to do with this:
@Sennecaster, MER-C, and Moneytrees: could you all please look at my comments on the Featured article review of this season article? It has both old and new copying within issues, coming from the article and going to the article. I think the version that came to FAC was free of copyvio, but my work should be checked. It does, though, have some current cut-and-paste from (I believe) public domain sources, incurred during the FAR, and would be a great example for all of you to engage to show all of us how to better examine these issues and to help us understand what to do next. SandyGeorgia (Talk) 16:08, 2 December 2021 (UTC)
… here. SandyGeorgia (Talk) 03:03, 15 December 2021 (UTC)
Why does this list not mention FEMA?
SandyGeorgia (Talk) 16:37, 27 December 2021 (UTC)
I am about eight hours into checking Tornado at Wikipedia talk:Featured article review/Tornado/archive2, and have just realized neither it, nor all of its sub-articles, are listed here. I have so far identified no direct cut-and-paste, but plenty of unattributed copying within and public domain text. Why is this suite of articles not included? SandyGeorgia (Talk) 01:50, 28 December 2021 (UTC)
Done, please see and check Wikipedia talk:Featured article review/Tornado/archive2#Summary. SandyGeorgia (Talk) 15:38, 28 December 2021 (UTC)
HurricaneParrot (talk · contribs · deleted contribs · page moves · block user · block log) (WPTC editor) and Cyclonicationly (talk · contribs · deleted contribs · page moves · block user · block log) (tropical cyclone article area editor) have been blocked for socking per Wikipedia:Sockpuppet investigations/HurricaneParrot/Archive. Close paraphrasing has been found from both accounts, with the latter also containing close paraphrasing from non-tropical cyclone articles. Sennecaster has told me (off-wiki) to lay off on requesting a case in the meantime since the user has already been blocked, but I thought it was worth mentioning here since some of the articles with close paraphrasing (e.g., Typhoon Grace (1954), Typhoon June (1954)) are relatively new and aren't part of the list of pages. Chlod (say hi!) 05:07, 15 January 2022 (UTC)
See discussion. MER-C 15:00, 8 May 2021 (UTC)
The most common type of web copyvio seems to be close paraphrasing or copying of impacts and preparations from online news sources. This project is well-archived, so almost all of the links are still live or have archives.
There are some paywalls, mostly to newspapers.com that can be accessed through The Wikipedia Library. When possible, rewriting for the paraphrasing should be used. These are often older articles. There is also some offline sourcing in old (topic-wise) articles.
As of June 2021[update] translation violations in the impact/preparation sections are possible in Vietnamese and Japanese, both of which have difficulties translating well through machines (machine translation, or MTL). DeepL works decently okay with Japanese, but Vietnamese is not an option. When possible, seek someone who can read one of the languages to confirm for violations.
The JTWC, NOAA, NWS, and PAGASA (unless noted) are all considered as in the public domain. All other agencies are copyrighted under fair use terms. If there is substantial close paraphrasing or direct copying, appropriately tag with ((PD-notice)) or remove/rewrite.
There is frequent copying without attribution from the season to storm pages. This occurs most commonly in the Meteorological History sections and can be checked by comparing the date of addition to the storm article with the revision on the season article before the addition date of the storm article. These can be repaired with ((Copied)) if it was not already properly attributed. Tools such as Who Wrote That? and searching revisions by date on the history page work extremely well. This must be manually spotted. It is safest to assume that almost every article started out with copying.
TL;DR: Paywalls are to newspaper archives, web copyvio is in the impact/preparations sections from news sources, translation vio is hard to use MTL for and is present, check the season pages for copy pasting, and make use of revision searching/diff checking tools. Sennecaster (What now?) 05:30, 12 June 2021 (UTC)
The purpose of this investigation should be to identify serial violators in the topic area and how to deal with them. If you find a violation, please list who added it and what kind of violation it was. Moneytrees🏝️Talk/CCI guide 03:35, 5 October 2021 (UTC)
I've given this some thought, and to be honest, this should never have been open. The worst of the violations have already been noted and addressed (the occasional lack of attribution isn't enough to leave this open), and going through a wild goose case on an entire project is a waste of limited resources. The old fashioned way of addressing them as they are found seems to be ok for this situation. Unless there's a strong rationale otherwise I'm just going to close this soon and call it. Wizardman 00:52, 25 February 2024 (UTC)