Support or Oppose[edit]

The following discussion is closed. Please do not modify it. Subsequent comments should be made on the appropriate discussion page. No further edits should be made to this discussion.


A better thought out plan is needed (for example, the interwiki bots must replace the wikitext-based interwiki links with wikidata-based interwiki links if and only if they would otherwise have updated the interwiki links.) —Ruud 16:41, 18 January 2013 (UTC)[reply]

Discussion[edit]

Could you add a short summary to the top that explains what is going to happen? If I understand, they will load all the interwiki data into wikidata, at which point we can safely remove all interwiki links from articles. And the proposal is to do that slowly, rather than trying to have a bot edit every article on the wiki. — Carl (CBM · talk) 11:41, 17 January 2013 (UTC)[reply]

My understanding is that the intention is to have this done alongside other edits (perhaps recruiting the developers of AWB for this?), where it won't create "new" revisions taking up extra server space and connection time unnecessarily. עוד מישהו Od Mishehu 13:00, 17 January 2013 (UTC)[reply]
I don't understand this. Surely using Wikidata capability is an improvement to any article and not a null edit, right? Isn't that the whole purpose of Wikidata? Why would we discourage this? Rmhermen (talk) 16:11, 17 January 2013 (UTC)[reply]
This. --MZMcBride (talk) 17:03, 17 January 2013 (UTC)[reply]
Because at the beginning it will not provide any visible improvement to the article. --Rschen7754 17:17, 17 January 2013 (UTC)[reply]
So? That's also true of hidden categories, and I haven't seen any proposal to limit then. What is exactly the problem that you're trying to avoid, and why this RfC was created? Some concrete example of what kind of edit is affected will also help. Diego (talk) 18:20, 17 January 2013 (UTC)[reply]

Neither the proposal nor the background links explain what we are replacing interwiki links with. If I create a new en:Friend and want to connect it to the existing zh:朋友 and es:Amigo, how do I do that? I don't think it's possible to comment intelligently on this proposal without that information. – Philosopher Let us reason together. 17:15, 17 January 2013 (UTC)[reply]

You have to create an item on Wikidata, which is on another project and outside the scope of this RFC. --Rschen7754 17:17, 17 January 2013 (UTC)[reply]
I assume these are being created automatically from the current interwiki links, right? This is the sort of thing that a short summary could clear up. — Carl (CBM · talk) 23:05, 17 January 2013 (UTC)[reply]
There are bots working on the Wikidata site, but with 7-8 bots and millions of items, it's taking a while. --Rschen7754 23:12, 17 January 2013 (UTC)[reply]
I'm not sure then how a user would know when it is safe to remove the interwikis. The front side of this page implies that at the time of rollout, it will be safe to remove the interwikis as long as it is part of a larger edit... — Carl (CBM · talk) 12:32, 18 January 2013 (UTC)[reply]
Also, the blog post [1] seems to claim the data has already been collected. I have started a new section below about that. — Carl (CBM · talk) 14:13, 18 January 2013 (UTC)[reply]
A lot of Wikidata users have a script installed, as I've indicated below. --Rschen7754 18:41, 18 January 2013 (UTC)[reply]

From the dev corner[edit]

The articles will be a lot smaller after the change, and after a few revisions the saving will be much larger than one additional revision would give, and after some time its most likely there will be an additional revision anyhow.

In addition this policy would enforce continuous operation of iw-bots for prolonged time as we can't rely on working sitelinks for working iw-links. If there are langlinks there they will take presedence. So if the links get out of sync things gets weird.

As this isn't enough templates that are using properties will not work. The parser function for properties could although be set up to use sitelinks anyhow and just disregard any langlinks.

I would say let the bots do their job, it will take weeks, probably months, to switch from langlinks to sitelinks. Perhaps the discussion should rather be whether English should be an earrly bird and rather wait. -- jeblad on his cellphone 82.113.121.29 (talk) 21:01, 17 January 2013 (UTC)[reply]

Just to make sure: this is not an opinion of the dev team, but of a single developer, who totally have the right to speak their own mind :) But please do not read this as "the Wikidata team says". Officially we do not have an opinion on this stance. We will answer all technical questions that you have, but we leave any policy decision to the communities of a given wiki. If there is a policy decision which might raise issues, and we are made aware of it, we will comment on it to point this out, but in general we regard the Wikipedia communities as autonomous. Cheers! --Denny Vrandečić (WMDE) (talk) 10:39, 18 January 2013 (UTC)[reply]
What does "it will take months to switch" mean?
  • It will take months to load the data into Wikidata
or
  • It will take months to remove all the remaining links from enwiki after the data is loaded into wikidata
I assume that the data will be loaded using the info already in the enwiki database's langlinks table, right? — Carl (CBM · talk) 12:37, 18 January 2013 (UTC)[reply]
I doubt it will be that easy, there are a lot of times where interwiki links are a mess. What makes enwiki more correct than say dewiki in these cases? Mass conversion will probably work for 90% of cases but the remainder will be a disaster to straighten out. Werieth (talk) 13:29, 18 January 2013 (UTC)[reply]
I read through the two links provided on the other side of this page, and the proposal. What I took away was: after a certain data, interwiki links on enwiki will be converted to use Wikidata. At that point, we can begin to blindly remove all interwiki links from the source code of pages on enwiki, because the data will already be in Wikidata. The proposal here states that the removal should only be done as part of another edit, rather than having people simply go from article to article removing interwiki links. If that is not what the proposal is saying (and it seems like it is not, from your comment) then the proposal needs to be significantly clarified. (I realize you did not write the proposal; I am not trying to assign any blame, just to clarify the situation that I don't understand.) In particular, in what circumstances can I blindly remove the interwiki links from a page on enwiki? — Carl (CBM · talk) 14:07, 18 January 2013 (UTC)[reply]
You can never blindly remove links from a wiki page after implementation of Wikidata. For example, let's say that the Wikipage for John Doe matches a page named "John Doe" in four or five language projects, but on 4 or 5 other ones it links up to "John Doe Company, Ltd", because those languages put more emphasis on John Doe's company than on his biography. Eventually, there will be two articles on all Wikipedias (if John Doe and his company are notable enough in all languages). The trick is sorting and checking the interwikis on Wikidata. Everyone who is checking interwikis will have to make this manual check on Wikidata, rather than the source code of the Wiki page. This will have to be manual, and therefore it will go slowly. Jane (talk) 17:29, 18 January 2013 (UTC)[reply]
Thanks for the info. If this is the case, I am not sure (and of course this is not your fault): why does the RFC talk about "Edits that solely remove the interwiki links should not be performed," if it is not possible to make sure edits blindly in the first place? — Carl (CBM · talk) 17:47, 18 January 2013 (UTC)[reply]
If someone has not yet played around with Wikidata but just reads all the headlines about it, it would be easy to assume that during any casual edit that all interwikis can be blindly removed from any article, which of course is perfectly possible to do. However, per the above, Wikipedians should *not* blindly remove the manual interwiki links without checking them against the Wikidata links first. In my example above, let's say that after Wikidata implementation, one of the languages that previously had just one article on John Doe creates an article on John Doe's company. The person creating the article on the company needs to untangle the interwiki links on the John Doe page as well as on Wikidata for both articles. Jane (talk) 11:05, 19 January 2013 (UTC)[reply]
I have a script installed that tells me if there is an item on Wikidata for the current article. --Rschen7754 18:25, 18 January 2013 (UTC)[reply]
@CBM: that is exactly what the proposal is saying. --Rschen7754 19:06, 18 January 2013 (UTC)[reply]
@Rschen7754: the proposal does not really say anything clearly about the process. So far on this talk page I have seen "You can never blindly remove links from a wiki page after implementation of Wikidata" and elsewhere I have seen someone claim that manual attention is required to examine each Wikidata item. But you are talking as if someone might start doing several articles per minute. Someone else said it's not even the case that all the links are loaded into Wikidata. I'm not planning to comment here additionally, because I don't think I can say anything sensible. — Carl (CBM · talk) 19:26, 18 January 2013 (UTC)[reply]
That's basically what I was getting at earlier - this proposal assumes that contributors are already wikidata experts. At this point, it's not even fair to assume that people know what it is, let alone that they understand how it works/the background/etc. I, similarly, have nothing else to offer here. – Philosopher Let us reason together. 06:54, 20 January 2013 (UTC)[reply]
On the contrary, I see no reason to include the facts about how Wikidata works, which are not up for discussion because that's how the developers programmed the site. Neither do I see a reason to include information that may become outdated at any point, as the developers are still changing the site around enough to break many bots. This would be like explaining how to use Commons in our image policies. --Rschen7754 06:58, 20 January 2013 (UTC)[reply]

Is the data in Wikidata?[edit]

According to the blog post,[2]

Now that the editors (with the help of a few bots) have collected all these links to articles on the various Wikipedias it is time to make use of them.

I read that as saying

Editors and bots have collected into Wikidata as much data as can be automatically collected from all the Wikipedia projects, including enwiki.

Is that not what it is trying to say? — Carl (CBM · talk) 14:11, 18 January 2013 (UTC)[reply]

They've collected a massive amount of data and are continuing to do so. It's not complete yet however. --Lydia Pintscher (WMDE) (talk) 14:45, 18 January 2013 (UTC)[reply]
Someone calculated that there are 17 Million different topics i.e. possible Wikidata Items i.e. groups of languagelinks of that 3 Million have been created yet and each 15 day about another million is created. But only a few bots are working on it yet once it goes live on all wikipedias all the interwikibots could work on wikidata instead and create the rest in no time.--Saehrimnir (talk) 22:24, 18 January 2013 (UTC)[reply]

Common sense[edit]

My thought is that this RFC needs common sense behind it. If an article has substantial savings to removing the interwiki links, that's a good thing. But what we don't want is untrained hordes of people racing around to every article trying to scrub the interwiki links out. Among other things, doing it from regular accounts will flood recent changes and make patrolling more difficult. Also, people could screw up and take out GA/FA data. And some interwiki data apparently hasn't been ported to Wikidata yet. If it's done, it should be done by a limited number of bot accounts that are set to target articles with significant numbers of interwiki links (say more than 10 or 20). It should also be integrated into AWB and .py framework so that as other bots and users go through and do other changes, it will be done along with them (like how we did the Image: -> File: conversion).

I assume a wikidata dev (or MZMcBride or Legoktm) could give us a nice chart of the number of articles with at least 1, 5, 10, 20, 50, or 100 interwiki links to give us some actual impact data to work with. Could one of you do that, please? Thanks. MBisanz talk 19:32, 19 January 2013 (UTC)[reply]

Common sense sounds good! After a day or two of contributing to Wikidata I found the odd set of data that was missing interwikilinks. Makes sense to keep them on EN until we are sure that all data is extracted (if that is possible). I think, when the time comes, the task should be dealt with by bots on the whole and if an editor feel inclined to remove interwikilinks while making another edit then they should be able to do so, But not just to load up something such as AWB go through every article looking for interwiki links. ·Add§hore· Talk To Me! 19:44, 19 January 2013 (UTC)[reply]
Per request posted at Wikipedia_talk:Database_reports#Data_request:
Has at least this many wikilinks Number of articles
1 2650578
5 837048
10 411428
20 166797
50 20987
100 3833
- TB (talk) 20:10, 19 January 2013 (UTC)[reply]
Based on that data, I think it would be wise for a bot to run through those articles with more than 50 interwiki links and leave articles below that threshold to the normal editing of bots, AWB, and what not. MBisanz talk 20:17, 19 January 2013 (UTC)[reply]
I like this idea, there's probably a lot of overlap with watchlists in those with lots of interwiki links due to them likely being high profile articles. Get the most watched pages out of the way quickly and painlessly. James086Talk 12:07, 20 January 2013 (UTC)[reply]
One of the things AWB is very good at is these "edit side-effects" - that is, a user loads a page to do some other task (such as stub sorting, semi-automated category renaming, etc) and these edits are suggested automaticly. I definitely think that AWB should implement this - and make sure they know what they're doing; if they do this - we will probably have lots of editors who will gradually remove these while not taking them off the wrong articles. עוד מישהו Od Mishehu 06:24, 21 January 2013 (UTC)[reply]
It would be great if in AWB you could somehow link to the wikidata item for checking. I also agree that 50+ interwikis will be a safe bet for auto-removal of interwiki links. In my example above I was thinking of the far more common examples with 4-10 interwiki links. Jane (talk) 13:32, 21 January 2013 (UTC)[reply]
Well, if bots are going to be removing the links from here, they should first be double-checking to make sure that Wikidata has all of them. For highly-interwikied articles, it's actually quite common for us to have a handful of links that Wikidata doesn't have. — PinkAmpers&(Je vous invite à me parler) 06:01, 2 February 2013 (UTC)[reply]
Does it matter if wikidata has all of them? Surly you just need to check if the language link you are going to remove exists on wikidata, and then remove any others that also appear linked to your article..? ·Add§hore· Talk To Me! 18:46, 13 February 2013 (UTC)[reply]

Interwiki links and G/FAs[edit]

Just a thought, if there is a G/FA in another language and therefore ((link FA)) or ((link GA)) exists on a page, once the extension is enabled would that language be listed twice? Edgepedia (talk) 18:30, 13 February 2013 (UTC)[reply]

No, because those templates do not generate the links. --Rschen7754 18:35, 13 February 2013 (UTC)[reply]
I just checked, if interwiki links get removed but the templates stay in the article, everything looks perfect.--Ymblanter (talk) 21:28, 13 February 2013 (UTC)[reply]

Wikidata deployed[edit]

Wikidata Phase I (interwiki links) was deployed on English Wikipedia about an hour ago.--Ymblanter (talk) 21:29, 13 February 2013 (UTC)[reply]

Where is this documented, with Wikipedia editors in mind? Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 22:58, 13 February 2013 (UTC)[reply]
Wikipedia:Village pump (miscellaneous)#Wikidata deployment phase 1 contains the necessary links, including the blog post and the FAQ on Meta.--Ymblanter (talk) 23:03, 13 February 2013 (UTC)[reply]

Shouldn't this RfC be closed, now? Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 11:52, 18 February 2013 (UTC)[reply]