The Signpost

Technology report

Hue, Sqoop, Oozie, Zookeeper, Hive, Pig and Kafka

October engineering report published

In October:
  • 110 unique committers contributed patchsets of code to MediaWiki (up 12 on September)
  • The total number of unresolved commits stayed at 440.
  • About 35 shell requests were processed (no change).
  • 57 developers received developer access to Git and Wikimedia Labs (up 16).
  • Wikimedia Labs now hosts 137 projects (up 6), has seen 268 instances created (up 27) and has 694 registered users (up 61).

—Adapted from Engineering metrics, Wikimedia blog

The Wikimedia Foundation's engineering report for October 2012 was published this week on the Wikimedia Techblog and on the MediaWiki wiki, giving an overview of all Foundation-sponsored technical operations in that month (as well as brief coverage of progress on Wikimedia Deutschland's Wikidata project, phase 1 of which will soon be trialled on the Hungarian Wikipedia). Of the three headlines picked out for the report, two (the redesign of the mobile site "emphasizing readability and navigation" and the launch of a Wikipedia app for Windows RT and Windows 8 tablets) have already received Signpost coverage. The third drew attention to a proposed redesign on the signup page.

There was news that Parsoid, the new JavaScript-based parser due to be released in December, can "round trip" wikitext to HTML and back perfectly on 75% of a sample of 100,000 pages, does the same imperfectly but without great fault in a further 18%, and chokes on 7%. Parsoid will form the basis for the new Visual Editor and it will be rewritten in C++ to improve its performance. Likewise, there was news of improvements to both the "Page curation" and "Article Feedback" projects; version 5 of the latter is expected to go live on all English Wikipedia articles later this year. The "Wiki Loves Monuments" app was wound down during October, while the most cryptic update came from the WMF's Analytics team, who reported that they had worked on "puppetizing ... Hue, Sqoop, Oozie, Zookeeper, Hive, Pig [and] Kafka", all of which are services that help with distributed, large scale number crunching.

This month's Engineering report is the first for which a "friendly" summary version is also available.

Coding programs

Because only six mentors signed up to help with the Wikimedia Foundation's provisional Google Code-In programme—far fewer than the necessary number of approximately twenty-one—the Wikimedia Foundation has decided not to participate. (mailing list). "I know this disappoints some of you; we do want to encourage new participants, and we want some structured mentorship that isn't just Google Summer of Code," wrote WMF Engineering Community Manager Sumana Harihareswara, adding that she would "start a new thread about a more suitable program for us to participate in." The later thread referred to recognised that the Foundation (on behalf of MediaWiki generally) would still like to participate in a separate Outreach Program for Women, a FOSS outreach program to offer paid internships to women to work on our open source projects (mailing list). Applicants and mentors are welcome.

In brief

Not all fixes may have gone live to WMF sites at the time of writing; some may not be scheduled to go live for several weeks. Though there is no poll this week, last week's question (about the utility of videos) is still open for opinions.

At the time of writing, 15 BRfAs are active. As usual, community input is encouraged.