Discovery Weekly Update for the week starting 2018-05-07

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

Discovery Weekly Update for the week starting 2018-05-07

Chris Koerner-2
Привет!

Another update from the Search Platform team for the week starting 2018-05-07

**Programming note:** Due to the upcoming Wikimedia Hackathon and some
(personal) holiday time, the next update will be the week of
2018-05-28. Until then, and as always, feedback and questions are
welcome.


== Highlights==
* Map internationalization launched everywhere, and embedded maps
(mapframe) are now live on 276 Wikipedias [0]
* ''"Hello, my name is _____"'' is an in-depth blog post by Trey that
was published earlier this week where he details the irony that
searching for names is not always as straightforward as you might
think. [1]

== Discussions ==

=== Search ===
* Erik updated a script that was populating lots of 500 errors in the logs [2]
* Erik also did a lot of research to evaluate impact of adding ~2700
new shards to production cluster (there is a pdf attached to the last
comment in the ticket that contains more information) [3] There is a
follow-up ticket as well for the next steps [4]
* Trey worked on the analysis config for the new Slovak stemmer that
was deployed this week—but the plugin still needs to be deployed and
the wikis re-indexed. [5]
* Stas and others worked on looking up entities by external
identifiers - the work is done for now, but it needs a re-index to be
fully ready [6]
* David worked on externalizing the parsing logic from
SimpleKeywordFeature and FullTextQueryStringQueryBuilder and it was
pushed into production in April 2018 [7]

== Other Noteworthy Stuff  ==
* Trey's most recent updates to transliteration on the Crimean Tatar
Wikipedia are live; after a year of part-time 10% project work, the
transliteration infrastructure for Crimean Tatar is done and the
accuracy is in the high 90% range. [8]

== Did you know? ==
* The English word “dove”, as the past tense of “dive”, is one of the
rare cases where a conjugation has become more irregular over time.
The verb “dive” picked up the strong conjugation [9] by analogy with
other strong verbs, particularly “drive/drove”. [10] Going in the more
typical direction of regularization, Swedish strong verbs slowly lost
some of their distinctive plural forms. [11] The change started in the
16th century, and was still in progress as late as the 1940s.  From
the search perspective, regular forms are easier to deal with—so, way
to go Swedish!

[0] https://lists.wikimedia.org/pipermail/wikitech-l/2018-May/089964.html
[1] https://blog.wikimedia.org/2018/05/08/searching-for-names-is-not-always-straightforward/
[2] https://phabricator.wikimedia.org/T179266
[3] https://phabricator.wikimedia.org/T192972
[4] https://phabricator.wikimedia.org/T193654
[5] https://phabricator.wikimedia.org/T191544
[6] https://phabricator.wikimedia.org/T99899
[7] https://phabricator.wikimedia.org/T188530
[8] https://phabricator.wikimedia.org/T188321
[9] https://en.wikipedia.org/wiki/Germanic_strong_verb
[10] https://en.wiktionary.org/wiki/dove#Etymology_2
[11] https://en.wikipedia.org/wiki/Swedish_grammar#Historical_plural_forms

---

Subscribe to receive on-wiki (or opt-in email) notifications of the
Discovery weekly update.

https://www.mediawiki.org/wiki/Newsletter:Discovery_Weekly

The archive of all past updates can be found on MediaWiki.org:

https://www.mediawiki.org/wiki/Discovery/Status_updates

Interested in getting involved? See tasks marked as "Easy" or
"Volunteer needed" in Phabricator.

[1] https://phabricator.wikimedia.org/maniphest/query/qW51XhCCd8.7/#R
[2] https://phabricator.wikimedia.org/maniphest/query/5KEPuEJh9TPS/#R

Yours,
Chris Koerner
Community Liaison
Wikimedia Foundation

_______________________________________________
Wikitech-l mailing list
[hidden email]
https://lists.wikimedia.org/mailman/listinfo/wikitech-l