User talk:Dan Polansky/2019

Admin again edit

Latest comment: 5 years ago3 comments2 people in discussion

Hey Dan. I'd like to nom you for adminship again. Sure, Wiktionary:Votes/sy-2016-08/User:Dan Polansky for admin failed and caused lots of people to talk, and since then you've only been blocked once, and I can't see anyone on your talk page reprimanding you. Wanna try again? --Wonderfool Dec 2018 (talk) 13:40, 1 January 2019 (UTC)Reply

Back in 2016, the probability of pass seemed rather low to me. Now, the probability of pass (crossing the 2/3 threshold) seems to approach zero. It does not seem worth the energy. --Dan Polansky (talk) 14:55, 1 January 2019 (UTC)Reply

Too bad. Admin votes are a great social event and usually generate such stimulating comversations. --Wonderfool Dec 2018 (talk) 16:46, 1 January 2019 (UTC)Reply

Creating entries in languages you don't know edit

Latest comment: 4 years ago9 comments3 people in discussion

You've been making a lot of these recently, like дэмакрат (demakrat) and demokratas. In general, it's a bad habit to create entries in languages you know so poorly that you can't even provide gender. But if you're going to do this, you should make reasonable efforts for those entries to meet some basic level of quality. For example, that would mean using the proper headword-line templates so that when you don't know the gender, the entry is placed into a category requesting that. For a language like Lithuanian, where we also try to provide the accent and declensional paradigm on the headword for nouns, you can easily grab that from the very dictionary that you're adding as a reference. As it stands, once those entries are blue links that aren't in any maintenance categories, it will likely be a very long time until anybody cleans them up, or even checks whether you've gotten them right. —Μετάknowledge^{discuss/deeds} 16:35, 9 February 2019 (UTC)Reply

Czech pivní sýr was created on 25 January 2019‎ by Metaknowledge, who does not profess to know Czech on their user page; no verification was entered into the entry except for Wikipedia. As for gender, my German is quite serviceable and actually battle tested, so to speak, but when entering German gender I always check with sources; the phrase "languages you know so poorly that you can't even provide gender" makes no sense to me. Accuracy and verification available to the customer are great things, and we should have more of them; by contrast, our redlinks in translation tables lack verification and people should work on turning them into bluelinks. --Dan Polansky (talk) 19:56, 9 February 2019 (UTC)Reply

Yep, I don't speak Czech, so I am careful to check that I've got it right and that I can add whatever the headword ought to have (in this case, just gender) when I create such an entry. You are not putting in that level of care. Even for German, which you do know, you are not bothering to use {{de-noun}}, the standard for our entries. Why? —Μετάknowledge^{discuss/deeds} 21:21, 9 February 2019 (UTC)Reply

My prime concern is with accuracy combined with verification; {{head}} is generally fine and does not require the user to enter various additional bits. I want my undivided attention to be channeled toward making sure that the semantic information I am entering is correct, and that the lemma is appropriate. The effort not only leads to new accurate entries equipped with verification artifacts but also to removal of incorrect information such as in diff and diff. --Dan Polansky (talk) 21:27, 9 February 2019 (UTC)Reply

The main problem is with incomplete entries that don't show up in any maintenance categories and won't be noticed for years. If there are no maintenance categories showing, it might be a good idea to put {{attention|de|needs gender}} or something along those lines. Either that, or look it up in Duden online. I find myself doing both when I find misformatted new German entries in my patrolling- though my German is far worse than yours. Chuck Entz (talk) 22:55, 9 February 2019 (UTC)Reply

English Wiktionary entries are usually incomplete. For instance, for English entries, only a tiny fraction of them has IPA pronunciation and another tiny fraction has at least one example sentence; these entries do not carry {{attention}} template. People looking for incomplete entries to work on have to learn how to find them. For instance, insource:/head\|be\|noun\|g=}}/ finds Belarusian nouns without gender. On a related note, the reader looking for gender is very often served by the further reading or by another Wiktionary, both one click away; but they are only served so if an entry with a further reading exists. --Dan Polansky (talk) 06:18, 10 February 2019 (UTC)Reply

Let's consider the hub benefits (one click away from other sources) on the example of Bulgarian entry китара. The entry is linked from guitar but since Bulgarian Wiktionary does not have the entry, there is no further link to another Wiktionary from the translation table. A user who lands in китара may be satisfied with the information provided: language, meaning and apparent verification (in this case I also entered gender, but let's suppose the gender is not there). If they are not satisfied, they can try the external link, which provides gender but no inflection. That failing, they can try a large competing Wiktionary, fr:китара, and there they find inflection and gender. They'll do well to try the French Wiktionary; some other Wiktionaries provide neither inflection nor gender. They can do all these things by clicking links with no typing; the only thing they have to type is guitar, in Latin script. --Dan Polansky (talk) 08:39, 10 February 2019 (UTC)Reply

Later: For ease of reference: I now created Wiktionary:Beer parlour/2019/March#Translations in languages you don't know since another user has received a message to an effect that matches neither policy nor common practice. --Dan Polansky (talk) 10:06, 17 March 2019 (UTC)Reply

Later: In teny Malagasy entry, Metaknowledge again reckoned with whether someone knows the language, and out of process removed a sense traced to a source. As Talk:teny shows, the sense "language" is traceable to a modern source and malagasyword.org even has an example sentence using the meaning "language". This one little case shows again that what matters is careful work with sources. --Dan Polansky (talk) 08:49, 22 June 2019 (UTC)Reply

Creating incomplete entries redux edit

Latest comment: 5 years ago11 comments3 people in discussion

I'm going to second what User:Metaknowledge said. You've recently created a whole lot of extremely low-quality entries in a lot of languages. PLEASE DO NOT DO THIS. If you're not willing to bother to learn how to create proper entries, it would be much better for you to refrain from creating entries at all than create bogus, low-quality entries. User:Metaknowledge has already addressed many of the issues, but I should note that e.g. an entry in Russian should

include the accent
use the proper headword template ({{head|ru|noun}} or {{head|ru|verb}} is NOT acceptable)
include the declension or conjugation
include the pronunciation using {{ru-IPA}}.

Your entry e.g. for полигамия does none of these things, and the corresponding Bulgarian entry looks just as bad. Plenty of Wiktionary editors, including many admins, have commented at various points on problems with entries you've created; please listen to them. Benwing2 (talk) 06:20, 13 February 2019 (UTC)Reply

Let me add that part of being a good editor is working *WITH* the other members of a given language community. If other members have established certain standards for entries, you should follow those; you should not simply create your own rules and ignore what everyone else has done. If everyone did that, the result would be chaos. "English entries do it such-and-such a way" is NOT a good justification for ignoring a community's rules; English is not Russian is not Bulgarian, etc., and the goals of foreign language entries in the English Wiktionary are entirely different from the goals of English entries in the English Wiktionary. Benwing2 (talk) 06:24, 13 February 2019 (UTC)Reply

The English Wiktionary has a long tradition of incomplete entries. For instance, English entries largely lack IPA markup of pronunciation, and so do Czech entries. I think providing a minimum entry with a definition and a further reading is excellent service for the dictionary user, and can be provided in volumes. I have been creating incomplete Czech entries for over a decade, and was thanked for it at the beginning. At the beginning, I could not even provide good further reading since there was none online.

Should I be prevented from contributing value in the form of accurate entries to the non-paying customer of the English Wiktionary with the use of force, I will use Beer parlour to address this as a policy issue. I submit to policy and to demonstrable consensus, as usual. I think it would be more proper of those who want to impose a non-existing policy to demonstrate consensus by starting a Beer parlour discussion themselves, but I can do it myself if required. I am certainly not "ignoring a community's rules", but rather, I am behaving in a way consistent with applicable policies that I know of, and with consensus in so far as I can determine its presence.

Let me emphasize that I have argued the matter on substance, not only on rules and policies. As for substance, a minimum entry with further reading is hugely better for the customer than no entry; this is argued in greated detail in #Creating entries in languages you don't know above. As for policy, I know of no policy prohibiting creating of entries without pronunciation and inflection. As for non-policy-based consensus, I know of no Beer parlour discussion from which it follows there is consensus against minimum entries. --Dan Polansky (talk) 11:29, 16 February 2019 (UTC)Reply

More comments on the things said. "You've recently created a whole lot of extremely low-quality enties": Not really. I created accurate entries that lack completeness, which is lack of vertical quantity, not quality. As for "than create bogus, low-quality entries": I did not create "bogus" entries; that is really inappropriate. Again, the quality is fine, but there is vertical quantity lacking. These disparaging comments are inaccurate and inappropriate. --Dan Polansky (talk) 11:35, 16 February 2019 (UTC)Reply

I'm not too keen on you creating such stubs, but I agree that calling them "bogus" is inaccurate. Per utramque cavernam 13:40, 16 February 2019 (UTC)Reply

As for whether minimum entries are better than no entries, Wiktionary:Votes/bt-2007-12/User:Tbot creating FL entries vote passed 12:2 and has not been rescinded. Unlike Tbot, I am not a bot and provide guarantee on accuracy. Unlike Tbot, I provide further reading, which is excellent added value. --Dan Polansky (talk) 11:39, 16 February 2019 (UTC)Reply

That vote is more than 11 years old. I'm fairly sure such a bot wouldn't be allowed to operate today. Per utramque cavernam 13:40, 16 February 2019 (UTC)Reply

The Tbot vote has not been rescinded so it still has all the legal force it needs. I know there are some people who hate Tbot, but I do not know whether they are a supermajority or at least a superminority. Putting the legalistic argument aside, that vote is an indication, imperfect as it may be, of views of a broader group of people than the limited group appearing on my talk page. The complaints I heard about Tbot were about lack of accuracy; what is discussed in this thread is lack of completeness. --Dan Polansky (talk) 13:49, 16 February 2019 (UTC)Reply

I created Wiktionary:Beer parlour/2019/February#Stub entries and minimum required content. --Dan Polansky (talk) 19:11, 16 February 2019 (UTC)Reply

Dan, I apologize for the tone of my comments. When I wrote them I was frustrated with your actions and it led to me saying things that I shouldn't have said. I will respond further on the beer parlour page. Benwing2 (talk) 19:21, 17 February 2019 (UTC)Reply

I appreciate the apology; I know too many people who never apologize for anything and never admit any mistake. Let us continue the discussion in Beer parlour. Here only a brief motto to help memory: Make yolk and hub and skip all fluff. --Dan Polansky (talk) 07:38, 23 February 2019 (UTC)Reply

Definitionless entries in the Russian Wiktionary edit

Latest comment: 5 years ago4 comments2 people in discussion

Some years ago, there was a Beer parlour discussion about volume creation of definitionless entries in the English Wiktionary. There was no consensus in either direction, from what I remember. I must have mentioned the Russian Wiktionary as an example of a Wiktionary which has too many definitionless entries. Let's have a look at how many definitionless entries the Russian Wiktionary currently has.

Definitionless entries seem to land in categories named like ru:Категория:Статьи без перевода/cs (172 entries), ru:Категория:Статьи_без_перевода/en (9049 entries), ru:Категория:Статьи_без_перевода/de (8513 entries), ru:Категория:Статьи_без_перевода/fr (14 157 entries), ru:Категория:Статьи_без_перевода/es (807 entries), ru:Категория:Статьи_без_перевода/ru (918 entries), etc. The template placing entries into these categories seems to be ru:Template:Нужен перевод. The search "insource:/Нужен перевод/" in ru wikt yields 54 044 entries; I do not know whether this number may be subject to error, but given the item counts for several large languages, the number seems plausible.

Entry count and page view statistics can be obtained from Report card for Russian Wiktionary, stats.wikimedia.org:

Page views per month are 16,612,080 for 1,002,462 entries as of December 2018. For comparison, en wikt has 185,677,042 page views per month for 5,896,720 entries, and fr wikt has 19,419,107 page views for 3,392,407 entries. Malagasy wikt has 254,755 views per month for 5,466,228 entries, nearly all of which are bot-taken from en wikt and fr wikt. A compact comparison of Wiktionaries from another source can be found at hypestat.com, under "Where do visitors go on this site?", where the Russian Wiktionary appears as second behind English.
Those 1,002,462 entries of the Russian Wiktionary could theoretically include non-lemmas. However, I checked inflected forms of кошка, and ru:кошки is a hard redirect to кошка, and ru:кошек has no entry. A quick look at ru:Категория:Русский язык does not show anything like non-lemma entries. These observations suggest these are in fact lemma entries.

Another statistics can be had from some kind of new v2 tool, Page views per country, stats.wikimedia.org. There we can see the portion of access to the Russian Wiktionary coming from various countries. If we change the view from Map to Table at the right upper corner and if we do the calculation, we can see that 73% of page views come from Russia and Ukraine.

Hightlight: Definitionless entries in the Russian Wiktionary seem to make up 5.4% of all lemmas, as per above.

Speculation: The percentage of about 5% of definitionless entries presents no significant detriment to the usefulness and popularity of the Russian Wiktionary.

--Dan Polansky (talk) 12:38, 3 March 2019 (UTC)Reply

"en wikt has 185,677,042 page views per month for 5,896,720 entries, and fr wikt has 19,419,107 page views for 86,780,431 entries": There seems to be a mistake in the number of entries of fr.wikt. It can't have 90 millions of them. Per utramque cavernam 13:06, 3 March 2019 (UTC)Reply

Indeed, thanks a lot. Replaced with 3,392,407. --Dan Polansky (talk) 13:30, 3 March 2019 (UTC)Reply

To remove doubt about whether these approximately 1 million entries could also include redirects, I now checked ru:Special:Statistics, which shows there are 1 009 655 content pages and 1 452 575 all pages, including redirects. A further check: The search for "Значение", which appears in headings in the sense of "meaning", shows 1 006 027 entries. --Dan Polansky (talk) 15:56, 3 March 2019 (UTC)Reply

Definitionless entries in the Serbo-Croatian Wiktionary edit

Latest comment: 5 years ago2 comments1 person in discussion

Above, I deal with definitionless entries in the Russian Wiktionary, to investigate the value and impact of definitionless entries. Serbo-Croatian Wiktionary is relevant for that kind of investigation since it has really many definitionless entries: sh:Kategorija:Riječi bez definicije has 84 720 entries. An example entry is sh:adjekcija; it has pronunciation, hyphenation, gender, inflection table and a good further reading link, but no definition.

Per new v2 tool, Page views per country, stats.wikimedia.org, Serbo-Croatian Wiktionary had 64 000 page views in February from Serbia, Croatia and Bosnia and Herzegovina. By contrast, the same statistics for the English Wiktionary yields 626 000 page views in February from Serbia, Croatia and Bosnia and Herzegovina. Admittedly, the English Wiktionary has much more content in other languages than Serbo-Croatian so the two numbers are not directly comparable.

To get a bit more comparable number, let's consider the Czech Wiktionary. Per new v2 tool, Page views per country, the Czech Wiktionary had 960 000 page views in February from the Czech Republic. The Czech Wiktionary has almost no definitionless entries, as far as I know. Czech entries there have very good coverage of pronunciation and inflection since some editors are really passionate about it, but in that regard, the Czech Wiktionary does not seem to differ from the Serbo-Croatian Wiktionary. The Czech Wiktionary has rather small coverage of non-Czech languages, especially when compared to the English Wiktionary. To account for inhabitant number, let us note that there are around 10 600 000 people in the Czech Republic, while Serbia has around 7 000 000, Croatia has around 4 000 000, and Bosnia and Herzegovina has and around 3 500 000, all per Wikipedia. To account for the total number of entries: The Czech Wiktionary has 109 407 content pages per cs:Special:Statistics, many of which are inflected form entries; the search insource:/Kategorie:Tvary/ yields 25 947 entries. The Serbo-Croatian Wiktionary has 911 552 content pages per sh:Special:Statistics, many of which are inflected form entries since sh:Kategorija:Srpskohrvatski flektirani oblici has 746 984 entries; lemma entries seem to be in sh:Kategorija:Srpskohrvatski indeks, which has 137 030 entries.

I would argue that the above numbers suggest that people from Serbia, Croatia and Bosnia and Herzegovina are for the most part not interested in definitionless entries for the language they speak; pronunciation and inflection does not make up for the missing semantics. The kind reader can do their own analysis from the provided data sources or other sources.

--Dan Polansky (talk) 18:00, 3 March 2019 (UTC)Reply

The above picture lacks the Serbian Wiktionary and the Croatian Wiktionary. The Serbian Wiktionary had 321 000 page views from Serbia in February 2019 per new v2 tool, Page views per country. The Croatian Wiktionary had 85 000 page views from Croatia in February 2019 per [1]; it had 30 000 page views from Bosnia and Herzegovina. This compares to 23 000 page views of the Serbo-Croatian Wiktionary from Croatia, and 15 000 from Bosnia and Herzegovina. Croatian lemmas in the Croatian Wiktionary appear to be in hr:Kategorija:hrvatski (indeks), which has 8021 items; hr:Kategorija:engleski (indeks) has 2171 items and hr:Kategorija:srpski (indeks) has 518 entries in Latin script. sr:Категорија:Српска именица (Serbian nouns) has 81 806 items.

As per above, if we focus on Croatia and Bosnia and Herzegovina (Serbia uses a different script), it seems that 8 000 Croatian lemmas in the Croatian Wiktionary (^) produced more page views than all the 137 030 Latin-script Serbo-Croatian lemmas in the Serbo-Croatian Wiktionary (^), of which 84 720 are definitionless (^). This could be explained by users not being interested in definitionless entries, but also by users refusing to consult a resource that they consider to mistreat what they consider to be separate languages; I do not have data to select between the two hypotheses, and other hypotheses could be possible. It is rather unclear what is going on since if we disregard the definitionless entries, there remain 52 310 Serbo-Croatian lemmas that one might think have some definitions; it could be that they are deficient or uninteresting in some way. --Dan Polansky (talk) 11:25, 8 March 2019 (UTC)Reply

Using a see-technique in Czech entries to create synonym rings edit

Latest comment: 4 years ago10 comments2 people in discussion

For nearly 5 years, I have been using a see-technique in Czech entries to create synonym rings. The technique consists in placing the synonym ring (a list of synonyms) in one entry while letting other members of the ring point to that entry from their synonym sections rather than duplicating that same ring in them and creating a considerable duplication and the associated maintenance overhead. Admittedly, the problem the technique solves is also solved by Wiktionary thesaurus. The technique proved useful, but it is now opposed at Wiktionary:Beer parlour/2019/May#Using arrows in synonyms.

The use of the technique can be seen in the synonym ring tlusťoch and an entry linking to it, tlustoprd, which I created on 27 June 2014. The technique is traditional as far as I can see and should be well understood by users of various reference works.

Notes on issues raised or claims made:

The see-technique is bad. Response: I don't think so. It is good in solving a problem. It may be inferior to some other technique, but that does not make it bad. In my view, the technique is superior to the fairly widespread duplication that exists in Wiktionary synonym sections today.
The see-technique confuses users since there is also the thesaurus. Response: I don't think so. The technique is more general than the thesaurus and should be well understood. Since for Czech the thesaurus is almost never used, the users of Czech entries should not be confused. Even if thesaurus were used, an entry whose synonym section says "See tlusťoch" should be perfectly understood.
What happens if someone moves a synonym ring to the thesaurus? Response: That has not been happening for Czech entries, but if it starts happening, the mover should update the ring members to link to the new thesaurus entry. If the mover forgets to do that, the state is not too bad: e.g. in tlustoprd, the user first has to move to tlusťoch, and then navigate to the thesaurus from there; while not ideal, it is still usable and is a result of omission that can be amended.
The technique makes it hard for users to know where to look for synonyms. Response: I don't think so. The entry that is a member of the synonym ring points to the location of synonyms: the synonyms are there where the entry points.
Polysemy of entries makes it hard for the users to find the synonyms in the synonym ring entry. Response: It is usually possible to choose as the host of the synonym ring an entry with one or only few senses. If this becomes a problem (it has not become one in Czech entries so far), we may start using the thesaurus for Czech. Furthermore, even in an entry with many senses, a synonym ring would probably be reasonably conspicuous by its length and by being a list of items.

Compared to the thesaurus, the technique has the advantage of being simpler. No new separate heading format has to be learned. One does not need to keep thesaurus principles or practices in mind. That said, I don't oppose the thesaurus; rather, years ago, I spent significant effort to bring the thesaurus to the sane state in which it is today.

Admittedly, parallel practices are not always a good thing. But they are not always a bad thing either. The current controversy would probably not arise if not for an introduction of a parallel practice via Wiktionary:Votes/pl-2018-11/Allow semantic relations under definition lines.

The discussed Czech synonym ring members can mostly be found like this: insource:/Synonyms====. *\* *See also \[\[[^T]/. The search finds 448 items.

The technique has proved useful in the Czech entries, allowing me to achieve an interesting initial coverage of a Czech synonym dictionary in the English Wiktionary. Readers of Czech content have benefited. I hope I will be able to continue using the technique in some form, to continue delivering and expanding the Czech synonym dictionary; I hope editors will not forbid this practice.

--Dan Polansky (talk) 14:32, 6 May 2019 (UTC)Reply

How about the following situation (bear with me): Term A is synonymous with terms B,C,D; term B is synonymous with terms A,C,D; terms C is synonymous with terms A,B,D and E; term D is synonymous with terms A, B, C and E; term E is synonynmous with B, C, D but not so much with A. term A term B term C term D term E If it's a network rather than a ring. Ketiga123 (talk) 17:28, 28 June 2019 (UTC)Reply

Do you have a specific example in mind? Synonymy on sense level tends to be an equivalence relation; this not so on term level, of course, where synonymy is not transitive because of there being multiple meanings per term. In situations where synonymy misbehaves, the see-technique should probably be avoided, but then, the thesaurus would not help either, I think. In the situations where I have used the see-technique, I did not notice any problems. --Dan Polansky (talk) 17:36, 28 June 2019 (UTC)Reply

Like, all of the row below are in a sense synonymous with each other, but not really. I would say, any two adjacent terms are closely synonymous, but I am not sure that the ends are very synonymous. So by clicking on new synonyms, you would wander further away from the source term. It's like a scale, or a network. Ketiga123 (talk) 18:46, 28 June 2019 (UTC)Reply

желанный (želannyj) ↔ привлекательный (privlekatelʹnyj) ↔ возбуждающий (vozbuždajuščij) ↔ сексуальный (seksualʹnyj) ↔ секси (seksi) ↔ сексапильный (seksapilʹnyj) ↔ эротичный (erotičnyj) ↔ заводящий (zavodjaščij) ↔ распаляющий (raspaljajuščij) ↔ пикантный (pikantnyj) ↔ апетитный (apetitnyj)

Thank you. These are not what I would call synonyms; some of them are near-synonyms at best. Like attractive is not a synonym of stimulating or provocative, and these again are not synonyms of erotic or sexy. Near synonymy is not an equivalence relation. For interest, there are Thesaurus:attractive, Thesaurus:exciting, and Thesaurus:sexy. That's not to say that near-synonyms are not interesting; I usually put them to See also section. --Dan Polansky (talk) 19:09, 28 June 2019 (UTC)Reply

Right, I've used 'Related terms' for that end, as in götvərən, but I guess 'See also' is better. Ketiga123 (talk) 19:26, 28 June 2019 (UTC)Reply

Related terms is for etymologically related terms per WT:EL#Related terms. I still occasionally find an entry that I created a decade ago and that gets this wrong. --Dan Polansky (talk) 19:29, 28 June 2019 (UTC)Reply

Since you are an expert of "inscourse:", how can I translate the following into a search query? Finding all "Related terms====" followed by a new line and "* {{l|az"? Ketiga123 (talk) 12:01, 30 June 2019 (UTC)Reply

@Ketiga123: The following seems to find what you are looking for: insource:/Related terms====.\* \{\{l\|az/. The regular expression used does not expressly pick a newline but rather covers it via ".", but it should not really matter for your purposes, I guess; I tried to use \n but it did not work. --Dan Polansky (talk) 20:38, 4 July 2019 (UTC)Reply

Thank you very much! Ketiga123 (talk) 21:17, 4 July 2019 (UTC)Reply

Custom text insertion shortcuts using JavaScript edit

Latest comment: 4 years ago1 comment1 person in discussion

I used to use bookmarklets (JavaScript in browser bookmarks) to insert template-like blocks of text into the wikitext but I no longer do. Since recently, I am using a JavaScript-based shortcut replacement technique. For instance, I type "_ncsn_" into the edit box, press F8, and the typed shortcut gets replaced with the text of my choice, in this case generic text for a new Czech noun. This has no side effects since underscore is not used much in template names, and therefore, inadvertent replacements of other parts of the wikitext are unlikely. The texts to be inserted are all at a single page with a single revision history, in my personal space: User:Dan Polansky/common.js. The shortcuts are to be nearly like keyboard shortcuts; I have _rt_ for ===Related terms===, _pcs_ for Czech pronunciation, _frcs_ for Further reading for Czech, etc.

This is all largely browser independent; there is no need to carry bookmarklets from a browser to browser. And you do not get anything like history with bookmarklets.

The implementation at User:Dan Polansky/common.js is simple, consisting of function expandShortcuts, function addKeyListener and the call "$( addKeyListener );". It seems so simple that it might not be worth mentioning, but it is such a neat trick, and obviously, I spent years on wiki without realizing I should use the trick.

A non-programmer should find it reasonably easy to expand and customize; for instance, the row ".replace(/_e_/g, '===Etymology===')" lets _e_ be expanded to ===Etymology===.

The user interface is very minimalistic: there is no menu and no GUI. The functionality is accessible by pressing F8, and that is the only entry point to the functionality; no other key is used.

A related reading is User:Equinox/How to be fast.

--Dan Polansky (talk) 08:54, 5 July 2019 (UTC)Reply

Thank you for creating R:SWO edit

Latest comment: 4 years ago3 comments2 people in discussion

thanks for creating {{R:SWO}}, but i don't think it should be added in cases where the Oxford definition doesn't match with any of our wiktionary definitions, like on ka#Swahili. i am also a little concerned that it may not add enough extra information, being that wiktionary is descriptive (so we can't use it for definitions) and Oxford doesn't have etymological or class information to cite. the definitions are admittedly helpful to "cheat" with before looking for attestations though. --Habst (talk) 02:19, 9 July 2019 (UTC)Reply

I placed the R:SWO template to "Further reading" sections rather than "References" sections, and therefore, the template is shown to serve as a further reading, not to substantiate anything. I am not sure you should have removed the further reading from ka entry even if the further reading has something different from Wiktionary entry: with the further reading link, the reader has it easy to follow the link and see that the Wiktionary content is not covered by that source; without the further reading, the reader is on their own. That said, I wonder whether ka should be sent to WT:RFV to ask to attesting quotations, provided you are right that the current content is not covered in the Oxford dictionary (I have not checked).

As to a further reading lacking some class of information such as etymology, that is rather usual for further reading items, and I don't see why that should be a problem. The further reading link makes no claim that it substantiates all information in the entry. This notion was reinforced by renaming "External links", the former title of the section, to "Further reading" via a vote. The readers and editors can use the further reading in a variety of ways: a reader may want to double check rather than relying on a single source, Wiktionary; a reader may want to find more senses than covered by Wiktionary; an editor may want to double check definitions and if in doubt make more research in other sources or send a sense to RFV. The further reading is a generic tool.

As for attestation, Swahili is listed at Wiktionary:Criteria for inclusion/Well documented languages, and therefore indeed requires quotations of actual use for verification. But hypothesis verification is distinct from hypothesis generation; one can use any source one wants for hypothesis generation, including dictionaries, baring copyright infringement. Even informal verification process can involve dictionaries, such as when an editor thinks: it's in the dictionary so I'll leave it be now, letting someone else question the entry or try to dig the quotations of use.

Further reading is a widespread practice in the English Wiktionary. German entries usually link to Duden, Polish entries usually link to PWN; etc. The search insource:/\{\{R:Duden/ shows that Duden is in 32,437 entries; insource:/\{\{R:PWN/ shows PWN at 18,678 entries. I am a big fan of further reading, and if I had a magic wand, I would let the following happen: At least one relevant further reading link to a monolingual dictionary to every entry, as far as possible; abracadabra, and ... oops, magic_wand.py: service unavailable. --Dan Polansky (talk) 07:36, 9 July 2019 (UTC)Reply

okay, that is fair and i see your point.

i've created mediawiki bots before, and i think your "magic wand" idea would be pretty easily achievable by a bot, if the only condition to add is to check for presence of the target word in Oxford. this gives me an idea, so i'll experiment with that when i have the time. --Habst (talk) 14:52, 9 July 2019 (UTC)Reply

Oxford dictionaries for many languages edit

Latest comment: 4 years ago2 comments1 person in discussion

There used to be Oxford dictionaries at oxforddictionaries.com, published by Oxford University Press, not to be confused with the Oxford English Dictionary (OED). The English and Spanish ones are now at lexico.com, it seems:

Lexico English, lexico.com/en - English definitions
Lexico Spanish, lexico.com/es - Spanish definitions; Spanish to English; English to Spanish

And there is a template: {{R:Lexico}}.

Many other dictionaries are available online from Oxford University Press. Per WP, the "collection includes bilingual dictionaries between English and a variety of languages (Greek, Gujarati, Hindi, Indonesian, Latvian, Malay, Northern Sotho, Romanian, Southern Quechua, Spanish, Swahili, Tajik, Tamil, Tatar, Telugu, Tok Pisin, Tswana, Turkmen, Urdu, Xhosa, and Zulu) known as Oxford Living Dictionaries, which are provided free of charge, as well as Oxford Dictionaries Premium, a subscription service."

Selected Oxford Living Dictionaries, often English ↔ Language, Oxford University Press:

Greek · Gujarati · Hindi · Indonesian · isiZulu · Latvian · Malay · Northern Sotho · Romanian · Setswana · Southern Quechua · Swahili · Tajik · Tamil · Tatar · Telugu · Tok Pisin · Turkmen · Urdu

Alternatively, you can navigate to a dictionary for a particular language by going to https://languages.oup.com/, then clicking on the three horizontal lines at the top left, and then making a choice; in a different view of the same site, there is "Select Your Dictionary" at the top right.

Berlín edit

Latest comment: 4 years ago1 comment1 person in discussion

At Berlin a declenion table is missing. I wonder why one says do Berlína and not do Berlínu (like do Londýnu, do Madridu)

https://en.wiktionary.org/wiki/Berl%C3%ADn#Czech

— This unsigned comment was added by Rasmusklump (talk • contribs) at 12:22, 27 July 2019.

I added a declension table to Berlín. You can also find a declension table at cs:Berlín and in IJP:Berlín[2]. The preposition do takes genitive, and indeed, we say "do Berlína", "do Kolína", but also "do Londýna"; I find "do Londýnu" unnatural. You can find some other similar genitives in Rhymes:Czech/iːna, including Rýna, Zlína, Děčína, Jičína, Trenčína, Vsetína, and Hodonína. I know no rule governing the choice of -a (Berlína) vs. -u (Madridu) for genitive; both use the declension pattern hrad. --Dan Polansky (talk) 16:23, 27 July 2019 (UTC)Reply

-cký vs. -ký and their ilk edit

Latest comment: 4 years ago1 comment1 person in discussion

I am investigating a certain conundrum about representation of the suffix in řecký as -cký vs. -ký, and similarly in herecký.

I posted deliberations to Talk:-cký, Talk:-ický, Talk:-ctví, and a link to Talk:-ctvo and Talk:-cko.

I made the tentative decision to include c in the suffix representation, that is, -cký, -cko, -ctví and -ctvo. These complement -ský, -sko, -ství and -stvo.

I noted what to me looks like a "missing link": bangkocký and bangkokský.

I consulted Slovník afixů and other sources. I find having "-ství/-(c)tví" as a headword (as Slovník afixů does) unobjectionable, subsuming both lidství and herectví in one macrosuffix, as it were. By contrast, I find dividing řecký and polský to two macrosuffixes hard to understand, namely Slovník afixů "-ký/-oký/-eký/-iký/-ec-ký/-(n)ic-ký"[3] and "-ský/-ký, -ov-ský, -en-/-ěn-ský, -án-ský, -in-ský, -ín-ský"[4]. Since, even if we analyze řecký as Řek + -ký (as I currently don't), it should still be part of the same macrosuffix as -ský, contrary to what Slovník afixů does.

I noted related entries in other Slavic languages: Russian -цкий and Polish -cki.

--Dan Polansky (talk) 13:10, 11 September 2019 (UTC)Reply

How many edit

Latest comment: 4 years ago2 comments2 people in discussion

Hi Dan, always (ok, not always, but pretty much often) when I look up who made the article about the Czech word in the English Wikipedia, it is you - and they are often interesing entries! Can you tell me how many Czech entries have you added into the en/cs(/...?) Wikipedia so far? :) Thank you and go on, please! --Jiří Janíček (talk) 17:53, 6 November 2019 (UTC), PragueReply

Thank you very much for your encouraging words. According to sigma - created pages report for Dan Polansky, tools.wmflabs.org, I created 25,351 entries in the English Wiktionary. As you can guess from the link, the overwhelming majority of those are Czech entries. According to Xtools - created pages report for Dan Polansky, tools.wmflabs.org, I created 25,368 entries in the mainspace of the English Wiktionary; I also created a substantive volume of the Thesaurus pages and Rhyme pages. The English Wiktionary Category:Czech lemmas has 32,296 entries. Mějte se hezky. --Dan Polansky (talk) 10:15, 13 December 2019 (UTC)Reply

Pink ping edit

Latest comment: 4 years ago1 comment1 person in discussion

meta:Steward_requests/Bot status#JAnDbot --Ernest Escobar Mancuso (talk) 17:19, 7 November 2019 (UTC)Reply

Add topic