Open main menu

Wiktionary β

User talk:Angr


Hi, was reading the page on Ancient Greek in Wiktionary and saw your name at the bottom. Maybe you can help me or point me to the right contact. I am not sure how to deal with some grc entries in Wiktionary. There are terms that differ only by a diacritic. See for example σέλῑνον (sélīnon, celery) in parsley and σέλινον (sélinon) in sedano. As I am extracting etymological data from Wiktionary, I would like to collapse those two entries into one entry only. Do you think I can use some kind of general rule, e.g., replace ῑ with ι? Or should I just keep both to represent data in Wiktionary? The link to the tool I am developing is etytree Epantaleo (talk) 15:49, 13 October 2017 (UTC)

As you might have noticed, both links go to the same entry; the macron is just extra information on vowel length that is usually not represented outside of headwords. You can find all the diacritical marks that are stripped for each language by looking at the subpages of Module:languages. —Μετάknowledgediscuss/deeds 15:52, 13 October 2017 (UTC)
(edit conflict) Ancient Greek entries never include the long and short marks, though they may be displayed. Thus the two links you provided above both point to the same page name, σέλινον. The long and short marks are always optional; if you aren't sure, you can leave them out. Other Ancient Greek diacritics, however, are not optional and are part of the page name: the acute and circumflex accents, the rough and smooth breathings, and the diaresis are all required parts of the page name. —Aɴɢʀ (talk) 15:54, 13 October 2017 (UTC)

liu'unta, rei'ittäjäEdit

Apostrophe happens to be the correct sign in the Finnish orthography. We use all sorts of language-specific signs, why not this?--Hekaheka (talk) 19:57, 19 October 2017 (UTC)

I don't think any language distinguishes the typewriter apostrophe ' from the curly apostrophe . The choice between them is purely aesthetic. We have entries for don't, j'ai, Türkiye'yi, δ’ (d’), and so on, and many such entries have hard redirects from the spellings with the curly apostrophe. I don't see any reason to treat Finnish differently. —Aɴɢʀ (talk) 08:28, 20 October 2017 (UTC)

etyl cleanupEdit

If you want, you can also target af (Afrikaans), ht (Haitian Creole), id (Indonesian), ms (Malay), sk (Slovak), sl (Slovene), sq (Albanian), and tpi (Tok Pisin). I have cleaned up some entries in all of these, but there's bound to be more. DonnanZ (talk) 13:19, 20 October 2017 (UTC)

@Donnanz:   DoneAɴɢʀ (talk) 13:49, 20 October 2017 (UTC)
Thanks, I'm hoping other users will get the message... DonnanZ (talk) 14:05, 20 October 2017 (UTC)

Sorry, I forgot to mention Turkish (tr). You can save it for another batch if you want. DonnanZ (talk) 15:38, 20 October 2017 (UTC)

Return to the user page of "Angr".