User talk:-sche/Archive/2015

Schlackenlosigkeit

Latest comment: 9 years ago1 comment1 person in discussion

See WT:RFV#Schlackenlosigkeit. The discussion has advanced beyond my extremely modest knowledge of German and may even need a native speaker. DCDuring TALK 23:01, 13 January 2015 (UTC)Reply

Αγαρηνών et al

Latest comment: 9 years ago1 comment1 person in discussion

The "misused" templates were put there for a purpose - if you want to change any more Greek entries please let me know. — Saltmarsh^{συζήτηση-talk} 11:11, 16 January 2015 (UTC)Reply

gahn

Latest comment: 9 years ago2 comments2 people in discussion

Could you check the codes on this page? Thanks. DTLHS (talk) 22:08, 23 January 2015 (UTC)Reply

Meh. Someone changed the header, but not the codes, from nds-de to plain nds (rather than adding a separate section for the Dutch Low Saxon term). >.> The entry could be band-aided by either changing the header or the codes, but the general disagreement and slow-motion edit-warring about how to handle the various Low German lects makes for so much ugliness that I am losing interest in editing them. - -sche (discuss) 03:59, 24 January 2015 (UTC)Reply

Why the "hmm..."?

Latest comment: 9 years ago8 comments3 people in discussion

I agree that the previously-listed meaning of that was odd, but... what is the meaning of your edit summary? Are you doubtful of something? Or...? Tharthan (talk) 21:52, 24 January 2015 (UTC)Reply

Mostly I was doubting the previously-listed meaning, but I also wonder if the wording I introduced really covers the citations, and/or if there are actually two senses, one used of people, and the other of places (the latter presumably similar to shire#Verb). - -sche (discuss) 23:22, 24 January 2015 (UTC)Reply

I share your doubts. Also, are you sure that parish is a verb? Parished could easily be interpreted as a denominal adjective. DCDuring TALK 23:55, 24 January 2015 (UTC)Reply

The 1972 citation and the second sentence of the 1992 citation seem very verbal to me. I'll see if I can find other inflected forms. - -sche (discuss) 01:41, 25 January 2015 (UTC)Reply

Check out the 1917 and 1991 citations (the latter technically of re-parish). There's also the citation below, which I can't make sense of. - -sche (discuss) 01:49, 25 January 2015 (UTC)Reply

1903, Maxwell Gray, Richard Rosny, page 210:
"You will take pleasure in parishing. Mother used to parish."

"How do you know I like parishing?"

"Your uncle said so."

"Oh! did he?"

"And you may like the rectory people; it's a fine old house, and often full of visitors."

after e/c

I'm not hostile to the verb view for the sense, just uncertain. I've looked for the parishing form, but just found it with certainty for what is now a new intransitive sense, for a distinct etymology of parish#Etymology 2 ("perish"), and for a noun sense. I may just have a block for the verb sense. There was a book title that seemed to be the sense I've been doubting.

The citation above is of the definition I added: "To visit residents of a parish". It's used of parish priests and also of women doing socializing possibly under color of visiting the sick, aged, shut-ins etc. DCDuring TALK 01:57, 25 January 2015 (UTC)Reply

OK, I'll add it to that sense, which is now well-cited. - -sche (discuss) 01:59, 25 January 2015 (UTC)Reply

The 1917 cite is syntactically though not semantically intransitive. The "re-parishing" cite is helpful. It's tough with a word that shows up so uncommonly in what are to me somewhat alien contexts. The word is certainly used with a meaning that is at least nearly verbal. I doubt anyone would challenge it on the same grounds such as my doubts. DCDuring TALK 02:08, 25 January 2015 (UTC)Reply

Distribution

Latest comment: 9 years ago3 comments2 people in discussion

As was revealed in a discussion that I had previously with Dbfirs, it seems the distribution of /ɛəɹ/ and /æɹ/ words differs between British English and dialects of North American English that do not possess the merry, Mary, marry merger.

Particularly...

"vary" is often /væɹi/ in non-merry,Mary,marry merger dialects (though, I will admit, its traditional /vɛəɹi/ pronunciation is still heard amongst the older generation. My mother, for instance, uses /vɛəɹi/, whilst my father and I use /væɹi/ [as does much of the younger generation]. Similarly, parent for myself, my family, and most of my peers is /ˈpæɹənt/, whilst /pɛəɹənt/ is the pronunciation I have heard in church and by some others. It seems to be about a 50-50 distribution.

In conclusion, some words that have a traditional /ɛəɹ/ in British English and old fashioned North American English seem to have shifted to /æɹ/ in the younger generations.

Do you (or anyone else visiting your talk page) have any idea as to why this might be? Tharthan (talk) 16:34, 25 January 2015 (UTC)Reply

Generic phonetic simplification? Influence from GenAm, where the sounds aren't distinguished? I don't know. North American English regional phonology#New_England says "Western New England [... and] Connecticut and western Massachusetts in particular show the same general phonological system as the Inland North, and some speakers show a general tendency in the direction of the Northern Cities Vowel Shift—for instance, an /æ/ that is somewhat higher and tenser than average[.]" The phoneme that's next higher than /æ/ is /ɛ/. You're describing things going in the opposite direction, but I can imagine how a reduction in the contrast between the two sound in non-Mary-merging dialects, combined with an outright merger of the sounds in the surrounding dialects, could lead people who tried to maintain a distinction between the words (Mary, marry, merry) to use a new / un-original sound to do so. In English, I've heard people maintain the pen/pin distinction backwards, and in German people mix up [ɛː] and [eː] if they try to maintain a distinction between them. - -sche (discuss) 21:45, 25 January 2015 (UTC)Reply

Hmm... it seems to me to be more of a specific hypercorrection than anything else, though, because other words besides the previous two seem to retain their correct pronunciations. I dunno. I just hope that we don't have another Great Vowel Shift or anything like that any time soon, because that seems to be the direction being headed towards. Tharthan (talk) 21:55, 25 January 2015 (UTC)Reply

sch

Latest comment: 9 years ago3 comments2 people in discussion

Hi there. I wanted to ask you about the [phonetic] transcription of the German /phonem/ /ʃ/. Should it be [ʃʷ] because of the lip rounding, or should we not use [ʷ] just as we've decided not to use [ʰ]? I personally would be in favour of [ʃʷ] because unlike aspiration there seems to be little regional/idiolectal variation and, even more importantly, there would be no wondering when and when not to use it since /ʃ/ would just always become [ʃʷ]... But I don't know. What do you think?Kolmiel (talk) 17:41, 25 January 2015 (UTC)Reply

I would treat it like aspiration, and so I wouldn't use it. I note that de.Wikt, which only uses narrow transcriptions, doesn't use [ʷ]. You could ask on WT:T:ADE, though. This is not entirely here or there, but ... people occasionally propose "diaphonemes" around here (ultra-broad transcription); this seems like the opposite, ultra-narrow transcription. Perhaps one day we'll start adding both and have a sequence of //ultra-broad//, /broad/, [narrow] and [[ultra-narrow]] transcriptions. - -sche (discuss) 21:53, 25 January 2015 (UTC)Reply

No it's fine, just wanted to check if you were in favour of using it. It's not that important I guess, and it's not a "Herzensangelegenheit" of mine.

I just think we shouldn't base our decision on the German wiktionary. Their transcriptions aren't narrow, they're just given between squared brackets because most traditional dictionaries do that. They would be very wrong if understood literally, especially things like [pakn̩] which don't exist in the German language and which I suspect might be almost physically impossible to the human mouth.Kolmiel (talk) 21:48, 27 January 2015 (UTC)Reply

The names= field in the data modules

Latest comment: 9 years ago7 comments3 people in discussion

I'm looking at changing this now, and I already made a few initial modifications. But I'd like to confirm just what the plan was again. If I remember correctly, the idea was to split it into three fields:

canonicalName
otherNames
Some field for the things that are subsumed under this name, but are not just alternative names.

I'm not sure what to call that third field, though, so do you have suggestions? Also, what should be done in ambiguous cases where there is no agreement whether something should be classified a subvariety or not? Perhaps, I could only split off number 1 for now, leaving 2 and 3 together until we sort that out more completely. —CodeCa t 22:21, 25 January 2015 (UTC)Reply

Oh, great! :)

Perhaps the third field could be called "varieties" or "varietyNames"?

I assume that when you say "no agreement whether something should be classified a subvariety or not", the alternative to classifying it as a subvariety is classifying it as an alternative name for the whole language. (If there's disagreement about whether or not something is a dialect of one language or a separate language, that's a question we're going to settle at an earlier stage, namely the stage of granting it a code or not, before we ever get to any of these names fields. Right?) There are cases where certain names refer both to dialects and to the whole language; in the earlier discussion I suggested that in such cases we could either (1) list the name in both places, or (2) decide that anything listed in a higher field will not be repeated in a lower field (so, anything listed in "otherNames" will not be repeated in "varietyNames"). - -sche (discuss) 22:37, 25 January 2015 (UTC)Reply

The question is mostly relevant to reconstructed languages, at least in the way I intended it. Proto-Uralic for example has Proto-Finno-Ugric as a subvariety, but some linguists contend that they are one and the same. Austronesian is often considered synonymous with Mon-Khmer (both share a Wikipedia article too). And there are probably similar situations for other languages.

I'm not sure if "varieties" is clear enough. I would like to have "sub" in the name so that it's clear in what way it's distinct from "otherNames". So "subvarieties"? I've also seen "sublects" used by some people. —CodeCa t 22:53, 25 January 2015 (UTC)Reply

Well, I would handle proto-language cases the same as other cases, either always list such names in both fields, or decide one field always has priority. The first approach might more accurately convey that some authorities use _(whatever)_ as an alt name for the whole language and other authorities use it as the name of a "dialect", and keeps us from having to pick which field to list the name in. If we went with the second approach, my gut reaction would be to "prioritize" the "higher" field, and so list "Proto-Finno-Ugric" as an alternative to "Proto-Uralic" and not list it as a dialect.

As for the name: well, how about "subvarieties"/"subvarietyNames"? All but one of the hits of google books:"sublect" OR "sublects" are scannos of "subject". Or perhaps something like "subsumedVarieties", to convey that the main purpose is to list cases where ISO-code-having subvarieties have been subsumed, rather than e.g. to start listing every non-code-having dialect of English. - -sche (discuss) 23:22, 25 January 2015 (UTC)Reply

(edit conflict) Of the two, I like "sublects"- it sounds more neutral. Actually, it's the "sub" part that makes me nervous. Except in the case of pluricentric languages, we don't explicitly mention the standard lect at all, which is every bit as much a sublect as all the things we call the sublects. More often than not, the only difference between the "standard" and the "sublects" is an accident of history: In Old English, for instance, the Wessex dialect is generally treated as standard, but eventually the East Midlands dialect took its place. That means a sublect became the standard and the standard became a sublect. In reality, though, they're still just two sublects, with the main difference being that the standard sublect tends to influence and crowd out the other sublects.

Of course, it would look funny to include "Standard xyz" in the list of sublects, so I guess we're stuck with the current arrangement. Still, I wonder if there's a way to distinguish the language as a whole from its sublects without implying that only those lects different from the standard are sublects.Chuck Entz (talk) 00:08, 26 January 2015 (UTC)Reply

This raises the question of what we want to list in sub[variety/lect] field. Initially, when subvariety names were included in languages' lists of alt names, it was because the named subvarieties had previously been considered languages (generally by the ISO, but in some cases merely by us via granted and then revoked exceptional codes); the subvariety names were listed so that people who thought they were languages would know where they went.

However, I can see how we might find it useful to make comprehensive lists of languages' dialects (including dialects when have never been considered own languages); such lists could in some far-future version of Wiktionary be meshed with the context labels so that entries could be put in cleanup categories if they were categorized as belonging to another language's dialect, for instance.

I'd still use "subvarieties" for the name since "sublects" doesn't appear to be a word; even the Google Scholar hits are scannos for "subjects", lol. - -sche (discuss) 19:54, 26 January 2015 (UTC)Reply

I think it would be a good idea to make a list of dialects. But it would be very hard to manage because there are so many, and there will always be a need to specify a particular variety that is more fine-grained than any we've defined so far. So if we want to add something like that, we would have to take the possibility of unrecognised dialects into account, like the label template already does. —CodeCa t 20:05, 26 January 2015 (UTC)Reply

surbasser

Latest comment: 9 years ago3 comments2 people in discussion

Hi! If this is a real French verb, could you define it? If it's not a real verb, I'll need to delete all the inflected forms someone created for it (Special:WhatLinksHere/surbasser). - -sche (discuss) 09:20, 27 January 2015 (UTC)Reply

Most often, it's a typo for surpasser or surbaisser. However: 1. it seems that, in architecture, surbassé has been used as well as surbaissé (but I cannot find citations clearly showing that it was used as a verb). 2. I also find surbassé used for music, and very few uses clearly using a verb surbasser (try to Google "il surbasse" and "qui surbasse"). I think I can guess the sense (make music overbassed), but I'm not a specialist. Lmaltier (talk) 21:36, 27 January 2015 (UTC)Reply

I see. Thanks for checking! - -sche (discuss) 21:42, 27 January 2015 (UTC)Reply

Flood flag

Latest comment: 9 years ago4 comments2 people in discussion

Hi, could you give me the flood flag for about 20 minutes, please? --Type56op9 (talk) 18:41, 28 January 2015 (UTC)Reply

Nah, you're not supposed to be operating a bot. - -sche (discuss) 19:36, 28 January 2015 (UTC)Reply

Actually, it's not a bot. It is WT:ACCEL, which looks like a bot. --Type56op9 (talk) 11:40, 29 January 2015 (UTC)Reply

Fair enough. I just went through and patrolled your latest batch. - -sche (discuss) 20:01, 29 January 2015 (UTC)Reply

Proto-Ta-Arawakan

Latest comment: 9 years ago10 comments3 people in discussion

Hi, could you create a language module for Proto-Ta-Arawakan as well? --Victar (talk) 19:17, 30 January 2015 (UTC)Reply

I've created a family code for Ta-Arawakan, "awd-taa". However, neither "Proto-Ta-Arawakan" nor "Proto-Ta-Arawak", nor "Proto-Ta-Maipurean", "Proto-Ta-Maipuran", or any of the other alt names I tried gets any Google Books or Scholar hits, or even raw web hits. Are you sure it's a valid proto-language? - -sche (discuss) 19:52, 30 January 2015 (UTC)Reply

Thanks. Yeah, what happens is it usually just gets called Proto-Arawak. Incidentally, Arawak is also a language within Ta-Arawak, otherwise known as Lokono. It's all very convoluted, but consequentially I have these reconstructions that shouldn't be called Proto-Arawakan since they aren't attested outside of Ta-Arawak, ex. *hayaba. --Victar (talk) 22:17, 30 January 2015 (UTC)Reply

I've also seen it awkwardly called "proto-Caribbean Northern Arawak". --Victar (talk) 23:07, 30 January 2015 (UTC)Reply

OK, thanks for the clarification. In general, I would say "meh, if someone wants to create entries for such-and-such proto-language that existed, go for it". However, User:Tropylium has recently been arguing against creating separate codes and appendices for cases where things are reconstructible only to certain dialects of proto-languages, and if other linguistic works just treat Proto-Ta-Arawak as Proto-Arawak (and AFAICT never mention or confirm the existence of Proto-Ta-Arawak at all), that does make me question if we really need a code for it. Tropylium, do you have an opinion on this? - -sche (discuss) 03:48, 31 January 2015 (UTC)Reply

Looking at Wikipedia's classification, it seems that Ta-Arawakan is a fairly deep subgroup within the wider Arawakan family, and accepted by each of the three otherwise very different classification schemes. Sounds like good enough grounds for separate treatment. Cleanup will still be possible later, if it turns out that there exists a better way to define a subgroup comprising these languages (but AFAIK Arawakan is not one of those families where a micro-detailed family tree is known yet). --Tropylium (talk) 04:07, 31 January 2015 (UTC)Reply

OK, I have created "Proto-Ta-Arawakan" with the code "awd-taa-pro". - -sche (discuss) 04:36, 31 January 2015 (UTC)Reply

Thanks to you both! Yeah, the whole Arawak tree is outdated, based on paper from 1991. I'm working on a draft for a new version based on various published works, w:User:Victar/Template:Arawakan languages. --Victar (talk) 17:37, 31 January 2015 (UTC)Reply

I wonder where the heck the D came from in awd and in Taíno the Q in tnq? I think the Arawak languages just got the bottom of the barrel. If I had some say, I would rename Arawak to lcn for Lokono/Locono and use arw for the language family. --Victar (talk) 01:27, 31 January 2015 (UTC)Reply

Yeah, some forethought would have done the ISO good. Particularly strange are the cases where languages which have three-letter names have been given codes that aren't those three letters, e.g. Abu is ado (while abu is Abure), Col is liw, and so on. - -sche (discuss) 03:49, 31 January 2015 (UTC)Reply

sınalgı

Latest comment: 7 years ago4 comments2 people in discussion

"sınalgı" was deleted but they (88.XXX.XXX.XXX) added again! --123snake45 (talk) 02:13, 1 February 2015 (UTC)Reply

CodeCat has deleted it. The IP seems to be correct that there are citations of the word on Usenet now, but there are only two of them, and they're from only a few months apart; the word would need three citations spanning over a year to meet WT:CFI. - -sche (discuss) 02:22, 1 February 2015 (UTC)Reply

The author (Arslan Tekin) says: "Look at it, it is using sınalgı for television and ünalgı for telephone at Kyrgyzstan"

So, it is Kyrgyz. It isn't Turkish. --123snake45 (talk) 03:00, 1 February 2015 (UTC)Reply

Spammers came again. Theirs made-up/fake words (sınalgı, çimerlik, estelik...) have to delete. --123snake45 (talk) 21:53, 27 May 2017 (UTC)Reply

düşerge

Latest comment: 9 years ago2 comments2 people in discussion

Can you take a look at rfv page? I've added the citations with Azerbaijani adaptations so you may compare them. --85.103.244.86 17:13, 3 February 2015 (UTC)Reply

I invited three Turkish-speaking users to take a look at the citations. One of them, User:Dijan, is the one who said the previous citations were Azeri. The Azeri versions you've provided do look consistent with Dijan's comment that "every single one of them is a Turkish rendition of the Azeri language (literature and poetry) that was not translated into Turkish", but I will wait for the other users to comment. I'm at a disadvantage here because I (and more other Wiktionarians) don't speak Turkish or Azeri, and it's clear there are people with axes to grind on both sides of this issue — in some cases it seems pretty clear that people have made up words that aren't actually in use, and in other cases people seem to be refusing to believe words that seem real (e.g. Citations:haydamak, where it looks like other print dictionaries are confirming that the citations are using haydamak to mean "drive"). - -sche (discuss) 17:26, 3 February 2015 (UTC)Reply

Maori

Latest comment: 9 years ago10 comments3 people in discussion

I don't agree that languages are proper nouns, but if that is Wiktionary policy, I'm not going to upset the apple cart, but just let you know that not everyone agrees. Donnanz (talk) 17:51, 3 February 2015 (UTC)Reply

I don't know that there's a policy, but it certainly seems to be common practice; all the other language names I can think of are currently categorized as proper nouns: Portuguese, Spanish, Basque, French, English, Dutch, German, Danish, Norwegian, Chinese, Navajo, etc. However, there has been some discussion in the past about how some of the things that are commonly categorized as proper nouns, such as personal names, fail to meet some of the usual tests of proper-noun-ness (names are countable; "there are two Johns in my class"). You could bring the matter up in the BP and see what others think. Languages do seem to meet more tests of proper-noun-ness than personal names, though (and there wasn't even consensus to stop treating names as proper nouns). - -sche (discuss) 18:09, 3 February 2015 (UTC)Reply

Hmm, OK, I'll think about the Beer Parlour. I would categorise names such as Gertrude, the Houses of Parliament, the White House, and the Black Sea as proper nouns, and surnames of course, and stop there. But as you point out there can be a problem with people's names; the Browns and the Joneses spring to mind. Also place names, two Bristols, two Birminghams, two Londons (maybe more), but place names and people's names are really proper nouns despite that. Donnanz (talk) 18:39, 3 February 2015 (UTC)Reply

If you are thinking about the matter, consider that taxa are considered proper nouns, because they are names of individual natural kinds (old-style Linnaean taxonomy) or lineages. This is somewhat similar to the Roman gens, or other groups of descendants of a common ancestor. Organization names, toponyms of all kinds, brands/trademarks are all proper nouns, whatever word class their components are. DCDuring TALK 18:50, 3 February 2015 (UTC)Reply

No, I wouldn't argue with taxa (taxas?), brands, trademarks, names of organisations etc. I think it's just languages as proper nouns I disagree with. Donnanz (talk) 19:02, 3 February 2015 (UTC)Reply

The argument, I think, is that a language is a singular thing that a community speaks, just like e.g. a country is a singular place that a community lives. Of course, both can be pluralized: one can speak of Germanies, Americas, and even Frances, and one can speak of "various Englishes" (American, British, Indian, etc), "Norwegians" (Bokmal, Nynorsk, Riksmal, etc), "Germans", etc (though our entries currently don't, except in the first case). It may well be as technically inaccurate to label countries and languages as proper nouns as it is to label personal names as proper nouns. On the other hand, it seems to be common, among those dictionaries which use the label "proper noun", to label all of those types of thing as proper noun, and they do generally fit tests of proper-noun-ness. - -sche (discuss) 19:41, 3 February 2015 (UTC)Reply

There's quite a few examples of plural place names: Aleutians (that entry needs splitting), Falklands, Faroes (Faroe Islands), Netherlands (Nederland in Dutch) to name a few. But languages (in my opinion) are mass nouns, instead of Englishes and Norwegians (the people are Norwegians), we should refer to forms of English, forms of Norwegian and so on. Donnanz (talk) 20:42, 3 February 2015 (UTC)Reply

I think Netherlands (where the word for a singular country happens to be plural) is different from Frances (the plural of France, used to talk about e.g. different temporal or social incarnations of France). I can find several instances of Netherlands being pluralized, both invariantly ("the two Netherlands", a la "the two fish") and, rarely (and only "in the wild", not in places that meet CFI), as Netherlandses.

Hmm, mass nouns... that's plausible. Well, we have a fair few grammarians here, let's see what they think. Would you like to bring it up in the BP, or would you like me to?

DCDuring, does CGEL say anything about whether languages are nouns or proper nouns or mass nouns? For that matter, does it say anything about whether given names are proper nouns or not? (Apologies if you've answered the latter question previously and I'm forgetting.) - -sche (discuss) 21:50, 3 February 2015 (UTC)Reply

I think the name Netherlands may be historical as it also took in the all the low countries including Belgian Flanders at one time. It is still referred to as het Koninkrijk der Nederlanden (qv Nederlanden). Anyway, I suppose I had better start a thread in the BP. Donnanz (talk) 22:20, 3 February 2015 (UTC)Reply

@-sche: I don't see any explicit statement in CGEL that a name of a language is a proper noun nor that is any other type of noun. There is no reason why a proper name couldn't have a homonym that is a mass noun. Or rather isn't that just one of the generic secondary uses of many proper names, eg, "We've had too little Ruakh in our discussions lately." (The "too much" examples would cause trouble.) DCDuring TALK 23:53, 3 February 2015 (UTC)Reply

Moves

Latest comment: 9 years ago2 comments2 people in discussion

Sorry for all the deletion requests. I was basing the original reconstructions on some outdated material. Thanks. --Victar (talk) 07:04, 5 February 2015 (UTC)Reply

No problem. With wt:AWB, it's not that hard to delete a bunch of pages. - -sche (discuss) 07:08, 5 February 2015 (UTC)Reply

dative -e

Discussion moved to Template talk:de-decl-noun-n.

A friendly request to enable AWB use

Latest comment: 9 years ago2 comments2 people in discussion

And also, could you remove edit protected status for CheckPage? I can't edit it. --Dixtosa (talk) 12:41, 7 February 2015 (UTC)Reply

Sure, I can add you to the checkpage. :) I'm not going to unprotect it, though; it's supposed to be protected, as a safeguard against people who don't know what they're doing adding themselves to it. - -sche (discuss) 18:20, 7 February 2015 (UTC)Reply

Using passer and sortir with être

Latest comment: 9 years ago4 comments2 people in discussion

Do passer and sortir use être under exactly the same circumstances? Their usage notes are a little different, and I'm not sure if that's meant to imply that the terms use être under different circumstances or not. If they use être under the same circumstances, I'd like to reword Template:U:fr:may take être as much as needed and deploy it on both entries; otherwise, there doesn't seem to be a use for that template (it's currently unused and there's no point in templatizing usage notes that only apply to a single entry) and I'd like to delete it, unless you know of other entries that could use it. - -sche (discuss) 20:51, 9 February 2015 (UTC)Reply

Yes, this template is OK, it applies to both entries, but a more complete list is (at least) descendre, monter, passer, redescendre, remonter, rentrer, repasser, rerentrer, rerepasser, reressusciter, reretourner, ressortir, ressusciter, retourner, sortir. This list is not limitative (when you add re- to a verb, this is the same rule). Actually, avoir or être is used depending on the meaning, and this is best explained with examples, but the template seems to be a good summary: when used transitively (or with a transitive sense, even when the complement is omitted), it's always avoir. Otherwise, it's être. Lmaltier (talk) 21:15, 9 February 2015 (UTC)Reply

Thanks for the clarification! I'll clean the template up a bit and add it to those entries. - -sche (discuss) 22:53, 9 February 2015 (UTC)Reply

Also note that using être is also systematic for pronominal uses of verbs: cf. je me suis trompé vs j'ai trompé. But this is a different issue, it's not limited to a few verbs. Lmaltier (talk) 06:58, 11 February 2015 (UTC)Reply

Crucially important question

Latest comment: 9 years ago3 comments2 people in discussion

From which episode of QI do those words on your main page come? It's snowy in Tennessee, and there's nothing to do. —John C5 05:20, 17 February 2015 (UTC)Reply

@JohnC5 I believe it was the J series episode 13 on "Jobs". Those were all occupations people said they had in old British censuses. - -sche (discuss) 05:47, 17 February 2015 (UTC)Reply

I have seen that episode! Probably deserves a rewatching... —John C5 05:50, 17 February 2015 (UTC)Reply

Questionable revert

Latest comment: 9 years ago3 comments3 people in discussion

I would appreciate it if your reverts were a bit more careful. For instance here, I think that edit would have been fine since many people confuse UUers for a religious denomination. However most academics refers to it as a distinct religion. By highlighting the coordinate terms, it would have been clearer that this is a distinct religion. I'm disappointed with your knee-jerk finger-trigger like reactions. 84.13.154.209 16:16, 21 February 2015 (UTC)Reply

The merits aside, someone with your long, ugly history of questionable and often downright awful edits (yes, it's obvious who you are, whatever IP you happen to be using at the moment) is in any position to criticize the people who have to clean up after you. Chuck Entz (talk) 16:45, 21 February 2015 (UTC)Reply

I think it's better to put the coordinate terms, synonyms, etc in the lemma entry, rather than in all the various possible abbreviated forms (UUers, UUs, etc). - -sche (discuss) 17:48, 21 February 2015 (UTC)Reply

allosexual entry

Latest comment: 9 years ago1 comment1 person in discussion

Many thanks for your improvements, which were far above my Wiktionariological or semantic capabilities. Looks great! FourViolas (talk) 15:03, 24 February 2015 (UTC)Reply

Trans and frequencies

Latest comment: 9 years ago5 comments2 people in discussion

You must have the frequencies form transman, transwoman, etc. wrong; please check Google Ngram Viewer. --Dan Polansky (talk) 08:46, 7 March 2015 (UTC)Reply

No, Ngram Viewer clearly shows that the spaced form is more common in the case of trans woman (link, which looks like this to me — is it different for you?). For trans man, the unspaced form was still slightly more common at the time Google's data cut off (2008), but the spaced form was becoming more common while the unspaced form was becoming less common, so it seems likely that more recent data would show the same situation as with trans woman, i.e. that the spaced form is more common (especially in light of the proscription of the unspaced form by some authorities). - -sche (discuss) 08:54, 7 March 2015 (UTC)Reply

For transwoman, my mistake: I used the default Ngram settings which ends in 2000[1], but when one extends the graph to 2008[2], the picture changes.

For transman, you are making the less common form[3] (factor 1.6) the main dictionary entry, with justification that relies on extrapolation rather than actual situation. When one combines this with the proscriptions expressed online, I am not sure what to think of this. --Dan Polansky (talk) 09:09, 7 March 2015 (UTC)Reply

Well, to assume that the actual situation matches the situation a decade ago would also be making an assumption. It would be a reasonable assumption for most words, which have many decades of use, which have consistent (parallel) trendlines, and which the events of 2008-2015 can't be expected to have had much of an impact upon. (For example, couch and sofa.) In this case, however, the trendlines are divergent (and only go back about 15 years anyway), and increasing awareness by the general public of trans people's preferences can be expected to have influenced usage in the same direction as the trendlines were going when the data cut off. (Consistency with trans woman also plays a role.) - -sche (discuss) 09:48, 7 March 2015 (UTC)Reply

You actually have a good point; the 2008 data is 7 ears old. To bet that the trend for transman has after 2008 developed in a way parallel to trends seen even before 2008 for transwoman seems reasonable enough. Fair enough. --Dan Polansky (talk) 15:06, 7 March 2015 (UTC)Reply

hijra

Latest comment: 9 years ago3 comments2 people in discussion

Concerning this. Just a thought, but I'm not convinced that it's sensible to split the definitions. This is because it seems not clear in many citations (especially earlier ones) which sense exactly is meant, and more generally I suspect that the precise meaning lies on a continuum between the two rather than being neatly split into one or the other. At any rate that was my impression when I was working (briefly) on the word. Ƿidsiþ 07:52, 9 March 2015 (UTC)Reply

A lot of citations are ambiguous, yes. However, enough are unambiguous that I don't think conflating them is appropriate, particularly because the distribution of meanings seems to have a temporal component, i.e. the meaning seems to have changed over time. Citations that refer to the past often explicitly refer to hijras as eunuchs, defined by anatomy, while contemporary uses often (mostly?) refer to the third-gender people, defined by social role/presentation. Some of the latter works even explicitly specify that (modern) hijras are not necessarily eunuchs: google books:"uncastrated hijra|hijras" gets a few hits, and google books:"castrated hijra|hijras" (which would be redundant if hijras were necessarily eunuchs) gets several more, including some like "the not-yet-castrated hijra", "they were indistinguishable from castrated hijras when crossdressed - clearly, becoming hijra as a livelihood required neither castration nor gharana affiliation", and "[they] may or may not be castrated. Hijra is a developed stage." Perhaps the solution is to make the two specific senses into subsenses of a broad 'coverall' sense? - -sche (discuss) 08:27, 9 March 2015 (UTC)Reply

And then there are google books:"female hijras", who most of the citation make clear have attained hijra status by adopting a third-gender role and not by castration. These would be especially hard to work in to a 'coverall' sense — they would require it to be very broad indeed, to cover both eunuchs and women. Perhaps the solution is to have a {{qualifier}} or usage note explain that some uses don't distinguish male eunuchs from male-bodied third-gender people? - -sche (discuss) 08:44, 9 March 2015 (UTC)Reply

Upper Franconian language‏‎

Latest comment: 9 years ago7 comments4 people in discussion

User:Purodha added user boxes that triggered the creation of a whole bunch of bad language categories and redlinks by Babel AutoCreate- pretty much the gamut of nds-nl & nds-de lects. I've gotten rid of most of the redlinks by replacing the narrow-lect category link with the appropriate broader-language category link in the User categories that were created. The one holdout is Upper Franconian, code vmf (see Category:User vmf): I'm not really sure whether it's nds-nl or nds-de. Any suggestions? Chuck Entz (talk) 03:55, 18 March 2015 (UTC)Reply

@Chuck Entz See here. —Μετάknowledge^{discuss/deeds} 04:12, 18 March 2015 (UTC)Reply

It's been a while since I was looking into this, so I forgot some important details. Yes, it's High German, not Low German. If you follow the link to the Wikimedia discussion, it turns out that after we had deleted the vmf code, Ethnologue came out with corrections that led to vmf being deemed eligible for a wiki after all. Now that Ethnologue is no longer claiming that vmf applies to Mainz and Frankfurt, we may need to revisit the issue. Chuck Entz (talk) 07:03, 18 March 2015 (UTC)Reply

Thanks for the heads-up. I have been busy, but I will look into it. (I wonder if they have also clarified frs any.) - -sche (discuss) 01:27, 19 March 2015 (UTC)Reply

Apparently they have, it's now called "Saxon, East Frisian Low". (But the population count is still wrong, hmph.) -- Liliana • 01:33, 19 March 2015 (UTC)Reply

While you're here, what are your thoughts on the newly-redefined Upper Franconian? do you think it should be included? All the varieties of German are such a mess to pick apart into discrete lects... - -sche (discuss) 02:41, 19 March 2015 (UTC)Reply

Ethnologue does a horrible job at the German dialects. It appears to cover some, but not all of them and it's generally a huge mess to work with. (I hope you've seen my newest BP topic regarding the Swiss German lects.)
Have you seen the current vmf entry? It says "Hessen state: mostly River Main area, east of Mainz and Frankfurt." How much Hesse is there at the Main east of Frankfurt? lol. They really can't figure out what they want with this code, and it doesn't help that it's called "Mainfränkisch" with "Ostfränkisch" being a supposed alternate name, even though Mainfränkisch is just one of many subdivisions of Ostfraänkisch.
I mean, we could theoretically use it for the Franconian lects, but... eh. -- Liliana • 00:06, 20 March 2015 (UTC)Reply

frs Module errors

Latest comment: 9 years ago6 comments3 people in discussion

These have been hanging around since you removed the frs code. There were 146 to start with. I've chipped away at a few of the obvious ones, but there are still about 135. The problem is, I don't know which ones are Saterland Frisian, which ones are East Frisian Low Saxon, and which are some unspecified extinct Frisian East Frisian dialect.

It won't do to have all of those module errors for an extended period- there's already been one unrelated module error that I only found out about by going through all 136 entries in the category (there's an error in a Korean module that's since brought the total up to 199). Do you think you'll be able to fix them soon? Is there anything I can do to help? Maybe User:Leasnam, who added most of them, might be able to help. Chuck Entz (talk) 03:50, 23 March 2015 (UTC)Reply

I've been changing them as I see them...but the majority of those I've added, by the looks of them, represent a sampling of various unspecified extinct East Frisian dialects. Where I can connect them to a modern Saterland Frisian word I am updating them, but not universally. Sometimes I just change the code to stq to get rid of the error short term Leasnam (talk) 04:41, 23 March 2015 (UTC)Reply

Ugh, this is one of the few downsides to our use of language modules rather than language templates: I thought I had cleaned up all the uses of frs. (I should have waited for and searched an updated database dump to be sure.) I would temporarily reinstate the code, except that Ethnologue clarified that it refers to the Low German lect, which means I'd be replacing missing information (module errors) with potentially incorrect information (it's often unclear whether uses of the code on here are meant to refer to Frisian or Low German), which I am not sure would be an improvement. I'll chip away at what I can. If an entry simply lists an East Frisian word as a cognate (not an etymon), and it's not possible to determine which precise Frisian-ic or Low-German-ic lect it belongs to, it can simply be dropped, IMO. - -sche (discuss) 04:52, 23 March 2015 (UTC)Reply

I have no qualms about dropping a non-essential cognate. We can fix later if need be Leasnam (talk) 06:06, 23 March 2015 (UTC)Reply

Here is the reference cited in the first appendix entry I looked at. It seems to be treating East Frisian as a whole, which would include not just Saterland Frisian, but also at least a couple of the extinct dialects. Maybe we need an exception code for Frisian East Frisian as a whole, or maybe we should make stq the code for the whole language. Chuck Entz (talk) 07:01, 23 March 2015 (UTC)Reply

It would be sensible to do one of those things, yes. In the past I had proposed creating gmw-fre or gmw-efr for East Frisian, but there was insufficient support for that because it was at the time still unclear if frs really referred to the Low German lect. - -sche (discuss) 14:03, 23 March 2015 (UTC)Reply

Christe

"not convinced that this form is German and not Latin, but w/e" -- even duden.de states that there's a vocative for Jesus and Jesus Christus: "Jesus [...] Anredefall: Jesus und Jesu", "Jesus Christus [...] Anredefall: [...] Jesu Christe" ("Anredefall" is German for English vocative). There most likely would still be an ablative (cf. "von dem Nomine" [Nomen], "von dem Corpore" [Corpus], "von dem/der Radice" [Radix]), but the ablative of (Latin) Jesus and Christus equals the dative and so duden only mentions a dative. Also, though it should be obvious: the vocative of Jesus and Christus can especially be found in religious song books and most likely religious prayers etc. -13:48, 19 April 2015 (UTC)

Changing the parent language of Yiddish from MHG to OHG

Latest comment: 9 years ago9 comments5 people in discussion

(Pinging people who may be interested) @Metaknowledge, CodeCat, Angr

It is not clear that Yiddish branched strictly after the beginning of the MHG period. See for example section 7.25 in Max Weinreich's History of the Yiddish Language, where he concludes "Hence we have to postulate that Yiddish began to take shape as early as the Old High German period" (p. 424). Is this enough of a reason to change Yiddish's ancestors = from "gmh" to "goh"?

Another more difficult question would be whether to add Hebrew, Aramaic, Yevanic, and/or Judeo-Romance as an ancestors (which in some sense they are), but then again we don't put Frankish as an ancestor of French (perhaps we should?).

--Wiki Tiki 89 18:34, 20 April 2015 (UTC)Reply

I'd say the second question is the easier one: No. Languages that are the sources of loanwords—even large numbers of them—are not considered ancestral. Anglo-Norman is not an ancestor of English; Latin is not an ancestor of Albanian and Welsh; Italian is not an ancestor of Maltese; and Hebrew, Aramaic, Slavic, etc., are not ancestors of Yiddish. I have no objection to changing the parent language of Yiddish to OHG. —Aɴɢʀ (talk) 18:53, 20 April 2015 (UTC)Reply

But they're not exactly loanwords, they're more like kept-words. Jews that spoke other languages and settled in German-speaking areas, slowly and gradually adopted more and more German words and grammar, keeping many words and grammatical structures from their former languages, especially from Hebrew. This had already happened several times before and so the Hebrew words and grammatical structures were direct continuations from when Hebrew was their native language. This is different from loanwords, which speakers of one language simply borrow from another language. I presume that there was similar situation with French and Frankish, although I have never read about this and far fewer Frankish words survived in French for it to be significant. --Wiki Tiki 89 19:08, 20 April 2015 (UTC)Reply

Contact languages of any kind are going to be impossible to represent accurately in terms of choosing a language as a "parent". MHG seems no less (in)accurate to me as compared to OHG; during both time periods, there was an attested Jewish form of the language written in Hebrew script that had a lot of Semitic vocabulary. Yiddish has some differences in sound changes that allow us to estimate its general point of divergence, but the differences do not seem to be particular to Yiddish so much as features of some of the High German lects (not the one(s) that led to Modern Standard German). In the meantime, I think keeping it as MHG is perfectly fine, considering that MHG already represents a span of varying lects within certain parameters of time and space which arguably include the Jewish varieties. —Μετάknowledge^{discuss/deeds} 19:19, 20 April 2015 (UTC)Reply

Well Weinreich says on the same page as the quote above "Yiddish speakers were in close contact with German speakers, and it need not occasion surprise had the German component of Yiddish, although already part of an independent language, continued to be affected by changes that took place in the German determinant." I don't know whether you find that contradictory to your point or not. --Wiki Tiki 89 19:33, 20 April 2015 (UTC)Reply

The ancestry of Yiddish is the subject of some disagreement. Wikipedia calls the view of a MHG origin a "prevailing" view. Bernard Spolsky (The Languages of the Jews: A Sociolinguistic History, 2014, page 157) says "The basis for Yiddish was a Middle High German dialect, for Yiddish often agrees with Middle High German rather than with modern German[.]" And Paul Wexler (Two-tiered Relexification in Yiddish, 2002, page 133) goes so far as to say "there are no specific Old High German phonological or lexical features in Yiddish (see Simon 1991: 253)." But Wexler believes the ultimate origin of Yiddish is actually Slavic, and the Germanic content is the result of relexification in the 9th to 12th centuries; indeed, his full sentence (emphasis mine) is "The first relexification to German took place in the Middle High German period, to judge from the fact that there are no specific Old High German phonological or lexical features in Yiddish." In turn, Weinreich says what you quote, but Wikipedia says that his model also posits that "Jewish speakers of Old French or Old Italian, who were literate in Hebrew or Aramaic, migrated to the Rhine Valley, [...] encountered and were influenced by Jewish speakers of High German" and that the ultimate origin of Yiddish is the fusion of all this, not simply OHG.

Perhaps we shouldn't list a parent at all?

De facto, we more often give OHG words than MHG words as the etyma of Yiddish words. (In the past, some entries gave modern High German forms as etyma, but this was known to be problematic and has for the most part been addressed.)

- -sche (discuss) 22:08, 20 April 2015 (UTC)Reply

The way I see it is that listing MHG as a parent implies also OHG, but listing OHG as a parent does not imply MHG. So if we are unsure about MHG, then listing OHG is not wrong. But what actual consequences does listing the parent in the module have? What got me thinking about this was when I was adding פֿאָרן (forn) to *faraną and was unsure whether to put it under MHG or under OHG. Perhaps this should be decided on a word-by-word basis. If we know a word came from MHG, then we will list it under MHG, if we know it did not, then we would list it under OHG, and if it is unclear, that is where we need to choose a default and where I think OHG would be a better choice. --Wiki Tiki 89 22:33, 20 April 2015 (UTC)Reply

Frankish isn't really an ancestor of French: there were an awful lot more of the Romance-speaking Celts then there were Franks, so the Franks were somewhat like the Mongols in China- more important historically than linguistically. Chuck Entz (talk) 03:32, 21 April 2015 (UTC)Reply

Ok, then my comparison to French/Frankish was wrong. My point remains about Yiddish/Hebrew. --Wiki Tiki 89 14:15, 21 April 2015 (UTC)Reply

Pertain

Latest comment: 9 years ago1 comment1 person in discussion

Pertain, which pertaining is just a modified version of, is defined here on English wiktionary as "Verb[edit] pertain (third-person singular simple present pertains, present participle pertaining, simple past and past participle pertained)

(intransitive) to belong (intransitive) to relate, to refer, be relevant to" The "to belong" sense of pertaining is already covered by "of pedophilia", the "to relate" sense is already covered by "related to pedophilia", so it is redundant. Although its not necessary to be as simple here as on simple English wiktionary, its still important. Its best when writing to write in simple language, not complex. There is a book about this topic by H.W. Fowler called The King's English, you should read it. His first points in the book are, prefer simple words to complex words, prefer short words to long words, prefer common words to unusual words, and prefer Germanic words to Romance words. He would agree with me that pertaining would need to go in this case. --PaulBustion88 (talk) 02:12, 30 April 2015 (UTC)Reply

paurometabolous‎

Latest comment: 9 years ago5 comments2 people in discussion

Howdy-doo! I was just curious where you found the meaning of incomplete. It seems closely related to the meanings I've seen, but not quite the same. Just thought I'd ask. —John C5 04:01, 8 June 2015 (UTC)Reply

I saw it in The Century Dictionary (1914) defined as "characterized by incomplete metamorphosis", and that sense is suggested by citations like "cockroaches, grasshoppers, lice, true bugs, and so on, undergo paurometabolous or incomplete development" (Foundations of Wildlife Diseases, 2014, →ISBN, page 126). That citation is why I offered the shorter gloss "incomplete" before the semicolon, btw (since "paurometabolous development" is not "development characterized by incomplete development"). It's probably not a separate sense, and could be removed if sense 1 were expanded a bit. Btw, Century has a second sense, "of or belonging to the Paurometabola", which is defined as "in Brauer's system of classification, those insects in which the metamorphoses are slow, inconspicuous, and very incomplete, as the Orthoptera". The former looks like a candidate for Category:mul:Taxonomic names (obsolete). - -sche (discuss) 05:10, 8 June 2015 (UTC)Reply

Based on the wiki page for w:Hemimetabolism, I believe the word incomplete is used to mean "not executing all of the normal stages of metamorphosis," as opposed to "failing to complete metamorphosis." The ambiguity lies in that the members of Paurometabola succeed at their form of metamorphosis, but this metamorphosis does not conform to the standard metamorphic pattern. I might suggest abridged or atypical as opposed to incomplete because the latter most sounds like the bugs never succeed at maturing, which is certainly not true. Does this sound reasonable to you? —John C5 21:26, 8 June 2015 (UTC)Reply

Good point about w:Hemimetabolism. Actually, why don't we just link to that page? See what you think of my change to the entry, and feel free to undo or expand upon it. - -sche (discuss) 00:23, 9 June 2015 (UTC)Reply

Looks good to me! :) —John C5 00:56, 9 June 2015 (UTC)Reply

Partition verb senses by grammar, semantics, register/topic/context?

Latest comment: 9 years ago5 comments2 people in discussion

Looking at your excellent, extensive work on take reminded me of a question that bothered me about sense division, especially in verbs (though it comes up in other word classes).

Which of the various possibilities should take precedence in grouping definitions? For verbs, most dictionaries divide definitions into transitive and intransitive and, as a result, have some redundancy and obscure some semantic relationships. I often feel that certain groups of registers/topics, eg, sports, games, nautical, belong together no matter whether there are semantic reasons to split them. Some would group all archaic and obsolete senses.

We already split some semantically analogous senses by PoS eg, adjectives and adverbs, conjunctions and adverbs, conjunctions and pronouns, prepositions and adverbs, adverbs and nouns (eg, home). These splits make it harder to see the semantic similarities. Have we written off that kind of semantic visibility? Do we have to?

My natural inclination is to have grammar take precedence, but I'd be happy to hear arguments for the other possibilities. DCDuring TALK 20:31, 10 June 2015 (UTC)Reply

Working on take got me to thinking about sense grouping, too. I don't desire to adopt other dictionaries' practice of separating transitive and intransitive verbs, I only separated them on take to make the entry easier to work on. Now that I'm finished adding senses, I'll probably go back and interweave the transitive and intransitive ones, since I think it's better to group definitions/senses according to meaning. Separating transitive and intransitive senses often obscures the fact that some senses are ambitransitive (as here, where it resulted in what was basically the same sense being listed twice) or ergative.

Separating different parts of speech seems to me like a good practice to continue. The cases where it proves difficult (however) or could be regarded as obscuring semantic connections (home) are too few and far between to justify abandoning the practice.

- -sche (discuss) 05:20, 11 June 2015 (UTC)Reply

If an English L2 section is to be read as some kind of structured, terse essay on a term, then it certainly makes sense to group somewhat semantically.

OTOH, if an English L2 section is intended to help an ordinary user find a definition, at least some users would benefit from a transitive/intransitive split, which would support faster scanning for the possible definition. (This argument also favors topical labels, which I have, perhaps wrongly, opposed.)

Another consideration is entry maintainability. Of course, to tinker with your efforts would be gilding the lily, but it is easier to assess, analyze, and repair the range of coverage of a set of definitions, if the set can be made smaller on some easy-to-determine grounds, like the hard grammatical distinction of transitivity/intransitivity. DCDuring TALK 12:41, 11 June 2015 (UTC)Reply

You are right that transitivity is an (possibly the only) easy-to-determine hard-and-fast distinction, and that segregating senses according to it could help people find specific senses. I'm not strongly opposed to it, I simply think semantic grouping is better. Where would ambitransitive and ergative senses go if senses were split by transitivity? In sections all their own, e.g. between the transitive and intransitive senses? (That would seem a bit awkward, but not outright problematic.) Or would they be duplicated and placed in both the transitive and the intransitive section? That would seem unhelpful to English-speakers, though perhaps helpful to translators (if they have distinct translations in some languages, which seems likely).

Other ways of sorting verb senses are by age (oldest—or newest—senses first) and by commonness (most—or least—common senses first). I suppose those are not mutually exclusive with grouping senses by meaning or transitivity.

Perhaps someone will devise a gadget that will give users buttons, similar to the "show/hide quotations" buttons but located e.g. at the top of each POS section, which will allow users to optionally hide senses with certain tags, e.g. obsolete, archaic, transitive, intransitive, even US (if a user knows they're searching for a sense Brits use), UK, etc.

- -sche (discuss) 21:22, 11 June 2015 (UTC)Reply

We could have sortable tables of definitions! Ugly, and needing a lot of artificial data to generate what we think is appropriate. Or we could let users run SQL queries against a database of definitions.

I've never been convinced of the utility of ergative and other high-falutin' linguists' labels for the supposed 'normal' users, if indeed we have any 'normal' users. Those mostly seem good for making sure that someone working on an entry checks to make sure that the appropriately reworded definition appears in both transitive and intransitive sections, ie, duplicate underlying semantics.

After group by the hardest of grammatical distinctions, I would group semantically, preferrably using subsenses, ordering the senses by date of attestation of the sense (in principle) or degree of concreteness (which might coincide with date of attestation for the definition in the language or an ancestor. Subsenses would follow the same ordering principle within the sense. But recourse to attestation actually means relying of OED for many words, though not so much for more recent sense development.

As we don't really have a clearly dominant approach, I think we can still let contributors do it the way they want to. I would not impose my ideal grouping and order on an entry that was a good example of another set of organizing principles and hope that no-one would waste time merely reordering and regrouping mine, unless there was a good reason (clear error, reorganizer actually working from the OED, etc). DCDuring TALK 22:11, 11 June 2015 (UTC)Reply

Orange links and ACCEL

Latest comment: 9 years ago4 comments3 people in discussion

Hi. Is there any way to combine the orange link gadget with the WT:ACCEL one? --Type56op9 (talk) 17:36, 13 June 2015 (UTC)Reply

Not that I'm aware of (I think people have asked about that before). It would be useful, though. You could ask in the Grease Pit. - -sche (discuss) 18:18, 13 June 2015 (UTC)Reply

(edit conflict) Not as such. Acceleration works by adding preloads to a redlink, which requires that there be nothing there. One would have to have an app to add a language section to an existing entry, which would require different methods. It may be possible (bots certainly have no trouble with it), but it wouldn't be a trivial exercise. Chuck Entz (talk) 18:21, 13 June 2015 (UTC)Reply

My illegal bot made such additions the time. But then it got blocked, so I had to hide the fact I was using a bot by changing the code. Then people figured out I was still using a bot. However, if this new orange-accel tool was around, I could use the illegal bot again, and pretend I was using the tool. Everyone's a winner! --Type56op9 (talk) 18:26, 13 June 2015 (UTC)Reply

Whoops.

Latest comment: 9 years ago4 comments2 people in discussion

On the "Greek" page, that was a filter that I put on my computer that did that. I'll have to make sure to check that in the future to make sure that it doesn't sneak into my edits by accident. The filter replaces words with "[word deleted]". I installed the filter because too many people were swearing left and right on many of the websites that I visit, and I grew tired of seeing it.

But yes. xD

That was pretty funny. My bad. Tharthan (talk) 18:11, 15 June 2015 (UTC)Reply

Ah, thanks for the explanation; I had wondered why it flagged "clit" but not "anal sex", haha. Thankfully people around here don't swear that much (not that I mind) — I guess it's to be expected that dictionary-editors know more articulate ways of expressing themselves. - -sche (discuss) 18:17, 15 June 2015 (UTC)Reply

Yeah, frankly I would have set it to change each word to a clean synonym, but the filter in question only allows for one all-encompassing replacement (which kind of stinks, because it reminds me of those old IRC-type chatrooms that just replaced vulgarities with asterisks rather than creatively write around them). But it's the best I can find for Firefox.

By the way, I have to ask:

You said that the main criterion for cited sources is that they must be durably archived. Are there any exceptions to that? Do we allow citations of tabloids or other "buzzword books" that may indeed use a neologism or retronym for over a year but be truly the only ones to do so. Tharthan (talk) 18:47, 15 June 2015 (UTC)Reply

Durably-archived tabloids are allowed; they aren't prestiguous, but their vocabulary is part of the great big grab-bag which is the English (or German, etc) language. Terms which are "neologisms", "slang", "informal", "rare", etc should certainly be marked with those labels, however, and in exceptional cases one can write usage notes.

What kind of "buzzword books" do you mean? Books that define and then give made-up examples of slang are disallowed by WT:CFI#Conveying_meaning, which "filters out [...] made-up examples of how a word might be used". But authors who like to work as many words from those kinds of books into their own literature, well, they're allowed. I got the impression that Georgette Heyer copied words from the 1811 Dictionary of the Vulgar Tongue and pasted them into her dialogues, sometimes clumsily. In fact, that makes me realize [4].

If a work is of such low quality that one can't be sure it is in fact using a given word (as opposed to unintentionally containing a string as a typo or misspelling), it is generally excluded, however (because CFI requires evidence of use). So, a citation like "Berlin, Germany has many ihstoric stires, as do most other cities in Germanny." would probably not be accepted as evidence that "Germanny" is an alternative spelling of "Germany". (But a book from 1600 that said "Southern Germanny is a Land of mannifold historickal Constructions, of a Roman Charackter" would suggest that "Germanny" was once an obsolete spelling of "Germany".)

- -sche (discuss) 21:11, 15 June 2015 (UTC)Reply

Stolperstein

Latest comment: 9 years ago3 comments2 people in discussion

Hallo -sche,
nach längerer Zeit habe ich mal wieder eine größere Bearbeitung getätigt und dabei den oben genannten Eintrag erweitert. Könntest du mal bitte drüberschauen und etwaige Format-, Formulierungs- und Übersetzungsfehler korrigieren. Danke im Voraus und lieben Gruß dir, Caligari ^Ɔɐ^ƀïиϠ_Ⴕ 06:02, 16 June 2015 (UTC)Reply

Natürlich; und lieben Gruß auch dir! PS, there must be something in the air (as they say) causing people to undertake big multilingual projects, since I just attempted one in the other direction, expanding (take and then) de:take. - -sche (discuss) 09:16, 16 June 2015 (UTC)Reply

I guess my English got a bit rusty. So again, many thanks for your swift corrections. Each and every correction will improve further editings...hopefully :-).

@de:take: Wow! Indeed. Great job so far with regards to the massive content expansion. Let me know when you think you completed expanding "take". There are some formatting issues that I'll let you know on your German user talk once you've done with expanding. There need to be some "Feintuning" with regards to the format. As an advice I would recommend that you take a look at articles in de:Kategorie:Polnisch, de:Kategorie:Tschechisch or de:Kategorie:Schwedisch. If you need specific help, don't hesitate to let me know.

Lieben Gruß dir, Caligari ^Ɔɐ^ƀïиϠ_Ⴕ 15:33, 16 June 2015 (UTC)Reply

Moinsen. WT: ANDS.

Latest comment: 9 years ago3 comments2 people in discussion

Moinsen. Ich biete dies: User_talk:Korn/sandkist Korn [kʰʊ̃ːæ̯̃n] (talk) 13:57, 20 June 2015 (UTC)Reply

Merging the German and Dutch lects... bleh. I don't oppose it, or support it. (As I wrote further up on this page, "the general disagreement and slow-motion edit-warring about how to handle the various Low German lects makes for so much ugliness that I am losing interest in editing them" at all.) I strongly suggest, almost to the point of insist, that one orthography should be chosen for forms to be lemmatized on / normalized to (I don't know if this is what you intended the "consonants" and "vowels" sections to do), so that we don't end up with five entries lemmatized five different ways, representing the same diphthong five different ways, as if all the words were pronounced differently, when in fact they just use different orthographies or have predictable dialectal variation. Nouns should uniformly begin with majuscule letters, or uniformly not do that, for the same reason.

I've made a few typofixes and other small changes, e.g. dropping the Dutch spellings of "coïnciding" and "reëmergence". Also note that merging Plautdietsch would need discussion quite apart from merging GLG and DLS, because people (e.g. Angr, and me) in past discussions have supported keeping it separate on account of its separate history and development on another continent.

I also suggest either dropping the "During Middle Low German [...] Central and Upper German" line, or rewriting it to give native forms (we'll have to suck it up, bite the bullet, and perform whatever other idioms are necessary to give one dialect's forms as examples) so that it doesn't imply Low Germans actually used the words "German", "Low Landic", etc, especially given that "Low Landic" gets all of four Google hits. (Alternatively, a phrasing like During Middle Low German times, the language was known by cognates of the terms "Dutch", "Saxon", "Netherlandish" or "Netherdutch" would technically be accurate, but confusing to the uninitiated.)

- -sche (discuss) 17:26, 20 June 2015 (UTC)Reply

Ganz ruhig. Ich glaube, Du verstehst meine Intention falsch. Der von mir geschriebene Text sollte ein Ausgangspunkt für ein Gespräch zwischen uns beiden über die Änderung des ANDS sein. Die derzeit existenten ANDS-DE, -NL und PDT sollte das noch gar nicht berühren, weshalb sie auch nicht erwähnt sind. Die Sektion über die Konsonanten und Vokale soll interessierte Autoren und Nutzer nur darauf hinweisen, dass eine Schreibung nicht bedeutet, dass überall dieselbe Aussprache vorherrscht und ggf. zu weiteren Eintragungen im Pronunciation-L3 anregen. (Oder wenigstens überhaupt welchen.) Von der Plautdietsch-Geschichte bin ich nicht überzeugt, da sich Plautdietsch kaum bis gar nicht von anderen Dialekten unterscheidet. Und den Teil mit den native forms verstehe ich ganz einfach nicht. Es klingt, als würdest Du befürchten, dass die Leser fälschlicherweise denken, dass die Holländer sich tatsächlich mit englischen Worten benannt hätten. Korn [kʰʊ̃ːæ̯̃n] (talk) 18:22, 20 June 2015 (UTC)Reply

Old Italic display help

Latest comment: 8 years ago14 comments2 people in discussion

Hello! Remember this discussion way in which you mentioned you make fonts? Well, this is not exactly that, but I have been working on making Appendix:Old Italic script with all of the relevant Old Italic languages (I still need to add Raetic, Camunic, Lepontic, etc.). I will then use this table as a references to create Module:Ital-translit which will service all of the Old Italic languages. I thought that it would be very nice to be able to show all the different letter forms that would map to any given Unicode letter. The documentation for how the Unicode block is defined is here and contains descriptions of all the different letter forms for each sub-script (in section 3). I was hoping you (or someone you could suggest) might be able to create PNG's for the use in {{t2i}} so that we could display all the Old Italic letter forms both in this appendix and potentially in the mainspace for quoting inscriptions. I know that this isn't a high priority for anyone, but now that I've started, I've gotten quite excited about the whole business. Below are some other reference materials for all the scripts. I'm not hoping for every little variation of every character, but if you make PNG's for the major ones, I'll do all the rest. Also, if this is just too much work, just tell me. —John C5 21:11, 24 June 2015 (UTC)Reply

Hmm, I'll see what I can do. Btw, I notice the Glagotic t2i images are a mix of svgs and gifs, although svg versions exist for at least some of the gifs and could be swapped in. - -sche (discuss) 20:01, 26 June 2015 (UTC)Reply

Yeah, that is rather weird. I have not idea how why that is the case. Also, the behavior for which I asking you is a little different than the normal t2i behavior, because I would want {{t2i|a|a2|a3|a4|a5}} to be different versions of the same letter. Just making sure you understand that for which you signed up.

Also, thanks! —John C5 23:19, 26 June 2015 (UTC)Reply

Hey again. Sorry to pester, but is there any progress on this? I want to have a discussion/take a vote to solidify the mapping of characters used in Module:Ital-translit and Appendix:Old Italic script since some of the character transcriptions (specifically those in South Picene and Camunic) are very odd. Having these for the discussion would be very useful. And again, if this is too annoying to do, please tell me. —John C5 06:29, 8 July 2015 (UTC)Reply

Thanks for the poke.
The various letter-forms in the images you showed me are all, for lack of a better word, very line-y (as opposed to calligraphic like pen- or quill-and-ink handwriting, which is what I'm more used to designing fonts based on). I did mock up variants of the A in a style somewhat like the images of the Glagolitic letters, but finishing all the alphabets in that style would take quite a while. I was going to try jotting all the letters on paper and scanning it and autotracing it into a png or svg, and then post an update, but I've been busy. Hmm, you could try it yourself — and I hope that doesn't sound rude; I'm not saying "grr, do it yourself", I just mean that you could probably do that as well as I could. And if I do later find time to make more calligraphic letters, they could always be swapped in. - -sche (discuss) 07:02, 8 July 2015 (UTC)Reply

No worries. I guess the whole making-png's-and-formatting-them-and-uploading-them thing would have somewhat of a learning curve for me. I didn't really need calligraphic versions―I was more hoping for just boring, old line versions of the different letterforms so I could disambiguate them in the appendix. It's kind of frustrating how many ways each character can appear, and having them all in a row would be useful. Is there anyone else you could recommend for this because I understand how making an all-lines-all-the-time font could be kind of dull? —John C5 07:19, 8 July 2015 (UTC)Reply

@JohnC5 OK, I've made a batch of letters and variants, which can be found at commons:Category:Italic letters. I traced a picture of an inscription, which is why the 'C' for instance is not a perfect circle; I will probably go back and make geometric 'perfect circle' variants at some point. I haven't done the whole alphabet yet. - -sche (discuss) 22:53, 10 July 2015 (UTC)Reply

You're the coolest! —John C5 00:32, 11 July 2015 (UTC)Reply

@JohnC5 Uploaded some more. Sorry this is taking a while. Think we should make a table to show all the forms (a bit like Wiktionary:Gothic transliteration but probably vertical rather than horizontal)? - -sche (discuss) 02:19, 4 August 2015 (UTC)Reply

Thanks for your help with this; they look great. I seem to have bitten off more than I can chew at the moment. Feel free to add them to the table as you see fit, or keep pestering me. Please keep pestering me. —John C5 02:52, 4 August 2015 (UTC)Reply

For now, I'm storing these in Appendix:Italic script. By the way, I notice commons:Category:Etruscan letters and commons:Category:Oscan alphabet already have some letterforms in them. - -sche (discuss) 05:39, 8 August 2015 (UTC)Reply

@JohnC5 Let me know if anything you need is missing from Appendix:Italic script. In each section, the first gallery / row are letter-forms I drew and the other rows are letter-forms which I discovered already existed on Commons. - -sche (discuss) 00:30, 9 August 2015 (UTC)Reply

Wowzers. Thanks so much for all this work. My next task will be to load them all into {{Ital2img}} and then use that to populate Appendix:Old Italic script with the appropriate letterforms. Both steps may take a while in turn. I feel, however, that this will greatly clarify the equivalency of the different symbols across sub-alphabets.

PS: Is there an abbreviation for the Appendix namespace like there is for Wiktionary (WT). I feel like I've wasted several years of my like writing out the word Appendix. Just think if you could write out APP:AITAL. That would be magical. —John C5 00:41, 9 August 2015 (UTC)Reply

There is not, but we do have a few cross-namespace redirects using the WT: shortcut. You could create WT:AITAL pointing to the appendix namespace (or even move the appendix into the Wiktionary namespace). Feel free to change the format of that page, btw. - -sche (discuss) 01:25, 9 August 2015 (UTC)Reply

Other resources

Etruscan: File:Etruscan alphabet.png, [5]
Non-Italic: File:Venetic Raetic Camunic Lepontic alphabets.png, [6], [7]
Latin: [8]
Oscan: [9]
Umbrian: [10]
Faliscan: [11]
Other Italic (ignore Messapic): [12]

double-team

Latest comment: 9 years ago1 comment1 person in discussion

Hey, there's probably a better way to put it, but at double-team I wanted to express that it suggests two people penetrating. One person can double penetrate with fingers and/or dildos, but one person can't double-team, AFAIK. WurdSnatcher (talk) 03:03, 10 July 2015 (UTC)Reply

Phobias

Latest comment: 9 years ago2 comments2 people in discussion

I direct you to Special:AbuseFilter/41 and Wiktionary:Requests for verification#agyrophobia. Also, aWa will not automatically recognise the discussion result if you forget to embolden it. — Keφr 11:56, 15 July 2015 (UTC)Reply

Duly noted, thank you.
The filter says "Of the last 8,991 actions, this filter has matched (0.00%)", is that just because it's turned off? I've turned it on, but set it to only flag edits. We can see how that works and then potentially upgrade it to warn or stop editors. - -sche (discuss) 00:22, 17 July 2015 (UTC)Reply

Template:tl

Latest comment: 9 years ago2 comments2 people in discussion

Irritating Wikipedians is a feature, not a bug. It prompts them either to drop the assumption that this project is run like Wikipedia, or leave. (Well, it did the former for me at least. And surely there are some that cannot do either, which means they should be blocked.) —Keφr 06:37, 25 July 2015 (UTC)Reply

People shouldn't be importing e.g. navboxes from sister projects (and I don't think we need {{reflist}}). Having a redirect from the name that every other project (Commons, Meta, en.WP, Simple English Wiktionary, Voyage, Source, Quote) uses to the name we use for the same thing just seems helpful, not only to users from everywhere else but also potentially for those users here who complain about every keystroke they have to type... since tl is shorter. - -sche (discuss) 09:16, 25 July 2015 (UTC)Reply

Lean keep

Latest comment: 9 years ago9 comments4 people in discussion

You wrote "Lean keep per Equinox". What does it mean? Are you leaning towards a keep vote (but not quite sure), or is it an adjective, a sort of "lean" or thin/skinny/ephemeral keep, like a "weak keep"? Equinox ◑ 08:28, 25 July 2015 (UTC)Reply

Leaning towards keeping. Ah, the terseness and ambiguity of our RFD jargon. The phrase "RFD-failed" is worse; a passing 'pedian at one point questioned me why I had deleted something if the "request for deletion failed". - -sche (discuss) 09:19, 25 July 2015 (UTC)Reply

Yep everything is bloody awful. Thank you for explaining. Equinox ◑ 09:24, 25 July 2015 (UTC)Reply

If it's not annoying, maybe I could suggest "weak __" for "lean __". I don't like placing the "vote" (weak or otherwise) if I'm not convinced, so I don't use it. But. I think I've occasionally written "weak oppose" etc. where I didn't like something but couldn't be bothered to explain why. It just needs a few of us to kill change by apathy. Hurrah. Equinox ◑ 09:26, 25 July 2015 (UTC)Reply

Why not use "RFD deleted"? --Dan Polansky (talk) 10:01, 25 July 2015 (UTC)Reply

I used to write "deleted" and someone scolded me and told me to write "failed". Equinox ◑ 10:05, 25 July 2015 (UTC)Reply

They should not have scolded you; did they perhaps confuse RFD with RFV? Many RFDs are closed as "deleted"; it is a common practice, and one that makes sense. I prefer to write "RFD deleted" rather than just "Deleted", in keeping with "RFV passed", "RFV failed", and "RFD kept", in boldface; the point is to make the closure clear and distinct as a closure, and indicate which process is being closed. But again, "deleted" is fine, and multiple people used it quite recently, including bd2412. I actually think "RFD failed" should be banned as a closure. --Dan Polansky (talk) 11:13, 25 July 2015 (UTC)Reply

I agree that "deleted" is clearer than and preferable to "failed". I suspect uses of "failed" are due to thinking of RFD (and RFV) as a process for deciding whether or not to keep an entry (an entry is deleted pursuant to the process = it fails to be kept). The deletion summary "Failed RFD, RFDO; do not re-enter" seems to conceptualize it in this way. I've boldly changed it. Several other deletion reasons in that list are redundant or need cleanup, IMO. - -sche (discuss) 22:39, 25 July 2015 (UTC)Reply

You mentioned the two "No usable content given" lines: I added the one with "Please see WT:ELE" because there were enough cases where I was adding it by hand, but there are also plenty of cases where ELE wouldn't have helped. Chuck Entz (talk) 03:14, 26 July 2015 (UTC)Reply

Wiktionary:Vietnamese transliteration

Latest comment: 9 years ago2 comments2 people in discussion

By creating this page, you caused all instances of {{vi-noun}} that include Nôm transcriptions to display a link to this page. Where in Wikipedia is the reader expected to look? The Nôm script predates the Latin-based Vietnamese alphabet, so I want to make sure it doesn't sound like the given Nôm characters are derived from the alphabetic words somehow. – Minh Nguyễn ^💬 06:39, 29 July 2015 (UTC)Reply

I created several such pages following Wiktionary:Grease pit/2015/July#remove_junk_from_Special:WantedPages. It was my impression that a (black) link was already present even before the page existed, so my edit was just to clear it off of Special:WantedPages, where it sat because of how many entries linked to it even without it existing. Feel free to add more informative content or even delete the page. Ideally, the template/module that inserts the link should be rewritten the way Module:IPA was recently, to only add links for the small number of languages which have transliteration schemes documented on Wiktionary, rather than performing an expensive check (as it does now) to see whether or not the dot (which, as an aside, I doubt very many people notice in any language) should have a blue link or be black. - -sche (discuss) 06:59, 29 July 2015 (UTC)Reply

tmh

Latest comment: 8 years ago4 comments2 people in discussion

Can we change the primary name of tmh (in Module:languages/data3/t) from "Tamashek" to "Tuareg"? tmh is the macrolanguage containing thv ("Tahaggart Tamahaq"), taq ("Tamasheq"), ttq ("Tawallammat Tamajaq"), and thz ("Tayart Tamajeq"). "Tamashek" is just an alternative spelling of "Tamasheq" and makes it very confusing. Also, "Tuareg" is simply a much more widely used name for these languages. --Wiki Tiki 89 15:40, 6 August 2015 (UTC)Reply

Yes, "Tuareg" would be a clearer name for it. Should we even have tmh at all, though, if we include its subvarieties as separate languages? (I note that ber, the macro-macro-language code containing tmh, was deprecated in favour of its subdivisions.) - -sche (discuss) 19:20, 6 August 2015 (UTC)Reply

I personally feel that Berber is overdivided. I'm not an expert, but it seems Tuareg languages are all relatively mutually intelligible (see here, for example) even if they have different realizations of some consonants (evident in the language names I listed above). So maybe we should merge all of Tuareg into one? The simplest thing for now, though, is to just rename tmh to Tuareg. --Wiki Tiki 89 19:44, 6 August 2015 (UTC)Reply

Yes, deprecating the sub-dialect codes in favour of tmh would also work. (And yes, Berber is quite over-divided...) - -sche (discuss) 19:57, 6 August 2015 (UTC)Reply

northern fur seal translations for WOTD?

Latest comment: 8 years ago9 comments3 people in discussion

k'oon is soon (10 August) to be a foreign WOTD. I have added entries for Callorhinus ursinus and northern fur seal. Could you take a look? Also, if you can find any Native American translations, they would make northern fur seal more interesting. The seals apparently ranged as far south as Baja. I've also left a note for Chuck Entz, as this might really be in his wheelhouse. DCDuring TALK 16:09, 6 August 2015 (UTC)Reply

I tend to know more about the languages on the other (Atlantic) coast, but I'll see what I can do. - -sche (discuss) 19:46, 6 August 2015 (UTC)Reply

We are lucky if we get folks to click through at all, let alone look at translations, let alone be impressed. So only modest effort, with high likelihood of success, is worthwhile. Thanks. DCDuring TALK 19:58, 6 August 2015 (UTC)Reply

There's a Tlingit translation here, which I think might be x̲'ún or x'ún in the orthography used by the current entries. Also, I wonder about the "hair seal" and "big seal" in this Yurok reference- could one of those be the northern fur seal]? Chuck Entz (talk) 21:19, 8 August 2015 (UTC)Reply

I made an assumption, based on the distribution of fur seal species, that in any native northern Pacific language a word for fur seal had as its original referent the northern fur seal, whatever else might now be covered by the word. Hair seal seems likely. I could not venture a guess about big seal, as I don't know what seals have been extant on the Pacific coast of North America. DCDuring TALK 21:30, 8 August 2015 (UTC)Reply

The northern elephant seal could easily be the referent for a term that glosses as "big seal". DCDuring TALK 21:34, 8 August 2015 (UTC)Reply

Yurok, since it is Algic, I know a bit about: chkweges, which that work translates as "hair seal", is indeed the northern fur seal, Callorhinus ursinus. As for Tlingit, we do seem to use x̱ in pagetitles, so I think x̱'ún is the orthography to go with (some of our entries currently use ’, but this strikes me as wrong). - -sche (discuss) 21:32, 8 August 2015 (UTC)Reply

Take a look at our entry for hair seal, and my revision of it. It is confusing that several references (not just the Yurok one) gloss as "hair seal" words that mean "fur seal". - -sche (discuss) 21:40, 8 August 2015 (UTC)Reply

Maybe I was too hasty on hair seal. I can't imagine that any people that depended on seals for food, clothing, etc could fail to make a distinction between seals with fur and those with only hair, the latter being good for storage, portage, kayaks etc, more than for clothing, where animal fur would be valued for warmth. But I couldn't find in the Yurok reference a distinction between "hair" and "fur". Human hair, at least, seems to be the referent for words that included the morpheme "lep". It may be that the Yurok "big seal"/"sea lion" vs "hair seal" distinction (or at least that of the author of the lexicon) is close to ours between eared seals (Otaridae, which include the fur seals, but also include sea lions, which do not have fur) and earless seals (Phocidae). DCDuring TALK 23:33, 8 August 2015 (UTC)Reply

Whitelist nominations

Latest comment: 8 years ago2 comments2 people in discussion

(tried responding back at the Whitelist, but I apparently don't have permission to do so – I apologise for posting here)

I checked Redboywild's edits and they seem to be ok – formatting is correct and I couldn't find a single mistake or bad translation. So I see no reason why he shouldn't be whitelisted. Thank you for consulting me about it :-)

PS: Just found out that this user has been warned a couple of times in the Romanian Wikipedia and blocked once for introducing obscenities. This happened some time ago and he hasn't done it since. He has probably – and hopefully – matured, but I'll keep an eye on his edits so they're up to par. --Robbie SWE (talk) 15:46, 10 August 2015 (UTC)Reply

Oh, apologies, I forgot you were only a sysop on ro.Wikt and not here. Thanks for the input. - -sche (discuss) 17:39, 10 August 2015 (UTC)Reply

Two spellings

Latest comment: 8 years ago2 comments2 people in discussion

I have a question: Are außlegen and meßen pre-1996 spellings? --Lo Ximiendo (talk) 02:58, 17 August 2015 (UTC)Reply

In one sense, yes — they were used in the 1600s, and the 1600s are before 1996. But in practical terms, no — when it comes to categorization or the like, "pre-1996" refers to spellings which were still standard right up until 1996, which these weren't. - -sche (discuss) 07:18, 17 August 2015 (UTC)Reply

Sardinian translations

Latest comment: 8 years ago4 comments2 people in discussion

If you weren't already (painfully) aware of this: see Category:Pages with module errors, which seem to be all translation and descendents sections. I've cleared a few, but it's slow going with the translation sections hidden. Also, I noticed that there were also a couple of minor Sardinian lects that weren't affected. Chuck Entz (talk) 13:01, 17 August 2015 (UTC)Reply

Sigh. As I lamented about Frisian, these translations went un-updated because good translations have been invisible (short of searching a database dump) ever since we switched from templates to Module:languages, as opposed to ttbc and t-check translations, which are categorized. Perhaps all {{t}}s should put entries into hidden categories like "Entries with Sardinian translations".

Now that they're all in Category:Pages with module errors, I'll just plug that into AWB and go through them.

If you're referring to Gallurese and Sassarese, I didn't merge them because (as I wrote here) they are despite their names not unequivocally considered dialects of Sardinian; rather, they're often considered dialects of Corsican (co) or transitional between Sardinian and Corsican. I'll propose renaming them soon for that reason, and move any I find nested below Sardinian.

- -sche (discuss) 17:45, 17 August 2015 (UTC)Reply

In the recent reclassification of Kölsch, I used a database dump to find and fix entries in translations tables before deprecating the code, so only a half dozen residual things made their way into Category:Pages with module errors. Progress! - -sche (discuss) 01:05, 3 September 2015 (UTC)Reply

Great! I would also suggest using "insource:{{t|xxx" in the search box to find any that weren't in the dumps. Chuck Entz (talk) 03:32, 3 September 2015 (UTC)Reply

Talossan (tzl)

Latest comment: 8 years ago4 comments3 people in discussion

I see you edited this file many times before. Could you update the variable for Talossan (tzl) here and replace it with the following:

m["tzl"] = {
	canonicalName = "Talossan",
	type = "appendix-constructed",
	scripts = {"Latn"},
	family = "art",
	sort_key = {
		from = {"[àáâäå]", "ç", "ð", "[ëèéê]", "[ìíîï]", "ñ", "[öòóô]", "ß", "[üùúû]", "þ"},
		to   = {"a", "c", "d", "e", "i", "n", "o", "s", "u", "z"}} ,  -- the copyright sign is used to guarantee that ð and þ will always be sorted after all other words with respectively d and z
}

(source)

¡Graschcias, Robin van der Vliet (talk) (contribs) 18:42, 27 August 2015 (UTC)!Reply

Done. Are publications like the Guizua Compläts àl Glheþ Talossan copyrighted? If so, I would caution you not to add more than a couple dozen words in the language, because including too much of a copyrighted language (like Klingon) poses legal problems/risks for Wiktionary (for which reason the Klingon appendix was greatly condensed by me a while ago, following this BP thread). - -sche (discuss) 01:04, 28 August 2015 (UTC)Reply

Ün Guizua Compläts àl Glheþ Talossan is (as far as I can tell) a copyrighted book, but it is not the source of the language. I am also not sure if the Talossan language is copyrighted and if languages can be copyrighted in the first place, as a language is a gigantic list of facts and facts can not be copyrighted. Robin van der Vliet (talk) (contribs) 16:52, 29 August 2015 (UTC)Reply

Individual facts, no. A compilation of facts can be copyrighted, though. With a bit of work, any creative work can be analyzed as a collection of facts, but the way the facts are assembled by the creator of the work makes them copyrightable. Chuck Entz (talk) 23:31, 29 August 2015 (UTC)Reply

Updates to Template:WOTD

Latest comment: 8 years ago3 comments2 people in discussion

Hi, I updated Template:WOTD at Template:WOTD/sandbox, essentially adding a new parameter |comment= (or {{{6}}}) which allows editors to add a comment: see Template:WOTD/testcases. If that is all right, could you update Template:WOTD? I can't do it myself as I'm not an administrator. If this isn't the correct procedure for proposing changes to the template, please advise. Thanks. Smuconlaw (talk) 14:35, 31 August 2015 (UTC)Reply

Done. Neat idea; I had noticed your addition of it to manumission (28 August). - -sche (discuss) 16:51, 31 August 2015 (UTC)Reply

Great! Thanks. Smuconlaw (talk) 21:53, 31 August 2015 (UTC)Reply

Unprotection of Word of the Day pages

Latest comment: 8 years ago3 comments2 people in discussion

Could you please unprotect "Wiktionary:Word of the day/September 29" and "Wiktionary:Word of the day/September 30" so I can update them? Thanks. (If you have time, perhaps you can also go through other days of the year and unprotect them as well.) Smuconlaw (talk) 16:08, 2 September 2015 (UTC)Reply

Done. I wonder why some, but only some, of the pages were protected in the first place. - -sche (discuss) 00:58, 3 September 2015 (UTC)Reply

Thanks. No idea why this was done. Perhaps it was before there was cascading CSS protection of material on the Home Page? Smuconlaw (talk) 06:40, 3 September 2015 (UTC)Reply

Updating of Template:quote-book/source

Latest comment: 8 years ago2 comments2 people in discussion

I have created an updated version of {{quote-book/source}} at {{quote-book/source/sandbox}} to address the three issues mentioned at "Template talk:quote-book#Some suggested changes". Could you replace the contents of {{quote-book/source}} at {{quote-book/source/sandbox}}? Thanks. Smuconlaw (talk) 17:26, 3 September 2015 (UTC)Reply

Done and I left a slightly longer comment on that talk page. - -sche (discuss) 23:58, 3 September 2015 (UTC)Reply

pl-decl-phrase

Latest comment: 8 years ago2 comments2 people in discussion

This is actually wrong. See the documentation for {{pl-decl-phrase}}. I realize that this interface is somewhat hacky, but I could not find a different way to pass keyword parameters to the declension patterns. --Tweenk (talk) 22:31, 3 September 2015 (UTC)Reply

Oh, OK. At the time I made that edit, the template was just a big module error, and my edit (upon preview) made it resolve into a normal-looking table, so I figured the exclamation marks were an odd typo. - -sche (discuss) 23:53, 3 September 2015 (UTC)Reply

German capitalisation

Latest comment: 8 years ago11 comments5 people in discussion

Isn’t it about time for some archiving?

Anyway, could you please tell me if German always had the ‘capitalise all nouns’ rule? --Romanophile (talk) 03:25, 7 September 2015 (UTC)Reply

Yeah, you're right, I need to archive.

No, German and its predecessors (Old/Middle High German) didn't always capitalize nouns. In the medieval period, capitals were generally only used at the beginning of sentences. Even after capitalization of nouns and names became standard in the Baroque period, some authorities (such as the Brothers Grimm, authors of the major Deutsches Wörterbuch) were opposed to it and persisted in writing in minuscule.

- -sche (discuss) 20:46, 7 September 2015 (UTC)Reply

So, would it be permitted to include minuscule forms as obsolete forms? --Romanophile (talk) 21:11, 7 September 2015 (UTC)Reply

I think that would be a bad idea, since the difference isn't specific to the word, but a general rule. You would end up with a lowercase entry for just about every noun attested before a certain date, which would be about as useful as a entries for italicized or underlined forms. Chuck Entz (talk) 21:26, 7 September 2015 (UTC)Reply

Okay, fair enough. But what if the word is not attested in a capital form? Do we capitalise it anyway? --Romanophile (talk) 21:55, 7 September 2015 (UTC)Reply

I would. Otherwise, you imply that there's some inherent difference from all the other nouns which were also lowercase back then. Of course, Middle High German and Old High German would be uppercase or lowercase by their own rules, since we consider them separate languages. Chuck Entz (talk) 22:19, 7 September 2015 (UTC)Reply

I agree with Chuck. We do similarly for English: old capitalized Nouns don't have Entries, and we've tended not to capitalize common Nouns even if they're more common in old Works in capitalized Form, although there are a few Exceptions (like Admiraless, which I only just moved). - -sche (discuss) 22:29, 7 September 2015 (UTC)Reply

"We do similarly for English: old capitalized Nouns don't have Entries" -- Does that mean there shouldn't be capitalized entries or does it simply mean that they're missing? Also what's in case of other European languages, like Latin and Danish in which nouns were also (sometimes) capitalized? If capitalized spellings are discrimited against, shouldn't there at least be some note somewhere? For example, there could be a page somewhere explaining English habits, like explainining differences between US English and UK English and explaining English capitalization habits. If a single page would be too long, there could be sub-pages like "English habits/Dialects" and "English habits/Capitalisation". -84.161.31.53 12:31, 25 October 2015 (UTC)Reply

Wiktionary has decided to exclude old Capitalizations of ordinary Nouns as a Matter of Course, along with sentence-initial Capitals and all-caps (the usual Examples cited in Discussions are "The" and "THE", Variants of "the"), long-s, and various typographic Literatures (e.g. Talk:ﬁsherwoman). I proposed last Year that we should write these Exclusions down in some central Place, but nothing happened; perhaps I'll suggest it anew.

Wiktionary:About Latin#Orthography_for_Latin_entries documents how we handle Latin, although some Things (like that we don't include "EQVVS") seem to be so basic that they're not spelled out but only implied by e.g. the Note that the Form which we do have an Entry for is "equus".

I don't recall if we've discussed old Danish Capitalization or not, but I see no Reason it wouldn't be handled like old English Capitalization. - -sche (discuss) 19:10, 25 October 2015 (UTC)Reply

Can the descisions be found somewhere? The exclusion of sentence-inital capitals and modern all-caps and typographic ligatures makes sense. But in case of capitalised nouns and normal antique Latin forms in all caps the exclusion is doubtful.

In case of long-s the exclusion would even be against Wiktionary's aim "to describe all words". While it's easy to change "winter" into "Winter" when one knows that "winter"/"Winter" is noun, it's not easy to change some s into long-s. In some cases, it's more like impossible to know where long-s's are put, if one doesn't know the rules concerning long-s's. (Simplified basic rules like "s" is used at a word's end and long-s is used elsewhere often are incorrect.)

Also old Latin abbreviations like "IMP" for "IMPERATOR" can be found in (special) dictionaries and can't be changed into a pseudo-modern spelling like "Imp" or "imp", because a modern abbreviation would be "imp." or "Imp." which is another word as it's written with a dot.

And in some of these cases, I have doubts whether descisions were made or not, or whether they were real descisions and not just some uttered opinions somewhere. For example, it's possible that nobody thought of old Latin abbreviations like "IMP" and thus no descision was made.

Also, what's in case of Modern Greek? The about page clearly states "This is a draft under discussion.". On the discussion page Katharevousa forms (Modern Greek spelled with diacritics etc.) are mentioned and some Katharevousa words have own entries (e.g. καρβονικόν). But the about page and the discussions don't clearify where to mention Katharevousa forms. Is e.g. καρβονικόν a related word, a synonym or an alternative form of καρβονικό? (Comparing it with other languages, like English prae- and pre-, Katharevousa forms should be alternative forms.) What about καρβονικός and it's declension, where should the neuter Katharevousa form καρβονικόν be mentioned? Under alternative forms with the addition "neuter", in the declension section, in the header?

Other questions in some way related to this:

* In case of Wiktionary:About Latin it seems that some things weren't discussed - at least not at Wiktionary talk:About Latin - but rather made up by some authors. E.g. in case of the edit from 27 November 2011 with the comment "→‎Quotations: Adds rule for marks over final a for disambiguation of ablatives from nominatives", it doesn't seem that there was a discussion. On the talk page there was no edit around that time and the author of that change didn't make a change which would indicate a discussion about it. (There was a discussion with another topic at "Wiktionary:Grease pit" and a discussion on his talk page which he commented. But both wouldn't be fair places for a discussions of his edit.)

* The About Latin page states that words with j should point to word with i. But what's if only a form with j is attested? Well, that doesn't mean that the form with i is not attestable, but when it's not attested (no one found a quote), then the word with j can't point to a form with i. Well, at least not, if one doesn't make up words and words forms.

* What's if there is a term without an English translation? E.g. Swedish "tankstreck" and German "Gedankenstrich" refer to the mark "–", but usually just when used in certain contexts and sometimes not it's not restricted to that smaller dash but might also refer to "—". Thus, both terms do not belong into a translations of "dash" or "en dash". But still it would be nice to see translations for these terms. The current practice is like this: The words are incorrectly given as translations of an English word or there is no translation section with words like that. Possible 'solutions' which should be better: (a) One could mention "tankstreck" under the Etymology of German "Gedankenstrich" and vice verse, as they are formed similary. (b) One could create an translation template like "template:translation - thoughts stroke" which than can be embedded in the entries of the foreign words.

So regarding your old, unanswered question at "Beer parlour": Those descisions should be collected somewhere. Also, maybe those descisions should be checked whether they still make sense or not. One could also check if all so-called descisions really were descisions. E.g. a user once wrote that as far as he knows About Latin is rather a collections of ideas than of actual rules. The part, "... think tank, working to develop a formal policy.", should support his attitude.

-84.161.2.62 20:08, 27 October 2015 (UTC)Reply

Questions about Wiktionary's policies towards Latin should be directed towards Latin-speaking and Latin-editing editors on WT:T:ALA. Likewise, questions about Ancient Greek should be directed to WT:T:AGRC; people there are more qualified than I am to tell you about Katharevousa. I've started a BP thread about long s and ligatures: Wiktionary:Beer parlour/2015/October#Documenting_how_to_handle_long_s_and_ligatures. - -sche (discuss) 02:20, 28 October 2015 (UTC)Reply

Broken usage tracking in MediaWiki:Gadget-RegexMenuFramework.js

Latest comment: 8 years ago1 comment1 person in discussion

Hello -sche. You changed a link in MediaWiki:Gadget-RegexMenuFramework.js to remove it from Category:Pages with broken file links. The broken page links are a common hack to track global usage via Special:GlobalUsage or the usage tool. Unfortunately your change broke the tracking, so the page will no longer receive maintenance updates as needed. Would you consider excluding JavaScript pages from Category:Pages with broken file links instead? (I can put together the code to do that via MediaWiki:Broken-file-category for you.) —Pathoschild (talk) 02:59, 11 September 2015 (UTC)Reply

fickern seems to have become an autonym...

Latest comment: 8 years ago2 comments2 people in discussion

Rather than create Category:Palatine German and Category:Kölsch German, I wanted to instead fix this entry, which is the sole entry in both of those- but I don't know enough about either language to do it even half right. I suspect you'll also want to remove some things from Module:labels/data. Thanks! Chuck Entz (talk) 04:40, 11 September 2015 (UTC)Reply

The labels in the module are largely OK, because there are (almost certainly) terms used in standard German which are specific to the Palatinate / Köln, although the details of labels like those are under discussion on the module's talk page. This entry, on the other hand, is odd... the Pfälzisches Wörterbuch only has "fickeln" and "ficken"; the Rheinisches Wörterbuch doesn't have this sense; and Google Books hits all seem to be scannos or the noun. Even raw Google hits for "zu fickern" are mostly Google Books scannos. - -sche (discuss) 21:49, 11 September 2015 (UTC)Reply

Knabe

Latest comment: 8 years ago2 comments1 person in discussion

Could you take a look at Talk:Knabe. DCDuring TALK 12:16, 22 September 2015 (UTC)Reply

Thanks for helping me with this sweet memory of my deceased parents. DCDuring TALK 06:46, 9 October 2015 (UTC)Reply

American black bear

Latest comment: 8 years ago13 comments5 people in discussion

Do you know what language "Dene" refers to here? DTLHS (talk) 18:08, 8 October 2015 (UTC)Reply

If you go to WT:LOL and press Ctrl+F and type "Dene", you will find that the Chipewyan language (code chp) has "Dene" as one of its alternative names. --Wiki Tiki 89 18:16, 8 October 2015 (UTC)Reply

That's true, but "Dene" can also refer to a whole family of languages, so I don't know what was meant. DTLHS (talk) 18:38, 8 October 2015 (UTC)Reply

You seem to be right. In WT:LOF, all I see is "Na-Dene", not "Dene", but the Wikipedia page on Na-Dene languages mentions that there the "Athabaskan" family can also be called "Dene". Anyway, the Chipewyan language is in the North Athabaskan family, which is in the Athabaskan family. Anyway, Chipewyan is the only single language I can find that goes by the name "Dene" and the Wikipedia page on the Chipewyan language says "Most Chipewyan people now use Dene and Dënesųłiné to refer to themselves and their language, respectively." Based on all this, I think Chipewyan is the correct choice. If it turns out to be wrong, it would be within our expected margin of error and we would know it's in the same family of languages anyway, so the actual Chipewyan would be similar enough. --Wiki Tiki 89 18:56, 8 October 2015 (UTC)Reply

That's a reasonable assumption, although in this case I think the rug is pulled out from under it because the gloss (=the claim that tsah means "black bear") seems to be mistaken. Desjarlais gives sas as the Dënesųłiné (Chipewyan) word for "bear", and an old article in the Transactions of the Canadian Institute clarifies the species by saying [in old orthography] "the "Déné word for Black Bear is s̀əs or s̀as according to the dialect". For comparison, Hargus gives səs as the word for "black bear" in either Sekani or Babine-Witsuwit'en — without reading her whole chapter I can't tell which — and Krauss gives x̯ešʷ as the Proto-Athabaskan word for "black bear". Whereas, Desjarlais says tsá is the word for "beaver", and Morritt citing Haas agrees (compare Sekani tsàʔ and Slave tsáʔ, both "beaver"). - -sche (discuss) 21:44, 8 October 2015 (UTC)Reply

Historically, brown bear species almost certainly ranged over the lands of the Chipewyans. Is there a term that included brown bear? DCDuring TALK 00:14, 9 October 2015 (UTC)Reply

I can't find a Chipewyan term for "brown bear", although I can find sources which gloss sas as just "bear", so it may have functioned as a generic term. I can find the term in other languages: Ruhlen has Haida xúuts "brown bear", Tlingit xúts (= /xúːc/, also written xoots) "brown bear" (/"grizzly bear"), Tsetsaut xɔ "grizzly bear". Athabaskan languages and the schools: a handbook for teachers (1984) notes "in Kutchin, shih means 'brown bear' but shìh (with lowered tone) means 'food', and these words are not grammatically or etymologically related." The Proto-Athabaskan term for "brown bear" was x̯...c per Krauss (he is unsure of the middle vowel). - -sche (discuss) 01:21, 9 October 2015 (UTC)Reply

The problem is that the w:Na-Dene languages are called that because some variation of "dene" means "people" in the vast majority of at least the Athabaskan languages. More often than not, the word for "people" gets used in the language name (at least the one native speakers use for their own language), so there could be a number of candidates. The Chipewyan term is pretty close, so it would make sense to concentrate on that part of Northern Athabaskan. Or, better yet, get @DCDuring to tell us what source he used for his mass addition of American Indian translations to that page, and we might be able to figure it out that way. Given the rather poor understanding of American Indian languages and their orthography in most general sources, I'm not so sure that was a good idea. Off the top of my head, the Hopi looks plausible based on what I know of other Northern Uto-Aztecan languages, and the Southern Uto-Aztecan ones all seem to use reflexes of the same ancestral form, which is a good sign, but "close" isn't close enough for dictionary purposes. Chuck Entz (talk) 03:41, 9 October 2015 (UTC)Reply

Why is that a "problem"? --Wiki Tiki 89 19:22, 9 October 2015 (UTC)Reply

It makes it difficult to tell which language a work that refers to "Dene" is referring to. Indeed, older generalist works (as they tend to do with a lot of languages, e.g. also Great Russian) often impressionistically consider whole swathes of the Dene family to be a "Dene" language divided into e.g. Northern and Southern dialects. - -sche (discuss) 20:30, 9 October 2015 (UTC)Reply

Ursus americanus on Wikispecies.Wikispecies DCDuring TALK 06:05, 9 October 2015 (UTC)Reply
An additional resource for this kind of thing is Online American Indian Picture Dictionaries (Native American Animals).
Do such sources not meet our standards for introducing terms to be checked? DCDuring TALK 07:08, 9 October 2015 (UTC)Reply
native-languages.org is an outstanding resource, though its spellings/orthographies are often odd. I always check its claims against other references. It is helpful for knowing what to look for, though — it's easier to find a specialist source confirming that sas is Chipewyan for "bear" (google books:Chipewyan "sas" bear) than it is to track down a book which gives the Chipewyan translation for "bear" as opposed to just mentioning Chipewyans and bears (google books:Chipewyan bear). - -sche (discuss) 20:30, 9 October 2015 (UTC)Reply

Sänger

Latest comment: 8 years ago2 comments2 people in discussion

You removed "songster" from Sänger. But than shouldn't "songstress" be removed from Sängerin too, or shouldn't it be replaced with "singeress" ("female person who sings" instead of "female person who sings (songs)")? -84.161.31.53 12:23, 25 October 2015 (UTC)Reply

Thanks for catching that. Yes, it's sufficient for Sängerin to say "female singer", IMO. If I heard someone say "singeress" it'd be a dead giveaway that English wasn't their native language. - -sche (discuss) 18:45, 25 October 2015 (UTC)Reply

Request for Zipser German Granted

Latest comment: 8 years ago5 comments2 people in discussion

User -sche, þy wish haþ been granted. See here. --Lo Ximiendo (talk) 18:52, 25 October 2015 (UTC)Reply

Þanks! - -sche (discuss) 19:18, 25 October 2015 (UTC)Reply

I also added few words of Sathmar Swabian and Silesian German. --Lo Ximiendo (talk) 19:38, 26 October 2015 (UTC)Reply

Great! Wiktionary's coverage of Germanic languages is slowly increasing. - -sche (discuss) 08:05, 27 October 2015 (UTC)Reply

I added the white and yellow flag for the language header of Silesian German. --Lo Ximiendo (talk) 06:46, 30 October 2015 (UTC)Reply

Berliner

Latest comment: 8 years ago1 comment1 person in discussion

If that's "nonstandard", then please fix it. It's simply a fact, that there are two opinions about the part of speech:

Some say that Berliner and similar words are adjectives. This is also supported by dated spellings like berliner.
Some say that Berliner etc. are nouns in gentive plural: der Berliner, gen. pl. der Berliner - so Berliner Mauer literally means "Wall of the Berliners". This is also supported by German spelling rules: nouns begin with a capital letter, adjectives not (nominalised adjectives aren't adjectives anymore, but nouns too).

-84.161.48.172 18:24, 27 October 2015 (UTC)Reply

transwomyn

Latest comment: 8 years ago2 comments2 people in discussion

What’s wrong with the samples on Google Groups? --Romanophile ♞ (contributions) 20:56, 29 October 2015 (UTC)Reply

Oh, there are some, that's great! They weren't there when I searched back in 2013, which is odd, since the posts were made before 2013... but Cloodcuckoolander (I think) has remarked upon how oddly unreliable Google's Groups search is. Thanks for revisiting the entry / noticing. (I have a short list of entries that just need one more citation that I check up on periodically, but it's woefully incomplete.) I'll turn it into an entry. - -sche (discuss) 05:06, 30 October 2015 (UTC)Reply

Template:U:de:dass

Latest comment: 8 years ago3 comments3 people in discussion

This rollback is in error. As said before (cf. revision history): google doesn't differ between ſ and s, so antiqua "daſs" (around 1871-1902) incorrectly becomes "dass" by google and thus ngram is no reliable source. Maybe in case you don't know: daſs is not the same as dass, but an alternative form of daß used in antiqua when ß was not available (this usage was deprecated in some spelling rules). daſs could also be a Heyse spelling of daß, but then Heyse's spellings (including his antiqua spellings) are (said to be) different from 20th/21st century spellings as Heyse also used ſ in antiqua (rules from 1902 should deprecate the use of ſ in antiqua and only allow s and ß, which also holds for the 1996 reform though the use of ß and s were changed).
Also: daſs is an alternative form of daß/dass, older (antiqua) spellings with ſ can't easily be derived from modern antiqua spellings and there's no bijection between older (antiqua) spellings and modern antiqua spellings, e.g. both Wachſtube (Wach-stube) and Wachstube (Wachs-tube) become Wachstube in antiqua without long s. Thus it makes sense to add spellings with ſ. (It maybe makes no sense to have an own entry for it as it's not easy to input the character ſ, as most users don't know the difference between ſ and s, and as ſ and s might be similar in case of encoding etc., but there was no link anyway.)
P.S.: In the German spelling rules (Berlin, 1908) it is: "In lateiniſcher Schrift ſteht s für ſ und s, ss für ſſ, ß (besser als ſs) für ß, für ß tritt in großer Schrift sz ein, z. B. MASZE (Maße), aber MASSE (Masse)." (antiqua ß and fraktur ß actually look differently in the text and the text itself is printed in fraktur). In early Duden editions (late 19th century) it was: "Zu merken iſt, daß man in lateiniſscher Schrift s für ſ und s ohne Unterschied, ss für ſſ und ſs für ß anwendet. Statt ſs ist auch ß zulässig." So, daſs which can be found in antiqua texts from ca. 1871 till 1902 and is OCRed as "dass" by google is an alternative form of "daß" which was more common in antiqua after 1902.
daſs was also a real Heyse spelling. But I'm not sure whether it was used in fraktur or in antiqua. If it was used in antiua than it's obviously different from dass. If it was used in fraktur, then the traditional fraktur-antiqua transcription rules from early Duden editions and the rules from 1902 could say that it has to be transcribed as dass, but even than it's a different form as it's transcribed. But it's very likely that Heyse's spellings were not as common as Adelung's spellings.
Anyway, as google doesn't differ between ſ and s (and between fraktur and antiqua), it can't be used to cite a statement like "dass was more common than daß in 1871-1902". In case of 1950 till nowaydays, ngram maybe can be used as the nazis banned fraktur and it never became popular again and as ſ became unpopular in antiqua (cf. traditional spelling rules from 1902 and reformed spellings rules from 1996).
-84.161.28.37 14:18, 6 November 2015 (UTC) and 14:40, 6 November 2015 (UTC), P.S. 17:47, 6 November 2015 (UTC)Reply

Where is your evidence that daſs should count as daß and not "dass"? To the extent that "ess-zett" is treated as a separate thing from "two esses", "ſs" is two unligatured esses, one long and one short according to the usual (translingual) rule of long- vs short-s placement. - -sche (discuss) 21:16, 6 November 2015 (UTC)Reply

Older Duden editions and German spelling rules from 1902 (see above). Both state that fraktur ß (actually more like a ligature of ſz) can be written as ſs in antiqua ("ſs für ß" and "ß (besser als ſs) für ß"). So antiqua daſs can be and often was an alternative form of daß and not of dass.
Without Duden and the German spelling rules (which say that in antiqua s is used instead of single fraktur ſ), antiqua daſs would still be another form than dass. That is, one would have to differ between three forms in antiqua: daß (traditional spelling, also prefered by the 1902 orthography), daſs (older antiqua spelling), dass (1996 reform spelling). It could be, and shouldn't be unlikely, that authors who used daſs (which could also be a Heyse spelling used in antiqua) would prefer daß over dass if they could only choose between these two forms. In case of the real Heyse spelling, at least the one used in fraktur, many arguments used against the 1996 reform spelling are invalid, e.g. sss shouldn't occur in a real Heyse spelling used in fraktur, and maybe in antiqua too.
daſs (not daſz (= daß)) in fraktur by Heyse could be dass in antiqua. But: 1. The older Duden and the spelling rules from 1902 can't be used to derive that spelling, as fraktur daſs is an incorrect form in the beginning. So, some other source is needed that says that Heyses fraktur daſs can be an antiqua dass, as it could also be that his correct antiqua form would be daſs too or that he proscribed the use of antiqua. 2. It's more likely that Heyse's spelling was rarer anway, as it was younger (Adelung came before him), was depracted in several German countries (e.g. in Prussia) and as it wasn't used in the 1902 orthography (if dass was prefered in 1871-1902, than it should be more likely that that spelling would be used in the 1902 orthography). 3. As google doesn't differ between ſ and s and between fraktur and antiqua, it is no reliable source. And to interpret google's ngram or google's books would be OR too.

(Regarding the quotes: It's hard to quote a fraktur text which differs between fraktur and antiqua in an antiqua text. Maybe it would be better with pseudo-HTML like "<antiqua>ſs</antiqua> can be used for <fraktur>ß</fraktur>", but maybe that would be harder to read.)

-84.161.28.58 12:39, 7 November 2015 (UTC)Reply

Neger

Discussion moved to Wiktionary:Tea room/2015/November#Neger.

(let's try to keep the discussion in one or two places rather than three)

Dative -e in German strong declensions

Discussion moved to Template talk:de-decl-noun-n.

German ordinal numbers

Latest comment: 8 years ago5 comments3 people in discussion

Presently their lemmas are the forms in -e. Our general practice, I think, is to put adjectives without a bare form at -er. The ordinal numbers do have a bare form, which is used with zu: zu siebt, zu acht. But these seems to be separate idioms. So I guess -er would be the right place. And if you agree: Should I move them manually, or is there a better way? Kolmiel (talk) 14:22, 9 December 2015 (UTC)Reply

I think you are correct about the general practice (or more accurately, the general desire — in practice a lot of entries were created at the wrong title and still need to be moved), which also applies to substantivized adjectives. But about this particular set of entries... why do de.Wikt and the Duden, which do lemmatize e.g. Verletzter m rather than Verletzte m (cf. this thread), both lemmatize siebte ([13]) rather than siebter ([14])? (The DWDS seems divided on the matter; there's no entry found if you search for "siebter", but if you search for "siebte", the DWDS-Wörterbuch entry lemmatizes that form, while the Etymologisches Wörterbuch entry that comes up lemmatizes the form that ends in r.) Do you know if there's any logic behind lemmatizing the -e forms? If not, then yes, for consistency they could be moved. I suppose AutoWikiBrowser could be used to speed up the process somewhat. - -sche (discuss) 01:03, 10 December 2015 (UTC)Reply

I think Duden might lemmatize the er-form only in nouns, but the e-form in adjectives. For example "oberer" is given "obere, oberer, oberes" at Duden.de. Otherwise I don't think there's a special reason concerning ordinal numbers, except possibly that these are often preceded by the definite article. But that's true of others as well, and the er-forms of ordinal numbers do occur and aren't particularly rare at all (ein zweiter Versuch, zehnter Dezember). So I think they should be moved. Kolmiel (talk) 14:19, 10 December 2015 (UTC)Reply

OK, I will find time to move and standardize most of them in AWB if there are too many for us to do by hand. AFAICT (from the category and the bluelinks in siebte) we're dealing with <50 entries, right? There's a lot of inconsistency in what part of speech the lemmas and non-lemmas use in their headers and headword-lines; achtzehnte uses 'ordinal number' and zweiter uses 'numeral', but siebte uses 'adjective'; siebtes uses 'adjective form', so I tentatively just appended 'form' to the headword line of neunzehnte and got 'numeral form'. They should all be 'adjectives' (the lemmas) and 'adjective forms' (the inflected forms), right? (This needs to be sorted out regardless of which forms we lemmatize.) Also pinging @CodeCat, who has helped sort out Wiktionary's labelling of numerals vs numbers vs adjectives. (The Duden goes with "Zahlwort"; de.Wikt with the double header "Adjektiv, Numerale".) - -sche (discuss) 22:14, 11 December 2015 (UTC)Reply

These are straightforwardly adjectives. "Numeral" is a special kind of part of speech whose definition I'm still not sure of, but see w:Numeral (linguistics). That page mentions in particular that "not all words for cardinal numbers are necessarily numerals", so not everything with a number meaning is, part-of-speech wise, a numeral. That's why we have Category:German numbers, which exists outside the POS category tree. In fact, I believe numerals are a kind of determiner, closely allied to non-cardinal quantifiers like "all", "some" or "no". —CodeCa t 22:50, 11 December 2015 (UTC)Reply

Old Picard

Latest comment: 8 years ago8 comments3 people in discussion

Should we have a language code, or at least an etymology-only code, for Old Picard? Otherwise, I assume it is currently treated as a dialect of Old French, and without an etymology code, I had to use some awkward phrasing at Rosine#Etymology. --Wiki Tiki 89 19:37, 28 December 2015 (UTC)Reply

I do find references (more than to "Old Italian"!) to Old Picard translations of texts, and to Old Picard words — including in dictionaries that give Old Picard words as etyma. Let's give it an etymology-only code, so that those etymologies which need to can cite it. Distinguishing it in general from Old French and Old Northern French might be messy, so I wouldn't grant it a full code and its own language sections until such time as someone makes a case that it needs/merits that. Based on "fro-nor", I guess the thing to add to Module:etymology languages/data would be "fro-opc". - -sche (discuss) 03:40, 29 December 2015 (UTC)Reply

Maybe "fro-pic" instead? The oldness is already implied by "fro-", and we do have "fro-nor" rather than "fro-onr". --Wiki Tiki 89 16:46, 29 December 2015 (UTC)Reply

Sure. - -sche (discuss) 18:44, 29 December 2015 (UTC)Reply

Maybe we should be consistent and use the language code for modern Picard: fro-pcd. Of course, we have fro-nor, instead of fro-nrf, so maybe I'm just playing host to the "hobgoblin of little minds". Chuck Entz (talk) 02:45, 30 December 2015 (UTC)Reply

I thought about that, but then I wondered if using the modern language's code for that element would suggest that this was another code for the same thing. "de-AT" is (a variety of) the same language as "de", whereas "fro-pic" is not "pcd" but rather (a variety of) "fro". - -sche (discuss) 05:47, 30 December 2015 (UTC)Reply

I already went with "fro-pic", I figured if there was a strong enough argument to change it, I would, but I don't see such a strong enough argument. --Wiki Tiki 89 17:01, 30 December 2015 (UTC)Reply

Gosh! And here I was expecting a discussion of one of the roles Patrick Stewart played in the Star Trek: The Next Generation series finale. We could have split up into Team Middle-Aged Picard and Team Old Picard a la the Twilight Saga. Oh well...

Seriously, though, I seem to remember reading somewhere that a few of the "Normans" that invaded England were really Picards, and that there are traces of Old Picard in English. Chuck Entz (talk) 07:25, 29 December 2015 (UTC)Reply

Add topic