

Welcome to the Beer Parlour! This is the place where many a historic decision has been made, and where important discussions are being held daily. If you have a question about fundamental aspects of Wiktionary—that is, about policies, proposals and other community-wide features—please place it at the bottom of the list below (click on Start a new discussion), and it will be considered. Please keep in mind the rules of discussion: remain civil, don’t make personal attacks, don’t change other people’s posts, and sign your comments with four tildes (~~~~), which produces your name with timestamp. Also keep in mind the purpose of this page and consider before posting here whether one of our other discussion rooms may be a more appropriate venue for your questions or concerns.

Sometimes discussions started here are moved to other pages for further development. In particular, changes to a major policy or guideline may be discussed on the corresponding talk page and “simple votes” (as opposed to drawn-out discussions) can be conducted on our votes page.

Questions and answers typically remain visible on this page for one to two months, but they can always be found in the appropriate monthly archive (based on the date the discussion was initiated). While we make a point of preserving all discussions that were started here, talk that is clearly not appropriate for this page may be deleted. Enjoy the Beer parlour!


May 2024

Arabic and Hebrew transliteration


Wiktionary currently transliterates the glottal stop in both Arabic and Hebrew as ʔ and the voiced pharyngeal fricative in both languages as ʕ. Would it be possible to correct these to respectively transliterate the glottal stop as ʾ and the voiced pharyngeal fricative as ʿ so they would be in line with Wiktionary's transliteration of other Semitic languages, which all use ʾ and ʿ?

Wiktionary also currently transliterates the Arabic voiceless velar fricative as ḵ. However, an alternate transliteration as ḫ is also used for this sound. Since ḫ is used for the transliteration of voiceless velar fricative for most Semitic languages except for Hebrew and Aramaic, I would like to request that Wiktionary's transliteration of the Arabic voiceless velar fricative be changed from ḵ to ḫ as well. Antiquistik (talk) 13:08, 1 May 2024 (UTC)Reply

@Antiquistik: No, we switched the other day. As for ḵ to ḫ, I don’t know; perhaps it’s better if you want to make an etymological statement that ḵ is a fricativized k, which we keep in begadkefat-affected languages, while ḫ is organic. Fay Freak (talk) 18:08, 1 May 2024 (UTC)

@Fay Freak In this case, I will add my opposition to the discussion regarding that change.

Concerning ḵ to ḫ, should I make another request, or should I add it to this one itself? Antiquistik (talk) 18:55, 1 May 2024 (UTC)Reply

@Antiquistik IMO the opposite change should happen and other Semitic languages should use ʔ and ʕ. The problem with the forward and backward quotes is that they're too small and too easily confused in many fonts. I also think ḵ is better than ḫ; ḫ is easily confused with the pharyngeal fricative. Benwing2 (talk) 23:49, 1 May 2024 (UTC)Reply

Personally, I agree with Benwing, although I am sympathetic to the idea that we should use whatever is most widely used, and I am also sensitive to the issue of words being findable by people who search for them using other transliteration systems. I would like us to implement having the templates/modules produce (but then potentially set to be invisible / display:none by default) other common transliterations so the entries can be found if people use our site search or Google and search for ʾiʿlān etc, as discussed in the 2022 discussion, unless that would cause problems. Then we could probably also set different CSS classes for the different transliterations so people could select whether they see ʾiʿlān or ʔiʕlān, similar to the way people can choose to see or not see {{,}} (and we could debate which one would be most helpful to have on by default for the average lay reader). - -sche (discuss) 02:33, 2 May 2024 (UTC)Reply

@-sche I think this is a good idea. AFAICT it would require some changes to Module:languages (which handles transliteration) so that a given transliteration method can return multiple transliterations rather than just one, each transliteration associated with properties such as CSS class, with one of them identified as "canonical" (meaning it is displayed while the others aren't). The only tricky thing here is manual transliterations; ideally, there would be a method to convert a manual transliteration in the canonical system into each of the other systems, so that users have to specify only one transliteration rather than multiple. In the examples here, that conversion isn't hard, but sometimes it may not be possible (e.g. the current Hebrew transliterations are based on modern Hebrew pronunciation, which has several mergers compared with Biblical Hebrew, so we couldn't convert modern to Biblical Hebrew transliterations). Benwing2 (talk) 02:45, 2 May 2024 (UTC)
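To illustrate the proposal above, here is a minimal sketch of a transliteration step that returns several schemes at once, each tagged with a CSS class and with exactly one marked canonical. It is written in Python purely for illustration (Module:languages itself is Lua), and every name in it (`Translit`, `all_translits`, `to_html`, the class names `tr-ipa-signs` and `tr-half-rings`) is hypothetical rather than part of any existing module. It assumes the simple case mentioned in the comment, where the other scheme can be derived mechanically from the canonical one.

```python
from __future__ import annotations

from dataclasses import dataclass
from html import escape

# Assumed sign correspondences between the two schemes discussed above.
IPA_TO_HALF_RINGS = str.maketrans({"ʔ": "ʾ", "ʕ": "ʿ"})


@dataclass(frozen=True)
class Translit:
    text: str
    css_class: str   # hypothetical class name, so readers could toggle schemes
    canonical: bool  # only the canonical scheme is displayed by default


def all_translits(canonical_text: str) -> list[Translit]:
    """Derive every scheme from one canonical (manual or automatic) transliteration."""
    return [
        Translit(canonical_text, "tr-ipa-signs", True),
        Translit(canonical_text.translate(IPA_TO_HALF_RINGS), "tr-half-rings", False),
    ]


def to_html(translits: list[Translit]) -> str:
    """Emit all schemes; non-canonical ones are hidden but still present in the page text."""
    spans = []
    for t in translits:
        hidden = "" if t.canonical else ' style="display:none"'
        spans.append(f'<span class="tr {t.css_class}"{hidden}>{escape(t.text)}</span>')
    return "".join(spans)


forms = all_translits("ʔiʕlān")
print(forms[1].text)   # ʾiʿlān
print(to_html(forms))
```

A one-way character map like this only works where the schemes differ sign by sign; it would not cover cases such as converting modern to Biblical Hebrew transliterations, where the canonical form has already lost distinctions.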

@Benwing2: I believe that some of the existing manual transliteration entries may need to be reviewed in order to see whether their use was actually justified in the first place. Some of them are there only to work around various technical issues that have since ceased to exist. For example, this manually added transliteration for a Belarusian quotation became unnecessary after this fix. And I definitely support the idea of having multiple transliteration schemas, because this would allow introducing Belarusian Łacinka in addition to the current WT:BE TR scholarly transliteration. As @-sche mentioned, the primary motivation is that words should preferably be searchable via Google or via the search box from the Wiktionary front page. Belarusian entries currently solve the searchability problem via manually added "Alternative forms" sections with red links, but this isn't ideal. So the proposed improvement has uses even beyond Arabic and Hebrew. --Ssvb (talk) 16:41, 2 May 2024 (UTC)

Yes, I'm also in favour of having multiple transliteration schemes for this reason. Theknightwho (talk) 11:44, 8 May 2024 (UTC)Reply

@-sche This is a good proposal.

@Benwing2 I understand that ʔ and ʕ are more visible than the small half-rings, but I question how useful using them would be for the average reader since they are barely used in current transliteration schemes. If it hinders readers' ability to find these entries, we should avoid using them. Additionally, when is ḫ confused with the pharyngeal fricative? Antiquistik (talk) 05:42, 2 May 2024 (UTC)Reply

@Antiquistik I'm not sure what you mean by "barely used in current transliteration schemes". Are you referring to transliteration schemes outside of Wiktionary? If so, why do you think the average reader will be familiar with them, but won't be familiar with IPA? As for using ḫ, my point is that this is easily confused with ḥ (the transliteration for the pharyngeal fricative), and having all three of h ḫ ḥ is going to make for endless confusion. Benwing2 (talk) 05:47, 2 May 2024 (UTC)

@Benwing2 While I don't think that the average reader will be more familiar with the IPA signs, I doubt that they will be searching Arabic terms with signs from the current standard transliteration schemes substituted by IPA signs that are rarely used for Arabic transliteration.

And, as pointed out by @Ssvb, the entries need to be searchable. Using the more widely employed transliteration is the better option for this.

As for the transliteration of /x/, I strongly disagree with your position. The transliterations for other Afroasiatic languages like Old South Arabian, Ugaritic and Ancient Egyptian use both ḫ and ḥ without any problem, and I don't see why the organic /x/ in Arabic should be represented through a character used for sounds affected by begadkefat. Antiquistik (talk) 11:19, 3 May 2024 (UTC)

@Antiquistik: Your premise of the signs being but used in IPA transcriptions before having been adopted by Wiktionary is wrong. We realized that there are lots of linguistic books, more or less traditionally Semitist, with them as their editorial choice for transcription. I have doomsurfed the philologies enough in the last 1½ decade to know that this is by far not so uncommon as to be stunting someone’s dictionary use. I also want to raise your attention towards pertinent languages without native writing system that can only be entered in an academic transcription, the Modern South Arabian languages, which have suffered some variations in transcription styles over the decades and native countries of researchers but I think are amenable as written down at أَيْدَع (ʔaydaʕ), whereas with all their diacritics the rings would strain the readers’ tempers. Fay Freak (talk) 11:37, 3 May 2024 (UTC)Reply

@Fay Freak How prevalent are Arabic transliterations using the IPA signs compared to the half-rings? Antiquistik (talk) 13:06, 3 May 2024 (UTC)Reply

@Antiquistik: No one, or at least not me, can do stats on such a thing. There is also a qualitative difference in the kinds of resources that use them. In purely Semitist sources due to tradition the rings hold their ground. I have clicked around in my Semitics folder for you. I wanted to say that Leonid Kogan uses MODIFIER LETTER GLOTTAL STOP ˀ a lot, which is a bit more conspicuous and between the two extremes, but the second work by him I opened ({{R:tig:Kogan:2011}} after {{R:sem-pro:GC}}) goes the whole hog and uses ʔ for Arabic and the other Semitic languages. {{R:sqt:CSOL}} and {{R:sem-pro:SED}} use ˀ; anything published in the Journal of Semitic Studies, such as →DOI, uses the rings, which we may see as a publisher decision; in more relaxed journal pieces he seems to prefer the IPA letters. In the old and long series Perspectives on Arabic linguistics you get the IPA letters all around. There is a lot of socialization behind letter choices; you just need to get used to them, but not lose aesthetic sense. University docents may teach something specific, but there is a point where one shan’t believe other people. Younglings learn and adults function by imitation, but science by organized skepticism: a dilemma.

The complicated part: I can hold you a lecture on how it has to do with spatial-temporal memory, again the first chapter of the handbook of memory, ASD and the law I mentioned. Everything normal in the head, you guys tribally react to relations previously experienced with and from other people, in spite of the meatspace effecting the worst selection bias, contrary to universalism of science. You underestimate the psychological background behind all this. I did hardly positively respond to what teachers required or expected from me in terms of organizing a treatise, by some internal logics which aren’t strictly rationally evident, writing points of a paper in this and that order and not missing out a super-influential fashionable nonsense in the field I mean, which is detrimental to exams, and self-portrayal in job applications, however exquisitely able to judge the merits of the matter in isolation, and I am now very aware how strong feelings about signs come about, without sustaining them myself. We don’t just count voices together to let the loudest party win, this is not how creating good stuff works, only a working hypothesis. Fay Freak (talk) 14:09, 3 May 2024 (UTC)

@Fay Freak I would still suggest that we should use the most prevalent system, but given your explanation, I am willing to accept the present status quo while still maintaining my opposition to the original change proposal. Antiquistik (talk) 08:28, 26 May 2024 (UTC)

Descendant tree design


Here's my idea for a horizontal tree style that could be generated by {{etymon}}. I've switched up the colour scheme, since this is a descendants tree rather than an etymology tree. We can also include question marks or labels just as in the etymology tree. Let me know what you think! @Vininn126, Equinox, Sławobóg, -sche, 0DF Ioaxxere (talk) 21:24, 1 May 2024 (UTC)Reply

How would you represent borrowings and morphological reshaping in this format? Also I think I prefer Design 2, because in Design 1 the single right-branching node might be interpreted as somehow different from the below-branching nodes (and in addition, in Design 1 someone might e.g. interpret the juncture where Proto-Italic branches off as its own node, a daughter of PIE rather than just an artifact of the design). However, even better than either IMO would be one where the parent is centered vertically among all of its children rather than being at the top. Benwing2 (talk) 02:55, 2 May 2024 (UTC)Reply

@Benwing2: Probably with the same label system that {{etymon}} already uses. I like your idea for centering the node, although for trees with a huge number of lines it might lead to the ultimate ancestor being far down the page. Possibly the ultimate ancestor could be given some kind of special status where it always goes at the top left of the page. Ioaxxere (talk) 05:31, 2 May 2024 (UTC)Reply

I think Design 2 is also my preference, at least on desktop. Vininn126 (talk) 13:25, 3 May 2024 (UTC)Reply
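As an illustration of the vertically centred layout suggested above, here is a minimal Python sketch that places each leaf on its own row and puts every parent midway between its first and last child. The function name `layout` and the `(label, children)` tuple representation are assumptions made for this example only; they are not how {{etymon}} stores or renders its data.

```python
def layout(tree):
    """Return {label: (column, row)} for a horizontal tree.

    `tree` is a (label, children) pair; labels are assumed to be unique.
    Leaves take consecutive rows; a parent sits halfway between its
    first and last child, so it is centred among its descendants.
    """
    positions = {}
    next_row = 0

    def visit(node, depth):
        nonlocal next_row
        label, children = node
        if not children:
            row = float(next_row)
            next_row += 1
        else:
            child_rows = [visit(child, depth + 1) for child in children]
            row = (child_rows[0] + child_rows[-1]) / 2
        positions[label] = (depth, row)
        return row

    visit(tree, 0)
    return positions


# A cut-down version of the mockup: three branches under the PIE root.
mockup = (
    "Proto-Indo-European *ph₂tḗr",
    [
        ("Proto-Germanic *fadēr", []),
        ("Proto-Italic *patēr", []),
        ("Proto-Celtic *ɸatīr", []),
    ],
)
print(layout(mockup)["Proto-Indo-European *ph₂tḗr"])  # (0, 1.0)
```

With this rule the root's row grows with the number of leaves, which is the drawback raised above for very tall trees; pinning the ultimate ancestor to the top left would be a special case on top of this.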

Design 1

Proto-Indo-European *ph₂tḗr	Proto-Germanic *fadēr	Proto-West Germanic *fader	Old English fæder	Middle English fader	English father
					Scots faither
				English faeder
	Proto-Italic *patēr	Latin pater	Old French pere	Middle French pere	French père	English père
			English pater	Tok Pisin pater
	Proto-Celtic *ɸatīr	Old Irish athair	Manx ayr	English ayr

Design 2

Proto-Indo-European *ph₂tḗr	Proto-Germanic *fadēr	Proto-West Germanic *fader	Old English fæder	Middle English fader	English father
					Scots faither
				English faeder
	Proto-Italic *patēr	Latin pater	Old French pere	Middle French pere	French père	English père
			English pater	Tok Pisin pater
	Proto-Celtic *ɸatīr	Old Irish athair	Manx ayr	English ayr

Design 3

Proto-Indo-European *ph₂tḗr	Proto-Germanic *fadēr	Proto-West Germanic *fader	Old English fæder	Middle English fader	English father
					Scots faither
				English faeder
	Proto-Italic *patēr	Latin pater	Old French pere	Middle French pere	French père	English père
			English pater	Tok Pisin pater
	Proto-Celtic *ɸatīr	Old Irish athair	Manx ayr	English ayr

Design 4

Proto-Indo-European *ph₂tḗr	Proto-Germanic *fadēr	Proto-West Germanic *fader	Old English fæder	Middle English fader	English father
					Scots faither
				English faeder
	Proto-Italic *patēr	Latin pater	Old French pere	Middle French pere	French père	English père
			English pater	Tok Pisin pater
	Proto-Celtic *ɸatīr	Old Irish athair	Manx ayr	English ayr

Design 5

Proto-Indo-European *ph₂tḗr
	Proto-Germanic *fadēr
		Proto-West Germanic *fader
			Old English fæder
				Middle English fader
					English father
					Scots faither
				English faeder
	Proto-Italic *patēr
		Latin pater
			Old French pere
				Middle French pere
					French père
						English père
			English pater
				Tok Pisin pater
	Proto-Celtic *ɸatīr
		Old Irish athair
			Manx ayr
				English ayr

At the risk of stating the obvious, only a small fraction of the descendants are being shown here. Is this focussed on English? Nicodene (talk) 21:49, 1 May 2024 (UTC)Reply

@Nicodene: This is just a mockup. I created all the HTML by hand, but the full (automatically-generated) tree will have all the descendants. Ioaxxere (talk) 22:11, 1 May 2024 (UTC)Reply

How would they all fit? Some of the ‘nodes’ have dozens of direct descendants. Nicodene (talk) 22:16, 1 May 2024 (UTC)Reply

@Nicodene: The tree would be extremely tall in that case. Either way, it would still be significantly more readable than something like what we currently have at Reconstruction:Proto-Sino-Tibetan/s-la#Descendants. Ioaxxere (talk) 22:19, 1 May 2024 (UTC)Reply

I have to agree with Nicodene. With etymology trees and the vertical format, it makes more sense to me because the tree will be much more compressed, but for descendants, I can't really see it working as well. It'll get really unwieldy, and fast. The list you've pointed to isn't good either, but I don't like replacing one problem with another one. Looking at the link you've sent, how would this interact with etymology-only languages or the situation with Chinese? AG202 (talk) 03:06, 2 May 2024 (UTC)

Etymology-only languages shouldn't be too difficult to handle in general. For Chinese, I feel like including dozens of dialectal pronunciations in Reconstruction:Proto-Sino-Tibetan/s-la is excessive and we should reduce that to only those forms which were borrowed into other languages. It's also possible that descendants trees will end up having less automation than etymology trees in general. Ioaxxere (talk) 05:31, 2 May 2024 (UTC)Reply

One thing that needs to be addressed is alternative forms. In Middle English, there are loads of them for everything. They can't always be ignored, because there are enough cases like catch and chase from Old French: chacier, chacer (chiefly Anglo-Norman); cachier (northern), flour and flower from Middle English: flour, fflour, fflowr, fleur, flor, floure, flower, flowr, flowre, flowyr, flur or even morrow and morn from Middle English: morwe, morewe, morowe, morow, morrou, morue, morw, morȝe, morewen, morowen, morȝen, morwen, morwyn, morwhen, morwoun, morun, moron, moryn, morn; morgen, marhen, mareȝen, morghen, moruwe, where different alternative forms have different descendants. Chuck Entz (talk) 18:31, 3 May 2024 (UTC)Reply

Love it. After a quick glance at the HTML, is the only difference alignment? Since this could appear early on in a number of entries that have right-floating tables of contents, I think left-alignment makes the most sense to avoid some of the inevitable bunching. —Justin (koavf)❤T☮C☺M☯ 22:14, 1 May 2024 (UTC)

@Koavf: No, the difference is whether there are connectors on the bottom of the boxes. I have no idea why the alignment is different, actually... Ioaxxere (talk) 22:16, 1 May 2024 (UTC)Reply

Ah, I see that now. —Justin (koavf)❤T☮C☺M☯ 22:17, 1 May 2024 (UTC)Reply

@Ioaxxere: Please excuse the lengthy delay in my response. I prefer design 4, with design 3 a close second for me. Thanks for asking. 0DF (talk) 17:11, 21 May 2024 (UTC)

get rid of noun and adjective plural form categories once and for all


There appears to be consensus established here, here and here, as well as in this diff, to not categorize noun and adjective non-lemma forms in separate 'noun plural forms' and 'adjective plural forms' categories. Yet when I made such a change for newly added Chadian Arabic terms, my favorite editor User:Fenakhay went on a revert spree. By longstanding consensus, we do not in general categorize non-lemma forms as e.g. Category:Russian noun prepositional case forms etc., so I don't see why an exception needs to be made for noun plural forms. However, I'd like to get clear consensus here to remove all such categories and delete the entries from Module:category tree/poscatboiler/data/non-lemma forms that allow such categories to be recognized. We have already done this for some languages; for example, there is intentionally no Category:English noun plural forms, and that page is protected against re-creation by bots or non-admins.

The alternative is to outline a clear rationale for why we need such categories and a rule for which situations they are allowed and which situations they aren't allowed. Either way, the current haphazard situation, where some languages have such categories and some don't, and the categories are incomplete, is unmaintainable.

Benwing2 (talk) 23:45, 1 May 2024 (UTC)Reply

And a stronger consensus at Wiktionary:Requests for deletion/Others#Category:Adjective plural forms by language. It seems that Fenakhay is the only editor who supports the retention of these categories. Consensus is against them. This, that and the other (talk) 02:55, 2 May 2024 (UTC)Reply

I support getting rid - trivial category intersections like this are a waste of time. Theknightwho (talk) 03:24, 2 May 2024 (UTC)Reply

I don't see any rationale for this kind of category either and so am in favour of deleting them. Nicodene (talk) 14:13, 2 May 2024 (UTC)Reply

I agree as well. Ioaxxere (talk) 17:57, 2 May 2024 (UTC)Reply

Support deleting these. Ultimateria (talk) 17:21, 6 May 2024 (UTC)Reply

If we have this kind of thing, it should be with a clear rationale for when/where and why (as Benwing says) and it should be added automatically, probably by whatever headword- or definition-line templates we're using to declare something as a noun plural form, paucal form, etc in the first place — I say this because as far as I saw in the prior RFDs, the categories were populated haphazardly and manually with handfuls of entries, which is not useful. The usefulness of categorizing non-lemma forms by their specific non-lemma-ness seems small (though not nonexistent) to me; I suppose if I wanted to know what kinds of endings Foobarian noun plural forms had, a category would be useful, but the array of endings which Foobarian noun plural forms have could alternatively be mentioned on the About Foobar page, or on the Foobarian equivalent of Appendix:English grammar. Can anyone articulate something these categories would be useful for? (Absent that, I have no objection to deleting them, and indeed voted to do so in some of the prior RFDs.) - -sche (discuss) 19:21, 2 May 2024 (UTC)Reply

Personally, I find these categories very useful from a navigational standpoint, so I'd like to see them kept. That said, they should be added automatically as part of templates like {{infl of}} and {{plural of}}, not added manually by users. Binarystep (talk) 11:26, 5 May 2024 (UTC)Reply

@Binarystep Do you realize this is simply an intersection category? In general we don't usually include intersection categories because you can search for any combination using the Search feature. In this case, e.g. to do the equivalent of CAT:Chadian Arabic noun plural forms, you can search for the combination of category CAT:Chadian Arabic noun forms and template Template:plural of. Adding them automatically using templates like {{infl of}} and {{plural of}} has already been tried, but it turns out to be difficult from a programmatic standpoint in some cases and a maintenance headache, which is the reason I want them removed. Benwing2 (talk) 20:02, 5 May 2024 (UTC)Reply

template similar to Template:alt or Template:desc for Derived terms, Related terms, etc.?


Hi. User:Fay Freak and I have been having a discussion about using {{alt}} or {{desc}}, or creating a similar template, for Derived terms and the like. This came up because Fay Freak has been using {{desc|nolb=1}} in Derived terms sections. (Note: |nolb=1 disables the language name at the beginning. FF proposes renaming |nolb= to |nolang= to avoid confusion with |lb= for labels and because what's being suppressed is a language name, not a label.) Both {{alt}} and {{desc}} let you specify a series of terms along with per-term properties plus overall labels for the whole set of terms, although the syntax of the two templates is different and {{desc}} has some extra features specific to descendants. Note that we also have {{syn}}, {{ant}}, etc. for inline synonyms/antonyms/etc., which likewise have support for specifying a series of terms with both per-term properties and overall labels. The current syntax for Derived terms, Related terms and such involves manually listing each term with {{l}} and using {{q}} to add qualifiers as needed, but compared with {{alt}} and {{desc}} this is both more cumbersome and less standardized, meaning that different people format things differently. I think we ought to have some way for Derived terms sections and the like of specifying a list of terms plus labels, similar to {{alt}} and {{desc}}. The question is, should we just reuse e.g. {{alt}} for this purpose, or create another template? (If the latter, I'd maybe call it {{terms}}.) Potentially we could rename {{alt}} to {{terms}} or something similarly generic and keep {{alt}} as an alias, since there isn't really anything about {{alt}} that is specific to Alternative forms.

I'm omitting mention of {{col3}} and the like; while these are useful especially for long lists of similar terms, they don't provide the ability to specify a set of labels at the end of the list of terms, as {{alt}} and {{desc}} do.

Benwing2 (talk) 05:22, 2 May 2024 (UTC)Reply

That'd be quite nice. All I have to add is that it'd help to have the option to split derived terms into columns or put them in collapsible boxes, as people have been doing with a variety of other templates (cf. cado). Nicodene (talk) 14:01, 2 May 2024 (UTC)Reply

I think we'd be able to scrape this to be honest. All it'd need is an etymology section for most terms... Vininn126 (talk) 16:20, 5 May 2024 (UTC)Reply

@Vininn126 I don't quite understand what you mean, can you clarify? Benwing2 (talk) 20:03, 5 May 2024 (UTC)Reply

Sorry, misinterpreted. Not sure I have a strong opinion. Vininn126 (talk) 07:17, 6 May 2024 (UTC)Reply

I'm having trouble understanding the need for such a template beyond stringing multiple {{l}}s together. Can you give an example? I'm also confused by the association being made between Derived terms and Alternative forms. They're pretty distinct in my mind. -- Sokkjō 03:41, 11 May 2024 (UTC)Reply

@Sokkjo User:Fay Freak gave the example in Sittenstrolch of using {{desc|de|Sittich|lb=prison slang|nolb=1}} under Derived terms in order to get the label functionality; it displays as

Sittich — prison slang

You can get a somewhat similar effect using {{alt|de|Sittich||prison slang}}:

Sittich (prison slang)

Here, only one term is listed but you can easily imagine listing multiple terms and multiple labels, which are supported in both syntaxes. Note that you couldn't so easily just use a qualifier because the labels autolink like {{lb}} labels, but don't categorize. I suppose you could write

* {{l|de|foo}}, {{l|de|bar}}, {{l|de|baz}} {{lb|de|prison slang|Austria|nocat=1}}

which displays as

foo, bar, baz (prison slang, Austria)

much like writing

* {{alt|de|foo|bar|baz||prison slang|Austria}}

but as you can see, the former is much more awkward.

The reason I brought this up is that there's not a lot of functionality (and arguably no functionality) that's specific to {{alt}}; that's why I mentioned generalizing (or simply renaming) {{alt}} so it can be used outside of Alternative forms sections. Benwing2 (talk) 07:10, 11 May 2024 (UTC)Reply
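The argument convention shown in the {{alt}} example above (terms first, then an empty parameter, then labels) can be sketched in a few lines of Python. This is only an illustration of the display logic as described in this thread: `render_terms` is a hypothetical name, and the real templates also link each term and autolink the labels, which the sketch does not attempt.

```python
def render_terms(lang, *args):
    """Render 'term, term (label, label)' from template-style arguments.

    Positional arguments before the first empty string are terms; those
    after it are labels. `lang` is accepted only to mirror the template
    call; this sketch does not use it to build links.
    """
    if "" in args:
        split = args.index("")
        terms, labels = args[:split], args[split + 1:]
    else:
        terms, labels = args, ()
    text = ", ".join(terms)
    if labels:
        text += " (" + ", ".join(labels) + ")"
    return text


# Mirrors {{alt|de|foo|bar|baz||prison slang|Austria}}
print(render_terms("de", "foo", "bar", "baz", "", "prison slang", "Austria"))
# foo, bar, baz (prison slang, Austria)
```

Nothing in this logic depends on the section the terms appear in, which is the point made above about {{alt}} having little that is specific to Alternative forms.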

In the example Sittenstrolch, there is no reason a usage label would belong there -- that should be left to the entry page. If I saw a user add that, I would delete it. -- Sokkjō 07:27, 11 May 2024 (UTC)Reply

Obviously not everyone agrees with you, because qualifiers and labels are extremely common in derived terms, synonyms and the like. I would tread lightly and think twice before deleting such a label. Benwing2 (talk) 08:39, 11 May 2024 (UTC)Reply

What other users are putting usage labels in the derived terms section?! -- Sokkjō 05:07, 12 May 2024 (UTC)Reply

It's useful if it's only a derived term in one particular (uncommon) sense, and you want to make that clear, but that's quite a rare scenario. Theknightwho (talk) 22:52, 19 May 2024 (UTC)Reply

Being able to string together multiple {{l}}’s is all I ever wanted for Christmas. Nicodene (talk) 05:56, 12 May 2024 (UTC)Reply

As the bulk Derived terms adder, I use {{col-auto}}. I'm going through Wiktionary:Todo/compounds not linked to from components/2024-01/page 1 slowly and (mostly) surely. If anyone wants to help, or program a bot to do it, go ahead. P. Sovjunk (talk) 22:16, 19 May 2024 (UTC)

Column templates, like {{col-auto}}, are often overkill, as seen here, and I do not support bot action. -- Sokkjō 05:01, 20 May 2024 (UTC)

There's nothing detrimental about it, so I see no reason why it matters. Theknightwho (talk) 05:12, 20 May 2024 (UTC)

@Benwing2 Any chance it could be done using not {{alt}} – which was objected to above – but rather a clone of it named something like {{dt}} (for derived terms)? Nicodene (talk) 02:45, 6 June 2024 (UTC)Reply

@Nicodene This could be done, although if we call it {{dt}} rather than e.g. {{terms}}, someone will soon find a use for it in ==Related terms== or other sections. Benwing2 (talk) 08:23, 6 June 2024 (UTC)Reply

@Benwing2 {{terms}} sounds good to me. Nicodene (talk) 01:02, 7 June 2024 (UTC)Reply

@Benwing2 Any chance we can go ahead with this? Sorry for the repeated ping. Nicodene (talk) 23:26, 20 June 2024 (UTC)Reply

Myself, and I assume others, do not support this. -- Sokkjō 23:39, 20 June 2024 (UTC)Reply

@Sokkjo I believe banning labels and qualifiers from derived or related terms sections would require community consensus since they are currently possible/allowed. For the record I'd support such a ban and tend to delete examples on sight. Nicodene (talk) 00:22, 21 June 2024 (UTC)Reply

@Nicodene Victar doesn't seem to support any changes to anything; unless there are cogent objections from others I will go ahead and create the template in a few days. In any case, in general I don't believe merely creating a template requires a vote or even strong consensus; consensus should rather be about usage. Benwing2 (talk) 01:06, 21 June 2024 (UTC)Reply

@Benwing2: Using a template other than {{l}} for Derived terms and Related terms is breaking with style guides and would absolutely require a consensus from the community. I'm honestly surprised you think it's OK just to take it upon yourself to make that change unilaterally. -- Sokkjō 01:14, 21 June 2024 (UTC)Reply

First of all that's not what I said; you're putting words in my mouth. Second, what you say about Derived terms and Related terms sections isn't even true; people routinely use things like {{der3}}, {{col-auto}} etc. in those sections. Benwing2 (talk) 01:19, 21 June 2024 (UTC)Reply

Phish, c'mon, those are column templates. That's a cheap argument fallacy. -- Sokkjō 02:09, 21 June 2024 (UTC)Reply

Well unless they speak up, it doesn't really matter. We can also assume others who haven't spoken up here support it based on this. Vininn126 (talk) 08:15, 21 June 2024 (UTC)Reply

I'm not really understanding the rationale for yet another template. Why can't {{l}} be used? — Sgconlaw (talk) 02:28, 21 June 2024 (UTC)

@Sgconlaw Because

* {{l|en|A}}

* {{l|en|B}}

* {{l|en|C}}

* {{l|en|D}}

* {{l|en|E}}

* {{l|en|F}}

* {{l|en|G}}

is a PITA to write out compared to

* {{terms|en|A|B|C|D|E|F|G}}

Edit: and also the latter displays a nice compact

A, B, C, D, E, F, G

rather than a bulleted list with one term per line.

Nicodene (talk) 04:37, 21 June 2024 (UTC)
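To make the comparison concrete, here is a minimal Python sketch of the two renderings being contrasted. It is only an illustration: the function names `render_terms` and `render_bulleted` are hypothetical, and the real {{terms}} template had not been created at the time of this discussion, so its actual parameters and output may differ.

```python
def link(lang, term):
    # Build the wikitext for a single {{l}} link, e.g. {{l|en|A}}.
    return "{{l|" + lang + "|" + term + "}}"


def render_terms(lang, *terms):
    # Hypothetical single-line form: all terms joined by commas.
    return ", ".join(link(lang, t) for t in terms)


def render_bulleted(lang, *terms):
    # The existing practice: one bullet line per term.
    return "\n".join("* " + link(lang, t) for t in terms)


if __name__ == "__main__":
    print(render_terms("en", "A", "B", "C"))
    print(render_bulleted("en", "A", "B", "C"))
```

The single call takes the language code once and the terms as positional arguments, which is the saving in typing described above.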

That’s why we have the column templates like {{col3}}. Not seeing a need for this, honestly. — Sgconlaw (talk) 04:56, 21 June 2024 (UTC)Reply

And I'm not seeing a need for 58 templates for basically listing words. Nicodene (talk) 05:31, 21 June 2024 (UTC)Reply

Yes the column templates are a huge mess and people keep creating more :( ... there's of course the existing WT:RFDO#remove lesser-used column templates topic from Dec 2022 that got partially done but is currently stalled. BTW {{terms}} is conceptually different in that it just makes it easier to list multiple terms on a single line, something that's already done a lot but awkwardly. Benwing2 (talk) 08:24, 21 June 2024 (UTC)Reply

Just for the record, I'm also in favor of reducing the number of these and instead opting for a more versatile option. Vininn126 (talk) 08:25, 21 June 2024 (UTC)

Plurals on head lines and declension tables


Is there any point in having both plurals on the head line and a declension table showing the plural for a noun lemma? I would be inclined to omit the plural(s) when there is a declension table. --RichardW57m (talk) 16:36, 2 May 2024 (UTC)Reply

@RichardW57m it would perhaps help to specify which language you're thinking of and give an example. This, that and the other (talk) 03:07, 6 May 2024 (UTC)Reply

The specific language where this has come up is Lithuanian: avìdė currently displays the plural only through the declension table. A similar case is the Lithuanian adjective headword template, where until recently the neuter form of many ordinals was wrong and contradicted the following declension table. --RichardW57m (talk) 11:27, 7 May 2024 (UTC)

IMO it depends on how regular the inflections in question are. If they serve as something like principal parts, I think it's useful to put them on the headword line as well as in the declension table, because then someone with some familiarity with the language will know how to inflect the term without needing to look through the whole declension table to figure out what the most important parts are. This is similar to how we list the past historic and past participle for Italian verbs. OTOH if they are largely predictable, putting them in the headword line is less useful. Benwing2 (talk) 23:27, 8 May 2024 (UTC)Reply

As Benwing suggested, I would say the answer is language-specific. For example, in German, plurals seem to be the most unpredictable declined form of a noun, so it makes some sense to give the plural in the head line.--Urszag (talk) 22:38, 9 May 2024 (UTC)Reply

A way to more easily connect with readers: a follow-up


Following Wiktionary:Beer_parlour/2024/March#A_way_to_more_easily_connect_with_readers, I wrote to WMF in an attempt to figure out how to best resolve this issue. @Johan Jönsson replied and has given us an option, I think. He suggests we create a new mailing list for admins and for us to put enwiktionary in the name somehow. What do people think of this solution? Vininn126 (talk) 16:03, 3 May 2024 (UTC)Reply

Support Ioaxxere (talk) 16:56, 3 May 2024 (UTC)Reply

Support This, that and the other (talk) 08:30, 5 May 2024 (UTC)Reply

Support Binarystep (talk) 12:45, 5 May 2024 (UTC)Reply

Support Thadh (talk) 11:09, 6 May 2024 (UTC)Reply

Okay, I'm going to move forward with this. See phabricator:/T364731. Vininn126 (talk) 10:38, 13 May 2024 (UTC)Reply

Update: we now have a private mailing list for admins (please open the phabricator thread for details). Any active admin may sign up. Ladsgroup mentioned that we may also open a public general-use mailing list if we want. I'll leave that discussion for another time. Vininn126 (talk) 07:20, 14 May 2024 (UTC)

Volga Türki language


Greetings, I'd like to propose giving Volga Türki an L2.

It is a significant member of the Middle Turkic literary languages, and is as important as Ottoman Turkish, Chagatai and Karakhanid, all of which already have their own Wiktionary categories: Category:Ottoman Turkish language, Category:Chagatai language, Category:Karakhanid language. Volga Türki is considered a descendant of Karakhanid, together with Chagatai, however they all are roughly contemporary.

It was in wide use in the Volga-Ural region from the 15th century (or from the 12th century, if the poem Qissa-i Yosof by Qul Ghali is included) until the adoption of Cyrillic and Latin scripts for the Tatar and Bashkir languages under Soviet rule. Even though the written languages for Tatar and Bashkir had started to diverge slightly from Volga Türki before Soviet rule, in the late 18th and early 19th centuries, it remained a common standard for international affairs, especially between other Turkic groups.

Its addition would not only help with etymological sections, but also help connect the cognates with other Turkic languages, similarly to other Middle Turkic literary languages' sections.

As for Unicode characters, numerals and readings, I already have prepared all of this, and will work on adding them as soon as the category is created. The sources of lemmas are going to be taken from books, dictionaries and other written resources from that time period. I will try to list a source for each lemma whenever possible.

The only issue, however, is that the language does not have its own ISO 639-2 code yet. I propose that one of the following codes be used for the language: iut (for İdil-Ural Turkic) or tui (Turkic of İdil-Ural). I advise against codes like vut (Volga-Ural Turkic) and ott (Old Tatar): firstly because the name Volga was not used by the locals, especially during the era of Volga Türki, and secondly because the name Volga/İdil/İdel Türki is neutral, whereas Old Tatar primarily refers to the diverged variant of Volga Türki that was used specifically for Tatar. Bababashqort (talk) 16:06, 3 May 2024 (UTC)

What is the Volga Turki corpus and how accessible is it? Qissa-i Yosof poem by Qul Ghali should definitely not be included, as it is covered by Khorezmian Turkic [1]. Allahverdi Verdizade (talk) 20:40, 3 May 2024 (UTC)Reply

Support BurakD53 (talk) 17:57, 4 May 2024 (UTC)Reply

Its corpus mostly isn't digitalised, but practically all Bashkir and Tatar literature from at least 16th century until late 19th century is written in Volga Türki. The books, manuscripts and magazines are still preserved in a lot of libraries in Tatarstan and Bashkortostan. As for Qissa-i Yusuf, that is somewhat debatable, but given the timeframe it probably suits Khorezmian, as one of the ancestors of Volga Türki. Bababashqort (talk) 07:24, 5 May 2024 (UTC)Reply

@Bababashqort: for the last issue, we generally make up our own codes using the code for the group it belongs to (probably "trk") followed by a hyphen ("-") followed by some sequence of letters that's not already in use by us. That way there's no chance of our code conflicting with an ISO code. Since this is strictly for internal use and our modules and CSS/JS code convert everything for browsers, we don't have to use existing ISO codes. Chuck Entz (talk) 18:24, 4 May 2024 (UTC)

Yes, I've been told that wiki uses a placeholder, but didn't exactly know how it worked. Thank you for explaining!

In this case I'd suggest trk-iut Bababashqort (talk) 07:25, 5 May 2024 (UTC)Reply

@Bababashqort We try to use the first three letters of the lect in the second part of names like this. What do you think of trk-idi or trk-vol? Benwing2 (talk) 08:04, 5 May 2024 (UTC)Reply
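The naming convention described in the last few comments (family code, hyphen, first three letters of the lect name, not already in use) can be sketched in Python as follows. This is only an illustration under stated assumptions: `propose_code` is a hypothetical helper, not part of any Wiktionary module, and the set of codes already in use is passed in by the caller.

```python
def propose_code(family, lect_name, in_use):
    # Take the first three letters of the lect name, lowercased.
    suffix = lect_name[:3].lower()
    # Assumption for this sketch: only plain ASCII letters are accepted.
    if not (suffix.isascii() and suffix.isalpha() and len(suffix) == 3):
        raise ValueError("cannot derive a three-letter suffix from " + repr(lect_name))
    code = family + "-" + suffix
    # The code must not clash with one that is already assigned.
    if code in in_use:
        raise ValueError(code + " is already in use")
    return code


if __name__ == "__main__":
    print(propose_code("trk", "Volga Türki", {"trk-pro"}))
```

Under this rule "Volga Türki" in the "trk" family gives trk-vol. A code like trk-iut would need the exception for extraordinary circumstances, since it is not derived from the first three letters of the name.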

trk-idi includes only the Volga part, as well as trk-vol. The name itself, however, is taken from the most widespread naming of the language, which unfortunately is shortened to Volga Türki, omitting Ural. And speaking of İdil, it is actually spelled as İdel in Tatar itself, İdil is just more Common Turkic. Therefore the only solution seems to be trk-iut, it's not that hard to deduce I think. Bababashqort (talk) 11:54, 5 May 2024 (UTC)Reply

@Allahverdi Verdizade suggested making a Turki category instead, which I'd very much prefer. It would remove the need to add more distinct subvariants of it, such as North Caucasian Turki, Nogay Turki and others. It would also allow a derivation template to be used for all the languages that used it: Crimean Tatar, Kumyk, Nogay, Bashkir and others. Bababashqort (talk) 13:21, 5 May 2024 (UTC)

@Bababashqort Sure, that works. What language is this a category of? Benwing2 (talk) 19:56, 5 May 2024 (UTC)Reply

I think he meant he wants Türki as a language code, not specifically Volga Türki Bortkastningskonto (talk) 07:01, 6 May 2024 (UTC)Reply

@Bortkastningskonto @Bababashqort OK, I need more information then. Is "Türki" supposed to be an L2 language? This is an awfully generic name for a language, and I would likely oppose this name for this reason. And I will repeat my assertion that the code for Volga Türki should be 'trk-vol' in keeping with the name. The code should reflect the first three letters of the lect name barring extraordinary circumstances (usually due to ambiguity when there are multiple lects sharing the first three letters, which is not an issue here). @Allahverdi Verdizade can you weigh in here? I am not qualified enough to tell whether this should be an L2 language, an etym-only language or just a label of some other language (the last two being rather similar). Benwing2 (talk) 07:11, 6 May 2024 (UTC)Reply

I didn't actually suggest making Türki a L2, rather I wondered whether it wouldn't be better to do so depending on how different Volga Türki is from, say, North Caucasian Türki. I can't answer that question myself, and I think, in general, very few people can give a well-informed opinion on that. Reading this book on North Caucasian Turki (in Russian) might help a little. Considering that Bababashqort is likely only going to work with sources written in the Volga variety, maybe it is the safest to create a Volga Turki L2, in which case you would circumvent the problem with "awfully generic name". Documents in North Caucasian Turki are terribly inaccessible (not digitized or normalized), so I don't think anyone is going to work with them.

In any case, there is also the problem of classifying "literary languages" and fitting them into genealogical tree schemes. It is often said that this or that language "is mostly X, but also incorporates elements of Y", at the same time as it "continues the literary tradition of Z". I can't exactly tell you what it means that "Volga Turki continues the tradition of Khorezmian Turki", which in turn "continues the tradition of Karakhanid", as it oftentimes is put in Russian books on the matter. Too much arbitrariness for my taste. So my opinion is that these "literary languages" maybe should not have ancestors and descendants. Allahverdi Verdizade (talk) 17:35, 7 May 2024 (UTC)

Support Yorınçga573 (talk) 20:23, 9 May 2024 (UTC)Reply

Request for a new language


Yet again, I request that Old Lombard be listed separately; for now it is listed as a dialect and not as a language. That Northern Irish Historian (talk) 17:30, 4 May 2024 (UTC)

I notice that Old Italian is currently an etym-only variant of Italian. Why can't Old Lombard be the same? How different are Lombard and Old Lombard? Benwing2 (talk) 18:59, 4 May 2024 (UTC)Reply

Old Lombard:

Faremo preg a Deo a Questi cominzament
et a la soa mather ke preg l’omnipotent.
Ke n’des a dir et a far tute l so placiment
Ço ked is la scritura si se conven a dir
De la pasin de Christ a ki ne plas hodir
La qual per nu katif je plase sostegnir
Bene questi paroli de panzer e da stremir
Qui longa fis e dis del pasio del fy de la rayna.
La qual si m’dia gratia et a mi sia vesina
Ke parlo dritament de la pasion divina
St’apreso si me scampo da la infernal pena.

Modern Lombard:

Ambiaróm con ‘na preghiéra a Dio
e a sò madèr che la préghes l’Onipotent
Che nómes a dì e a fa töt de so gradimènt
E per bontà sò el vègnes a compimènt
Chèl che la dis la Scritüra isé come l’è giöst a dìl
De la pasiù de Cristo a chi che öl sintìl
Pasiù che per notèr pecadùr la sèrf a soportà
Con rasegnasiù chèste parole de pianzer e dè dulùr
Ché se parla e se dìs del fiöl de la regina
Che la me dàghes gràsia e la me stàghes vizìna
‘Ntat che parle drit de la pasiù divina
Semài che scamparó de la pena infernal.

That Northern Irish Historian (talk) 22:35, 8 May 2024 (UTC)Reply

@That Northern Irish Historian That's not what I was looking for; you have pasted in two different translations which naturally will be different. If you try to match up the corresponding words, they are IMO marginally different enough to maybe be considered different L2's (although they differ less e.g. than the current Occitan dialects). I notice however that there are 0 lemmas currently listed as Old Lombard; are you actually planning on adding some? Benwing2 (talk) 23:16, 8 May 2024 (UTC)Reply

Yes, but see zinqui, Jesu, and other pages. It is not working. That Northern Irish Historian (talk) 23:24, 8 May 2024 (UTC)Reply

Words attested as Oghuz in DLT (Wiktionary:Beer parlour/2023/January#Oghuz language again (continuation of Wiktionary:Beer parlour/2022/December#Oghuz language))


That's how we enter these words. If you have any objections, please write here. BurakD53 (talk) 14:29, 5 May 2024 (UTC)

lol. Yes, I have objections. Allahverdi Verdizade (talk) 16:11, 5 May 2024 (UTC)Reply

As I said before, I want the {{trk-ogz-pro}} code to be removed and replaced with {{trk-ogz}}. Since we have already reconstructed them all under the {{trk-pro}} pages, Proto-Oghuz is quite unnecessary. If anyone still wants to reconstruct Proto-Oghuz, you can reconstruct it using the * sign on the Oghuz page. (Which is quite unnecessary) Likewise, {{trk-klj}} can also refer to the Arghu language, but the data in this language consists of a few words. {{trk-ogz}} is the direct ancestor of all Oghuz languages, in short, it is the same as Proto-Oghuz {{trk-ogz-pro}}. However, we cannot enter these Oghuz or Proto Oghuz words recorded in the Diwan into the site as entries. It requires reconstruction in order to be entered to us. However, these Proto Oghuz words, also Proto Khalaj words, are not a reconstruction. I think that both of them should be entered as input on the site, the biggest reason is that these languages cannot be assumed to be dialects of other languages. But since the Arghu language consists of only a few words, it can be entered under the name Proto. Oghuz language is mentioned many times in the Diwan and even information about its grammar is given. A few Proto Khalaj, i.e. Arghu, words may be added as exceptions. But since this is the case for Oghuz, there is no need to create a language code called Proto-Oghuz. This is my opinion. I firmly reject the addition of these Oghuz words to Old Anatolian Turkish. Not every word mentioned in the Diwan has been witnessed in Old Anatolian Turkish, and the place where Kashgarî shows the Oghuzs on the map in the period he mentions is not Iran, but Central Asia. Also the words here are more archaic than the form in which they are found in Old Anatolian Turkish. BurakD53 (talk) 18:22, 5 May 2024 (UTC)Reply

Support Yorınçga573 (talk) 20:10, 9 May 2024 (UTC)Reply

Lemma categories


Discussion moved from WT:Beer parlour/2024/April#Lemma categories.

I've been cleaning up Special:UncategorizedPages, and I've run across a number where @Nicodene has disabled categorization for alternative forms. My understanding is that all mainspace entries should be in either Category:[Language] lemmas or Category:[Language] non-lemma forms. While an alternative form is supposed to be a stub that links to the main form, as far as the categories are concerned, it's a lemma. It's certainly not a non-lemma form, because it has its own non-lemma forms. Leaving it out of both categories raises the question of why we have the entry at all, if we feel we need to hide it: if we don't link to it in the main entry, there's no way to navigate to it.

This has come up before over the years, and we've more than once decided to do it this way. As far as I can tell, Nicodene is the only editor who's doing otherwise. Has anything changed? Chuck Entz (talk) 03:13, 6 May 2024 (UTC)Reply

Why should Category:Franco-Provençal lemmas be clogged with twelve different renditions of ôtro, seventeen of ôtrament, and ten of solament? Why should Category:Old French lemmas (not to mention Category:Old French adverbs) be clogged with two hundred seventy one renditions of iluec? The whole point of a lemma is to provide a citation form to cover the variants. That is how altforms and altspellings are handled by the vast majority of dictionaries. Nicodene (talk) 03:23, 6 May 2024 (UTC)Reply

I'm of two minds here. Yes, we generally include alternative spellings and forms as lemmas; otherwise, for example, we'd end up including only one of oxidi{s,z}e as a lemma, and the other would go nowhere. At the same time, however, including 171 alt variants of iluec seems like serious overkill. Maybe we need a separate policy for non-standardized languages vs. standardized ones. Benwing2 (talk) 07:16, 6 May 2024 (UTC)Reply

At a minimum, every entry should be in some category. As far as how that's been accomplished up to now, my understanding matches Chuck's, that every entry is supposed to be categorized as either a lemma or a nonlemma (or both) and that alternatively-spelled nouns are still nouns (and lemmas, from the category / grammatical perspective). We could change that, e.g. add a parameter which, instead of turning categorization off, moves the entries from "Category:Foobarian nouns" to at least a POS-agnostic catchall "Category:Foobarian alternative forms and spellings", or something more specific like "Category:Foobarian alternative forms and spellings of nouns", "Category:Foobarian alternative forms and spellings of lemmas", but I do think we should continue to regard a completely uncategorized entry—an entry that cannot be accessed from any part of our category tree—as a problem.
There was support for not putting just any alternative spelling into topical categories in this 2022 discussion, but that didn't leave the entries categoryless.
FWIW, the issue of terms having tons of spellings isn't strictly limited to overall-nonstandardized languages, e.g. English has lots of spellings of kinnikinnick, Muhammad, voivode... but I think Benwing's suggestion of handling this on a per-language basis (and just accepting that the English categories will have a few cases like Muhammad where there are a bunch of spellings) is probably more workable than e.g. trying to decide (in a way that can be maintained over time with any consistency) on a per-spelling basis what counts, in a mostly-standardized (but standards-body-less "ungoverned") language like English, as a "standard" spelling. (E.g., several of the alternative spellings of Muhammad are used mainly in scholarly works, so dismissing them as nonstandard seems hard; and in the other direction, for a largely dialectal word, determining why any one spelling should be considered more standard than another seems hard.) - -sche (discuss) 13:54, 6 May 2024 (UTC)Reply
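The parameter idea floated above can be sketched in Python to show the one property that matters here: an entry never ends up with no categories at all. Everything in this sketch is hypothetical, including the function name `categories_for`, its parameters and the exact category names; it does not describe how the headword modules currently work.

```python
def categories_for(lang, pos_plural, is_alt_form=False, split_alt_forms=False):
    # Hypothetical option: alternative forms go to a POS-agnostic catch-all
    # instead of the lemma and part-of-speech categories.
    if is_alt_form and split_alt_forms:
        return ["Category:" + lang + " alternative forms and spellings"]
    # Default behaviour described in the thread: an alternative form is
    # categorized like any other lemma of its part of speech.
    return [
        "Category:" + lang + " lemmas",
        "Category:" + lang + " " + pos_plural,
    ]


if __name__ == "__main__":
    print(categories_for("Foobarian", "nouns"))
    print(categories_for("Foobarian", "nouns", is_alt_form=True, split_alt_forms=True))
```

Whether the flag would be set per entry or per language is left open in this sketch, as it is in the discussion.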

We could change that, e.g. add a parameter which, instead of turning categorization off, moves the entries from "Category:Foobarian nouns" to at least a POS-agnostic catchall "Category:Foobarian alternative forms and spellings"

I would be quite happy to use that if it were available as an option.

My main concern is keeping the categories clear and usable. When I look up 'Foobarian feminine nouns', for instance, I'd rather not have to wade through 5–10 (+) duplicates for every distinct noun. That is a serious headache with languages like Franco-Provençal or Romansch. Nicodene (talk) 07:54, 7 May 2024 (UTC)Reply

@-sche: I would like this to be implemented for English as well. Having full-fledged entries for minor spelling variants was a bad idea. Ioaxxere (talk) 03:08, 10 May 2024 (UTC)Reply

I disagree. All words should be given equal status, at least when it comes to categorization. I don't think Wiktionary should be treating variant spellings as inferior forms of the main entries. For starters, every spelling is (or was) the "default" spelling to someone. Using the example of Muhammad, for instance, there are plenty of people named Mohamad, Mohamed, Mohammad, Muhamad, Muhammet, etc. and it seems weird to claim that their names are merely lesser variants of the single "canonical" spelling. There's also the fact that some spellings carry unique etymological information, have slightly different pronunciations, or are used primarily by certain groups (regional spellings, for instance, or spellings used primarily by non-native speakers). Frankly, I find it troubling that there have been so many attempts lately to get us to reduce our coverage rather than expand it. At this rate, I won't be surprised if someone starts a proposal to convert alternate spellings into hard redirects. Binarystep (talk) 19:20, 11 May 2024 (UTC)

Edit with "username removed"


This edit has the user name removed. How can one see (if not who the user is), which user removed it and why? [2] Equinox ◑ 09:33, 6 May 2024 (UTC)Reply

I removed it because it was an accidental IP/logged-out edit by an editor (the same as did a similar change to unrapable). — SURJECTION ^{/ T / C / L /} 10:17, 6 May 2024 (UTC)Reply

I'm officially saying: don't do that. You can revert, delete, but do not wipe content unless it's real serious stuff like child porn. Thank you. Equinox ◑ 23:01, 7 May 2024 (UTC)Reply

Re how to see which admin performed the revdel: it's technically in the "View logs for this page" link on the edit history page, [3]. (If there were a lot of revdels and they did not follow so closely after the time the edits themselves were made, e.g. if I now went to the page and hid a revision from two months ago, and then Surjection hid a revision from one month ago as well as your edit just now, it might be hard for non-admins [who don't have "diff" links] to discern from that log who hid which thing... I guess in that case they'd just have to say "hey, who revdel'd X" and admins could check.) - -sche (discuss) 14:28, 6 May 2024 (UTC)

@Surjection: I want you to understand how it looked to me: I saw that someone had made an edit, they had no name, I couldn't see them, or talk to them, or discuss, it was like a GHOST DID IT. And I couldn't see who removed their name either. If you ever spent time on WP:OFFICE then ...well. Equinox ◑ 22:50, 7 May 2024 (UTC)Reply

I would, personally, be happy to see text like "edit made by a user whose name is hidden by this admin: Surjection". What I think is wrong and bad and goes against our free openness is just that MYSTERY NO-NAME. Equinox ◑ 22:51, 7 May 2024 (UTC)Reply

Side point: I know Chuck Entz (for example) likes to "clean the graffiti wall" so that vandals can't see their names. But I don't like that. The wiki should be a public space and we should only hide the history in real serious situations like "doxxing" (real name-addresses) or... am I wrong? @-sche @Chuck Entz @Surjection (and even worse, are there Wikipedia rules we are supposed to obey as children.) Equinox ◑ 22:55, 7 May 2024 (UTC)Reply

AFAIK it's global WMF policy to suppress this kind of thing (the IP addresses of users who've accidentally edited logged out), and indeed to suppress it way harder than a mere revision-deletion like Surjection did: "oversighters" have (or had?) database access to delete the information so hard that not even admins can see it. (But it also takes time to contact them, so it's fine for admins to revdel it in the meantime, like this.) This is precisely because of doxxing concerns, because many IP addresses identify the person's real address. (Other IP addresses, of course, merely send you to that one farm in Kansas.) If you ever see an edit where you think the content of the edit is wrong, just undo the edit... as you saw in this case, the username being suppressed doesn't prevent you from undoing the edit. - -sche (discuss) 01:39, 8 May 2024 (UTC)Reply

Would there be an issue if contributors were to hide their IP address with their screen names after, say, a week? CitationsFreak (talk) 03:46, 8 May 2024 (UTC)Reply

I should clarify that AFAIK such hiding only happens when someone requests it—usually the person who made the edit, though plausibly someone else who simply noticed what was going on. Last I heard, WMF folks were trying to roll out something that automatically obfuscates all IP addresses by making them show up in edit histories as e.g. incrementing numbers that change periodically or on request (so anytime someone thinks their current [non-]IP is getting too much attention from admins, they can hit "refresh" and start doing vandalism under a new identity, just like logged-in users can by creating multiple accounts), which will probably remove the need to do this in the future, if it gets implemented. - -sche (discuss) 05:12, 8 May 2024 (UTC)Reply

Would there be an issue if contributors could request that their IP address be hidden by their screen name? CitationsFreak (talk) 05:42, 8 May 2024 (UTC)Reply

Tangent: Is there a way to "claim" an edit you made while accidentally logged out? Caoimhin ceallach (talk) 21:51, 12 May 2024 (UTC)Reply

The issue of Old Kashubian (Old Pomeranian?)


I recently came to a realization about the {{R:zlw-opl:SPJSP|Old Polish dictionary}}: it contains texts from Pomerania with Pomeranian features, as it was made during a time when Kashubian was considered a dialect of Polish. However, typologically, this is very, very wrong. Pomeranian is considered North Lechitic, and anything "Polish" (Masovian, Upper Polish, Lower Polish, and Silesian) is considered East Lechitic; therefore anything Old Kashubian should not be considered Old Polish. I propose a split; I intend to add the location of creation for any Old Polish documents anyway for a future dialectal project (for Old Polish this means somehow categorizing the location of attestation by dialect) and separating any texts from Pomerania as "Old Pomeranian" with a code zlw-opm, or perhaps "Old Kashubian" zlw-ocb, with Kashubian and Slovincian as the children. These codes seem clunky to me and I am open to others. I have also corroborated this by emailing the editors of the Old Polish dictionary, who have told me that it indeed is "Old Kashubian", which they accept in their framework of Old Polish. Gorazd also holds the same view. @Thadh @Sławobóg @Rakso43243 @Benwing2 @Mahagaja @Silmethule. Vininn126 (talk) 10:50, 6 May 2024 (UTC)

Alternatively, if we accept Kashubian and Slovincian as the descendants of Old Pomeranian, then we could set them both to be descendants of Old Polish. The argument for this is that one could accept "Old Kashubian" as a constituent of Old Polish: not a dialect, but a constituent. This is what the editor of the Old Polish dictionary told me, quote: "I did not write that it is a dialect. I wrote that it is a constituent element of the Old Polish language. That is a big difference. The Old Kashubian language is a constituent element of the Old Polish language." The other alternative is that we ignore this, which seems wrong to me as well. Vininn126 (talk) 11:42, 6 May 2024 (UTC)

Another solution: give Old Kashubian an etycode and make it an alt of Old Polish and if a term is attested in Pomerania, we could set the Kashubian and Slovincian reflexes as inherited from that? Otherwise directly from Proto-Slavic. Vininn126 (talk) 14:36, 6 May 2024 (UTC)Reply

@Vininn126 I think this last solution is maybe the best. This is similar to what is done with Old Northern French, which is considered an etym-only variety of Old French even though Old French as normally construed refers to the Old French of the Paris area whereas Old Northern French refers to the Old French of Normandy, and neither is an ancestor or descendant of the other. The two differ significantly in phonology, e.g. Old French chacier /tʃatsiɛr/ -> English "chase" vs. Old Northern French cachier /katʃiɛr/ -> English "catch". Anglo-Norman and modern Norman are both descendants of Old Northern French (although we currently list Norman as a descendant of Middle French, which is wrong) and modern French is a descendant of Old French per se. Benwing2 (talk) 18:38, 6 May 2024 (UTC)Reply

I know @Silmethule also mentioned a similar situation with Ancient and Mycenean Greek and also Old Norse and Swedish/Icelandic. See also my question on WT:About Old Polish. Related to that, I'm unsure how to handle labels for all of this. I think we'd want to list Kashubian/Slovincian in the Old Polish entries if and only if a text from Pomerania has an attestation. And any Kashubian/Slovincian words should still have "inherited from Old Kashubian/Pomeranian". Vininn126 (talk) 18:49, 6 May 2024 (UTC)Reply

@Nicodene As our resident Romance expert, do you agree with changing the ancestor of Norman to be Old Northern French instead of Middle French? This will cause the 5 terms in CAT:Norman terms inherited from Middle French to throw errors, I think. Can you fix up those 5 terms? Also I notice there are 30 terms in CAT:Norman terms inherited from Medieval Latin, which seems impossible and probably need to be cleaned up. Benwing2 (talk) 19:54, 6 May 2024 (UTC)Reply

I've just cleared out the categories in question. Αgreed on removing Middle French as an ancestor of Norman. As for its further ancestor, I would leave it as just Old French, which includes ONF as-is. I think the latter are best treated as one overall language.

I've been meaning to eliminate '[Romance] terms inherited from Medieval Latin' in general, reassigning them to '...inherited from Early Medieval Latin' or '...borrowed from [later] Medieval Latin'. That will take some time. When it's done, perhaps we can make {{inh|romance language|ML.|...}} throw an error message and a brief comment. Nicodene (talk) 00:50, 7 May 2024 (UTC)

@Nicodene Thanks! I think the basic advantage of setting the ancestor of Norman to be Old Northern French is it more clearly shows the ancestry (when you go CAT:Norman language and look at the Ancestors panel) than just setting it to Old French. Since Old Northern French is an etym-only variant of Old French, I don't think it will make any difference in terms of what Norman terms are allowed to inherit from. What do you think? Benwing2 (talk) 01:44, 7 May 2024 (UTC)Reply

Oh, so setting it to ONF won't disallow inheritance from Old French. In that case it sounds fine to me. Nicodene (talk) 01:50, 7 May 2024 (UTC)Reply

Yeah that's right. Benwing2 (talk) 01:56, 7 May 2024 (UTC)Reply

@Nicodene @Benwing2 Here's how it works. If you set a variety (etym-only language) as an ancestor, the descendant can inherit from:

That ancestor and any (sub)varieties of that ancestor (in this case, Old Northern French, and any varieties it might have).
The parent (in this case, Old French) unless the ancestral variety is also explicitly ancestral to its parent (read: the thing it's a variety of), which doesn't apply here. This is for situations like Tajik having Classical Persian as an ancestor: Classical Persian's set as a variety of Persian, but is also set as its ancestor. Since Tajik's ancestor is also Classical Persian, it's only possible for it to inherit from Classical Persian (and any varieties thereof), not Persian in general.

It can't inherit from:

Any other varieties of the parent which aren't in the direct lineage of its ancestor (i.e. it wouldn't be able to inherit from other varieties of Old French, unless they're ancestral to/descended from/a subvariety of Old Northern French). To use an Italic example: if we set the proto-language of Romance to be Vulgar Latin, instead of simply Latin, the Romance languages could also inherit from Classical Latin (its ancestor), Latin (the general parent) and Old Latin (set as the ancestor of Latin), but they wouldn't be able to inherit from varieties like Medieval Latin or New Latin, since they aren't in the direct lineage.

It sounds complicated, but it seems to line up pretty neatly with most people's intuitions in practice. Theknightwho (talk) 15:40, 10 May 2024 (UTC)Reply
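The rules above can be sketched in a few lines of Python. This is only an illustrative model of the description in this thread, not the actual logic in Module:languages: the `Lect` class, the `allowed_sources` function and the toy registry (including the hypothetical "Romance" entry whose ancestor is set to Vulgar Latin, and the placeholder "Other Old French variety") are assumptions made up for the example.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Set, Tuple


@dataclass
class Lect:
    """A language or etym-only variety (hypothetical model, not Module:languages)."""
    name: str
    variety_of: Optional[str] = None          # parent, if this is an etym-only variety
    ancestors: List[str] = field(default_factory=list)  # explicitly set ancestors


def subvarieties(registry: Dict[str, Lect], name: str) -> Set[str]:
    """All (transitive) varieties of `name`."""
    found: Set[str] = set()
    stack = [name]
    while stack:
        current = stack.pop()
        for lect in registry.values():
            if lect.variety_of == current and lect.name not in found:
                found.add(lect.name)
                stack.append(lect.name)
    return found


def allowed_sources(registry: Dict[str, Lect], name: str) -> Set[str]:
    """Lects that `name` may inherit from, following the rules described above."""
    allowed: Set[str] = set()
    seen: Set[Tuple[str, bool]] = set()

    def visit(lect_name: str, with_varieties: bool) -> None:
        if (lect_name, with_varieties) in seen:
            return
        seen.add((lect_name, with_varieties))
        allowed.add(lect_name)
        if with_varieties:
            # An explicit ancestor brings its own subvarieties with it.
            allowed.update(subvarieties(registry, lect_name))
        lect = registry[lect_name]
        for ancestor in lect.ancestors:
            visit(ancestor, True)
        parent = lect.variety_of
        # The parent is allowed unless this variety is itself set as the
        # parent's ancestor (the Classical Persian case). The parent's other
        # varieties are NOT pulled in.
        if parent is not None and lect_name not in registry[parent].ancestors:
            visit(parent, False)

    for ancestor in registry[name].ancestors:
        visit(ancestor, True)
    return allowed


REGISTRY: Dict[str, Lect] = {
    lect.name: lect
    for lect in [
        Lect("Old French"),
        Lect("Old Northern French", variety_of="Old French"),
        Lect("Other Old French variety", variety_of="Old French"),  # placeholder
        Lect("Norman", ancestors=["Old Northern French"]),
        Lect("Persian", ancestors=["Classical Persian"]),
        Lect("Classical Persian", variety_of="Persian"),
        Lect("Tajik", ancestors=["Classical Persian"]),
        Lect("Latin", ancestors=["Old Latin"]),
        Lect("Old Latin", variety_of="Latin"),
        Lect("Classical Latin", variety_of="Latin"),
        Lect("Vulgar Latin", variety_of="Latin", ancestors=["Classical Latin"]),
        Lect("Medieval Latin", variety_of="Latin"),
        Lect("New Latin", variety_of="Latin"),
        Lect("Romance", ancestors=["Vulgar Latin"]),  # hypothetical setup
    ]
}

if __name__ == "__main__":
    for descendant in ("Norman", "Tajik", "Romance"):
        print(descendant, "->", sorted(allowed_sources(REGISTRY, descendant)))
```

With this toy data, Norman may inherit from Old Northern French and Old French but not from the other Old French variety, Tajik only from Classical Persian, and the hypothetical Romance entry from Vulgar Latin, Classical Latin, Latin and Old Latin but not from Medieval Latin or New Latin.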

So it would be possible to set Old Kashubian as an etym-only variant of Old Polish and then set Kashubian and Slovincian as the children of Old Kashubian but not Old Polish? Vininn126 (talk) 15:43, 10 May 2024 (UTC)Reply

@Vininn126 Per the rules just outlined, we could definitely make Old Kashubian an etym-only variety of Old Polish and set the ancestor of Kashubian and Slovincian to Old Kashubian, but people would still be able to "inherit" Kashubian and Slovincian terms from Old Polish. It'd be like the situation with Old French. If you wanted to avoid that, either we'd need a new flag or rule of some sort, or we'd need to change the name of Old Polish to e.g. "Old Lechitic" and make Old Polish an etym-only variety of Old Lechitic. @Theknightwho Here's a thought though. If we explicitly set the ancestor of Old Kashubian to Proto-Slavic, would that make it impossible to inherit Kashubian terms from Old Polish? That would be like a slight generalization of the special-case rule for ancestral-to-parent etym languages. Benwing2 (talk) 21:18, 10 May 2024 (UTC)

I've dreamed of "Old Lechitic", but it doesn't encompass Polabian. Vininn126 (talk) 22:08, 10 May 2024 (UTC)Reply

@Vininn126 Sorry, why does Polabian matter here? It can just be excluded from Old Lechitic just as it would be excluded from Old Polish. Benwing2 (talk) 08:41, 11 May 2024 (UTC)Reply

@Benwing2 I have actually tossed the idea of "Old Lechitic" around before with @Sławobóg and @Silmethule. I suppose since it contains Old Kashubian as well there is more precedent for the name. Vininn126 (talk) 08:45, 11 May 2024 (UTC)Reply

@Benwing2 So far I think the name change and etycodes might be the best solution. I'd like to see if anyone else has any thoughts. If we agree, we can make this change, maybe once I finish adding location information to the quotation templates (or maybe that's not necessary...). Vininn126 (talk) 12:03, 13 May 2024 (UTC)Reply

Also pinging @KamiruPL as the other main Old Polish editor so he can be aware of the goings-on and give his opinion. Vininn126 (talk) 08:34, 11 May 2024 (UTC)Reply

I wouldn't like this. This is almost akin to handling Old East Slavic as an Old Church Slavonic variety. Pomeranian and Polish are two distinct branches, and the fact that an earlier stage was highly influenced in their literary variety by the other doesn't make them one and the same. Thadh (talk) 20:43, 6 May 2024 (UTC)Reply

There's actually a similar issue with texts from Pomerania from {{R:pl:SXVI}} and {{R:pl:SXVII}} but I think we can safely nest these under modern Kashubian with a label, as I have done with Middle Polish. Vininn126 (talk) 19:43, 6 May 2024 (UTC)Reply

Absolutely no to changing the name of Old Polish to Old Lechitic or something similar. Since Kashubian belongs to a different group, it should be a separate Old Pomeranian L2 language. It would work better as Proto-Pomeranian too. Having an etym-only code would be an alternative solution, but then we are not consistent with our system (that made BG and MK descendants of OCS :)). Sławobóg (talk) 13:12, 14 May 2024 (UTC)

@Sławobóg As to your second point: are you saying we could set Old Pomeranian as an etycode within Old Polish? What about the issue where people would be able to type e.g. {{inh+|csb|zlw-opl}} with no issues? Vininn126 (talk) 13:30, 14 May 2024 (UTC)

Having Kashubian as descendant of Old Polish is just wrong. Having "Old Pomeranian" as etym-code for Old Polish would be better, but still not as good as having separate lang, but it might make editing easier. Sławobóg (talk) 13:44, 14 May 2024 (UTC)Reply

Would you be able to 1) assist in establishing spelling norms? 2) Dealing with the texts? 3) Understanding the grammar? 4) What about the fact that there are very few texts? 5) What about the fact that all of Old Polish already is a collection of dialects? Vininn126 (talk) 13:54, 14 May 2024 (UTC)Reply

1-3) Probably not. 4) We have languages like that. 5) Pomeranian being part of it is wrong. I'm not gonna fight here, you asked me for my opinion, I gave my opinion. And if you plan on having Middle Kashubian, having Old Kashubian/Pomeranian as L2 would be a good thing. Sławobóg (talk) 14:48, 14 May 2024 (UTC)

I'm not arguing, I'm just asking questions. I have no problem if you question the points I raised earlier! Vininn126 (talk) 14:50, 14 May 2024 (UTC)Reply

@Vininn126 I think we can fix the issue of {{inh+|csb|zlw-opl}}, if that would help. Benwing2 (talk) 14:54, 14 May 2024 (UTC)Reply

@Benwing2 That could be a good compromise. Vininn126 (talk) 14:55, 14 May 2024 (UTC)Reply

@Benwing2 @Sławobóg also mentioned a solution where we create a Kashubian label called "Old Pomeranian" and do not place that information in the Old Polish article. I'm not sure how I feel about this considering how close Old Pomeranian was culturally and even linguistically to Old Polish at the time. Do you have any thoughts? Vininn126 (talk) 11:46, 17 May 2024 (UTC)

@Vininn126 Hmm, I'm not so versed in the ins and outs of Old Polish but given how close Old Pomeranian and Old Polish apparently were, along with the fact that (I assume) there will be relatively few lemmas under Old Pomeranian specifically, I think it might make sense to keep them under the same L2 and just fix the inheritance issue to prevent people inheriting from Kashubian directly to Old Polish instead of Old Pomeranian. I don't think fixing the inheritance issue is such a big deal; we just have to change the logic here Module:languages#L-869 that computes the ancestors of a given language. Benwing2 (talk) 00:02, 18 May 2024 (UTC)Reply

Okay. I'm going to think on this issue a bit more. Vininn126 (talk) 08:18, 18 May 2024 (UTC)Reply

Old Polish regional categorization

As a sort of continuation of Wiktionary:Beer_parlour/2024/May#The issue of Old Kashubian (Old Pomeranian?) and Wiktionary talk:About Old Polish#Regional Old Polish, I'm trying to figure out the best way to handle regional information for Old Polish. I have a document explaining the origin of most texts in Old Polish, so it should be easy to figure out which of the 5 lects currently considered Old Polish (Masovian, Greater Polish, Lesser Polish, Silesian, and Pomeranian/Kashubian) each text belongs to. I think it would be useful for readers to know in which region a definition/term has been attested, as Old Polish wasn't a single entity and is ultimately the source of the modern dialects, so we can see regional features and the like more clearly. My concern about using labels is that they would imply that a term might have been limited to a given lect, which we can't know for sure. What do others think? Vininn126 (talk) 19:17, 6 May 2024 (UTC)

One solution could be to use {{lb}} but print the text {{lb|zlw-opl|attested in|Masovia|Lesser Poland}} etc. @Benwing2, would this be technically bad? Vininn126 (talk) 15:56, 8 May 2024 (UTC)Reply

@Vininn126 No, I don't see why that would be an issue. attested in isn't currently a recognized label but could easily be made one, so that it suppresses the following comma. Benwing2 (talk) 23:21, 8 May 2024 (UTC)Reply

@Benwing2 Alright, that would be fine, and I think that's a good solution. Vininn126 (talk) 07:32, 9 May 2024 (UTC)Reply

@Benwing2 Another solution would be to have the quotation templates categorize by dialect when added to a page. This probably would be a bad idea? Vininn126 (talk) 07:44, 9 May 2024 (UTC)Reply

@Vininn126 Yeah the quotation templates do take a label but I feel uncomfortable categorizing based on that label. You could for example imagine someone illustrating a general-use term with a sentence written in a dialect, and labeling the quotation with the dialect in question; that doesn't mean in this case that the term is in the dialect. Benwing2 (talk) 08:24, 9 May 2024 (UTC)Reply

@Benwing2 Alright so for now I'm going to add the location of creation of the documents and a note saying what label the quotation template should count toward, see {{RQ:zlw-opl:AcCas}}, and I'll add the labels and regions manually from there. Unless it'd be possible to do a bot job after. Vininn126 (talk) 08:26, 9 May 2024 (UTC)Reply

@Vininn126 Might be possible, depends on how regular everything is and you making a list of all the quotation templates and associated lect/labels. Benwing2 (talk) 08:36, 9 May 2024 (UTC)Reply

@Benwing2 I think it might be possible to try. Would it be possible to generate a list of text given in the {{{location}}} parameter? From there I could say which label should be given whenever that region appears under a definition and we could have a bot add those labels. (I might also want to update the output display of a few of the locations). Vininn126 (talk) 11:23, 26 May 2024 (UTC)Reply

I also think this will help me make a decision with Old Pomeranian. Vininn126 (talk) 11:24, 26 May 2024 (UTC)Reply

Finally, and sorry to dump a bunch of bot requests - could we remove any text within parentheses in quotations? They're not part of the original text, but used for things like clarification on the editor's end. Vininn126 (talk) 11:57, 26 May 2024 (UTC)Reply
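For the parenthesis request, a minimal sketch of the text transformation a bot could apply is given below. It is only an illustration under stated assumptions: the function name `strip_parenthetical` is hypothetical, the sample string is placeholder text rather than a real quotation, and the sketch works on an already-extracted quotation string, so it does not deal with locating the quotation inside template parameters or with parentheses that belong to wikitext syntax.

```python
import re

# Matches one innermost parenthesised span plus any whitespace before it.
_PAREN = re.compile(r"\s*\([^()]*\)")


def strip_parenthetical(text: str) -> str:
    """Remove parenthesised editorial insertions from a quotation string."""
    previous = None
    # Repeat until nothing changes so that nested parentheses are also removed.
    while previous != text:
        previous = text
        text = _PAREN.sub("", text)
    # Collapse any doubled spaces left behind and trim the ends.
    return re.sub(r"\s{2,}", " ", text).strip()


if __name__ == "__main__":
    print(strip_parenthetical("lorem (editor's note) ipsum"))
```

A bot run would presumably still need a manual review list for cases where the parentheses are part of the original text after all.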

@Benwing2 Can we add this to the labels module? I'm slowly working through these sources and I think this is the best solution. Vininn126 (talk) 12:01, 13 May 2024 (UTC)Reply

Continental Celtic

We have Continental Celtic as a family, but my understanding is that the consensus among Celticists is that CC isn't a clade but just a term of convenience for Celtic languages other than the Insular Celtic ones. Isn't our custom at Wiktionary to have only actual genetic families, not convenient groupings? —Mahāgaja · talk 11:28, 7 May 2024 (UTC)

@Mahagaja Yeah we should get rid of this. BTW the Wikipedia article on Continental Celtic was in a terrible state due to a bunch of crap added a month ago, which I reverted. Benwing2 (talk) 22:06, 7 May 2024 (UTC)Reply

Yeah, agreed. Theknightwho (talk) 04:09, 10 May 2024 (UTC)Reply

@Mahagaja @Benwing2 I've done this, given there were no objections after nearly 3 weeks. Theknightwho (talk) 11:01, 27 May 2024 (UTC)Reply

@Theknightwho Thanks! Benwing2 (talk) 18:36, 27 May 2024 (UTC)Reply

Ban one-descendant Proto-Italic and Proto-Hellenic redlinks

There are already far too many one-descendant Proto-Italic and Proto-Hellenic entries, and adding one-descendant redlinks to, for example, a descendant tree or an etymology section is only going to encourage the creation of more of these entries. These redlinks should be banned. -saph 🍏 13:31, 7 May 2024 (UTC)

Right, there should be above-average incentive to create such a page, so unless it is already decided to have one, bots should neutralize these links. Fay Freak (talk) 13:51, 7 May 2024 (UTC)Reply

In practice, what does a 'ban' on making certain kinds of redlinks mean, and what is the alternative it is supposed to incentivize? I guess mentioning the same form but not linking it would be slightly better, as it doesn't encourage creating an entry, but I'm not totally happy with that either in some cases. E.g. if the reconstructed form is itself doubtful, I wouldn't want it to be mentioned anywhere.--Urszag (talk) 15:44, 7 May 2024 (UTC)Reply

For example:

From Proto-Italic *fworom, from Proto-Indo-European *dʰwor-om (“enclosure, courtyard, i.e. something enclosed by the door, or the place outside, i.e. through the door”), from *dʰwer- (“door, gate”).

With the Proto-Italic word displaying as just plain text, rather than what we currently have (forum). As for the reconstructed form being doubtful, we should just list the hypothesised PIE form, e.g.:

Usually held to derive from Proto-Indo-European *swer-yo-s, from the root *swer- (“heavy”). Cognate with Old English swǣr (“heavy, grave, grievous”), German schwer (“hard, difficult, heavy”), Lithuanian sverti (“to weigh, balance”), svarùs (“heavy”). More at sweer.

As opposed to the current etymology given at serius. -saph 🍏 15:58, 7 May 2024 (UTC)Reply

The alternative it is supposed to incentivize is not creating such entries. You would have to have a more serious motive than ticking off a removed red link, since they are not apparent in the first place. Fay Freak (talk) 16:03, 7 May 2024 (UTC)Reply

Agreed. Down the line it may also be worth discussing a general ban of reconstructions (and their associated redlinks) that have only one descendant and no derived terms. Nicodene (talk) 22:49, 9 May 2024 (UTC)Reply

Could someone run a bot to do this? -saph 🍏 19:50, 10 May 2024 (UTC)Reply

One-descendant Proto-Italic entries could be useful for "internal reconstruction", i.e. comparing different Latin words, though; like how Latin verbs change their stem vowel when a prefix is added (e.g. faciō > interficiō). --kc_kennylau (talk) 13:39, 3 June 2024 (UTC)Reply

Add "Muslim", "Hindu" etc. labels?

Proposal to add labels for lemmas used by people of specific faiths (which are not necessarily religious terms; rather, they're only used by certain groups). Case in point: মিঞা (mĩa), which has a Muslim gloss, but the Muslim label is an alias for 'Islam', though it's not an 'Islamic' term, just one used by Muslims. Urdu dictionaries, which I concern myself with, have used these labels for centuries without prejudice. I know this would be useful for languages in the Indian subcontinent, as well as European languages (especially English). نعم البدل (talk) 20:55, 7 May 2024 (UTC)

@نعم البدل There are (at least) two possibilities here. One is to disentangle the labels 'Muslim' and 'Islam' in a language-independent fashion, and the other is to do it for specific languages. I suspect the aliasing of 'Muslim' and 'Islam' was done with English entries in mind, where on the surface it makes a certain amount of sense (e.g. we have 'Muslim finance' as an alias of 'Islamic finance' and 'Christian' as an alias of 'Christianity'). A third possibility is to create a separate label, something like 'Muslim usage' or 'Muslim speakers', which makes it clear that the term is used by particular speech communities. Note that the advantage of doing it in a language-specific fashion is we can create associated categories, such as Category:Muslim Bengali, to categorize such terms, which wouldn't make so much sense if done language-independently. Finally, the adjective-noun issue you're bringing up isn't limited to this case; there is for example the issue of 'British India' (English terms formerly used in British India) vs. 'British Indian' (English terms currently used by Brits of Indian background).

BTW if you think the terms should be disentangled language-independently, you can see all current uses of the label 'Muslim' here: Special:WhatLinksHere/Wiktionary:Tracking/labels/label/Muslim (there are only 9 of them). Benwing2 (talk) 21:58, 7 May 2024 (UTC)Reply

@Benwing2: I think the 'Muslim' (etc.) tag should be detached from the 'Islam' label and made into an independent label and placed under the Module:labels/data/topical so that, as you say, it can generate associated categories, something like Category:Bengali Muslim speech (similar to Category:English women's speech terms, a minor difference between 'Muslim Bengali' as the label I'm proposing should be shed of its religious connotations as much as possible).

you can see all current uses of the label 'Muslim' here – Thank you for this! As far as I can see, apart from marabout, all of the other terms should be placed under my proposed label, as that's what was probably implied. Note how the 'Muslim' tag in মিঞা (mĩa) was encapsulated with Template:a (added by an IP), not the 'Muslim' label – likely because the 'Muslim' label appends the lemma to Category:Islam which doesn't fit. نعم البدل (talk) 02:21, 8 May 2024 (UTC)Reply

@نعم البدل OK, let's see if there are any objections/comments, and if not I'll make this change in a few days. Benwing2 (talk) 03:04, 8 May 2024 (UTC)Reply

Yeah no worries! نعم البدل (talk) 17:34, 8 May 2024 (UTC)Reply

┌────────────────────────────────────────────────────────────────────────────────────────────────────┘ @Benwing2, نعم البدل For a while there was a category named CAT:Musalman Gujarati, which is now empty. The handful of terms that were in it were moved to CAT:Gujarati dialectal terms. It would be helpful if there is a category named something like CAT:Gujarati Muslim speech as a replacement for CAT:Musalman Gujarati.

There is a phenomenon known as being a Cultural Muslim, but not a practising Muslim, who might use the terms in a category such as CAT:Muslim speech but not necessarily identify with the terms in CAT:Islam. The same would probably be applicable to other faiths.

Would greetings such as salaam alaikum that are associated with a Muslim context but may or may not be intended to be Islamic be in the proposed CAT:Muslim speech alongside CAT:Islam? For this particular term, it says on Wikipedia that it is ‘common among Arabic speakers of other religions (such as Arab Christians and Mizrahi Jews)’. The usage notes section of नमस्ते says ‘it is often considered gracious to greet someone in their religion’s greeting’ [even if that differs from their own religion]. Kutchkutch (talk) 03:44, 10 May 2024 (UTC)Reply

@Kutchkutch: I might be drifting away from the subject a little since I'm a little interested in this :) The case with salaam alaikum is slightly complex, though. In Arabic, it's a common greeting, and used by people who follow Abrahamic faiths. I'm not really sure about the exact perception of that phrase in Arabic, but in Urdu it's sometimes the same: people who speak Urdu, regardless of their faith, might use that term, but some hardliners might be of the opinion that it's even forbidden to say 'Salam' to a non-Muslim, while other Muslims might not even bat an eye at the other's faith, and a label might not even be considered. Generally, I would say it applies to a CAT:Muslim speech (but not Category:Islam) because of alternatives like آداب (ādāb) being considered more 'neutral'. Is नमस्ते (namaste) considered to be inherently a Hindu phrase, as is generally the perception of Urdu speakers – even when it comes to Hindi, or is it somewhat neutral? نعم البدل (talk) 01:44, 11 May 2024 (UTC)

@نعم البدل: Thanks for the clarification about سَلام عَلَیکُم (salām 'alaikum).

Is नमस्ते considered to be inherently a Hindu phrase, …when it comes to Hindi, or is it somewhat neutral?

With respect to this proposal, नमस्ते and नमस्कार could go in CAT:Hindi Hindu speech. However, there is inherently nothing Hindu about the words नमस्ते and नमस्कार in and of themselves other than Sanskrit being the liturgical language of Hinduism (similar to how Arabic is the liturgical language of Islam). What may be considered inherently Hindu/Buddhist/Jain/Sikh about नमस्ते and नमस्कार is when the salutation (and related hand gesture 🙏) is toward a deity rather than an actual person.
Although the words नमस्ते and नमस्कार are found in Vedic literature in the context of worshipping Hindu deities, the words themselves are formations derived from नमस्, which is cognate to نماز. The word نماز was probably associated with Zoroastrianism rather than Islam before the Islamic conquest of Persia, and this is indicative that the term was not inherently bound to a particular religion.
Even though नमस्ते and नमस्कार are considered Hindu greetings, they seem to be neutral when speaking Hindi because it may only be inappropriate to use them if both the speaker and listener belong to a community that has its own community-specific greeting such as सलाम अलैकुम (salām alaikum) among Muslims, जय जिनेंद्र (jay jinendra) among Jains, and सत श्री अकाल (sat śrī akāl) among Sikhs. The reason for this may be that India is 79.8% Hindu (according to the 2011 census). If there are no overt indicators to guess the other person’s religion when talking to strangers, using the Hindu greetings (alongside the English greetings) may be considered as neutral since there is an 80% probability that the other person is a Hindu.

Under this proposal, would the title of CAT:Judeo-Urdu remain the same or would it be renamed to CAT:Urdu Jewish speech? Kutchkutch (talk) 11:23, 11 May 2024 (UTC)
@Kutchkutch That is a very good question; maybe User:-sche would have some thoughts about this. Often the Judeo-Foo varieties are their own dialects rather than just consisting of extra terms added to the language and writing the language in the Hebrew script (most famously "Judeo-German" aka Yiddish and "Judeo-Spanish" aka Ladino, each of which has its own L2). In other cases however, it is more comparable to the distinction between Hindi and Urdu, and in some situations there is even less of a difference; I'm not sure about Judeo-Urdu. Benwing2 (talk) 18:34, 11 May 2024 (UTC)Reply

@Kutchkutch: My opinion would be to keep Category:Judeo-Urdu as it is as it's a separate variety. نعم البدل (talk) 19:15, 12 May 2024 (UTC)Reply

┌────────────────────────────────────────────────────────────────────────────────────────────────────┘ Thinking about the "Judeo-Urdu" vs "Urdu Jewish speech" question has me wondering how sensible labels/categories for "Jewish speech", "Muslim speech" etc really are...though I don't know what a better alternative is.
In theory, if we add a "Jewish speech" label, all of our entries in any Judeo-X lect, whether we treat it as a distinct language ("Judeo-Italian", "Judeo-Tat") or a dialect ("Judeo-Arabic"), could simultaneously gain this new "Jewish speech" label, by definition, no? Arguably a bulk of our Hebrew and Yiddish entries would also gain it. But is that useful? (So maybe, for such languages, we forgo the label? But where is the line? Do we have a Hindu-speech Hindi category, or do we assume the default for Hindi is Hindu and exceptions must be specified? But even more Bangladeshis are Muslim than Indians are Hindu...)
On a practical level, I worry that users will not grasp or maintain a distinction between "Muslim" and "Islam", because "it's mostly Muslims who use [such-and-such Islamic-religion-related word]" → "I'll label it 'Muslim'" is just too logical a train of thought, as is "only people who believe in Islam use this word" → "I'll label it 'Islam' [even if the word just means 'man' and not a per se religious concept]", so it'll be a perpetual maintenance task to keep the labels straight. I also wonder... would nonreligious people from 'traditionally Muslim' areas not use (e.g.) মিঞা? Is using মিঞা really bound up in being Islamic, something only Muslims do—and if so, why is it not then a {{lb|en|Islam}} term? I wonder if this is not better handled (like also the ostensible "English women's speech terms")* by usage notes, that a particular term is typically used by people from "culturally Muslim" communities...? But I concede that even usage notes imply that if there are many such terms, they could be in a category (and if something is a dialect, even a cultural dialect, well, we do often have labels for that), even if I wonder if it would be possible to find clearer wording ("culturally Muslim"?)...
Does Judeo-Urdu, in particular the spoken form, have a lot of words in common with "general" Urdu? Is the main distinction that Judeo-Urdu is Urdu written in Hebrew script, or are there pervasive "dialectal" differences e.g. in how vowels are pronounced or how words inflect? If, in speech, lots of words are held in common between Jewish speakers' Urdu and other people's Urdu, then it might be weird to call those common words "Jewish speech" words solely because Jews tend to use a different script in writing, no? (Conversely, if lots of words are different and Judeo-Urdu is its own lect, whether an independent language like Judeo-Italian or a dialect like Judeo-Arabic, is there much benefit to categorizing things as both "Judeo-Urdu" and "Urdu Jewish speech"? But as I said above, I suppose we could set it up so that the labels that would generate "Urdu Jewish speech" and "Judeo-Urdu" were aliases for Urdu and only generated one of those categories no matter which one was entered...)
This all seems...thorny.
(*Re "English women's speech": "Women's speech" does not seem to be a distinct lect in English the way it is in e.g. Sumerian ... but until very recently, the nature of our label- and category- system meant that any label that a single small language needed, had to be put into the singular big mishmash everyone saw presented as "labels that are available to all languages", so various people tried to find ways to apply myriad Sumerian-and Chinese- etc- specific labels to English... so I am tempted to change the few entries that use that label in English to instead have usage notes, where such notes/label would even be accurate, saying mostly women use it... and to restrict the label to only those languages which actually have distinct women's speech registers...) - -sche (discuss) 19:25, 12 May 2024 (UTC)Reply

@-sche You've made a lot of good points. I agree with you about converting the terms in CAT:English women's speech terms into Usage notes. Interestingly, all but one are primarily used in foreign contexts; possibly the native languages in those contexts do have a women's register. And the one remaining (bestie) seems questionable; it has a cutesy feel to it, which is probably why it's being considered "women's speech" but I'm sure you can find examples of men using it. As for Judeo-Urdu etc. I think any variety that is predominantly used by a particular ethnic community should probably not be redundantly tagged using that community's speech tag. Hence you could have e.g. "Sikh speech" or "Jain speech" in Hindi but not "Hindu speech". Same goes e.g. for Yiddish and Ladino being tagged as "Jewish speech". Benwing2 (talk) 21:31, 12 May 2024 (UTC)Reply

Sure we should have the labels. In other cases they are or have been even L2, as Christian Palestinian Aramaic and Jewish Palestinian Aramaic, the situation with information exchange in the past was of course more seclusive. For Arabic we agreed that the separate codes were exaggerated, but there were cases where I had to label Moroccan Arabic terms as Jewish-only, and even for Serbo-Croatian we have terms only working for Muslims.

Having to label most Hebrew, Yiddish and Ladino terms as Jewish is a strawman, Arab Israelis do great, of course etymologically many terms used by the minority will have foundation in the historical religion of the majority, the question is whether the respective frequency differs significantly. Fay Freak (talk) 21:51, 12 May 2024 (UTC)Reply

Englishman picture

So User:Shoshin000 (among other trollish activities) has been insisting on adding a picture of an angry football hooligan as the picture of "Englishman". I reverted it once, he restored. I mention this because I know the modus operandi and soon I'll be accused of being a badmin. Check out the entry and you know the previous picture was nicer. Equinox ◑ 22:47, 7 May 2024 (UTC)Reply

I personally think your picture is better (although I wonder, do we need a picture to illustrate this?). Benwing2 (talk) 00:00, 8 May 2024 (UTC)Reply

Honestly, I like Shoshin's pic, as it's more stereotypical.[1] There's nothing inherently Englishman-y about Eq's pic, besides the depicted person being English.

[1] Then again, that's a good argument against the pic. CitationsFreak (talk) 03:25, 8 May 2024 (UTC)Reply

It could be argued that pictures of nationalities, if they exist at all, should show someone of that nationality in characteristic clothing (although that is probably more appropriate for nationalities that actually have characteristic clothing that most people wear on a day-to-day basis). OTOH it's in general very hard to capture a nationality in a single picture (for this reason, Wikipedia usually supplies a whole collection of pictures to illustrate a nationality), and in any case this is more encyclopedic than dictionaric (a real but rare word). Benwing2 (talk) 04:06, 8 May 2024 (UTC)

Yeah, I was thinking that a collage would be best. I'm not sure what a recognizable British outfit would be, and having one person stand in for Britain could imply that British people all are X. Highly unlikely, but possible. CitationsFreak (talk) 04:12, 8 May 2024 (UTC)

I don't think nationalities should have photos at all, but I also disagree that File:ENG-BEL (6).jpg is "a picture of an angry football hooligan". The person in that photo doesn't look angry, nor is he doing anything hooliganish. His Englishness is clearly shown by the St George's Cross painted on his face. He arguably does illustrate [[Englishman]] better than the photo of Greg Rutherford, since Rutherford is representing the entire UK (not just England) in his photo. All that said, however, it is probably better to leave such entries unillustrated to avoid stereotyping. —Mahāgaja · talk 08:07, 8 May 2024 (UTC)Reply

I agree, this is not an entry that requires an image. Vininn126 (talk) 08:09, 8 May 2024 (UTC)

Aren't photos appropriate where there is an attestable, probably dated and often derogatory or demeaning, definition of a stereotype? Eg, Bavarians with lederhosen, Prussians with spiked helmets, Mexicans with sombreros and/or serapes.

There is no such definition here, nor would I expect us to attest any such definition. DCDuring (talk) 17:34, 8 May 2024 (UTC)Reply

I don't really see this picture as a problem, really, even though I wouldn't pick it myself. It'd probably be fine as part of a collage. Theknightwho (talk) 17:51, 8 May 2024 (UTC)Reply

Fixing Telugu rhymes

For years now, User:Rajasekhar1961 has been adding Telugu rhymes written in Telugu script instead of IPA. There is a special hack in Module:rhymes to deal with this, but IMO Telugu should (obviously) use IPA for rhymes, just like all other languages. Does anyone object to this? Can anyone out there read Telugu script well enough to tell me if the rhymes listed under Rhymes:Telugu (e.g. Rhymes:Telugu/రం) and Category:Rhymes:Telugu are even salvageable, or should just be nuked? I don't know much about Telugu but scripts are generally not 1-to-1 mappable to IPA, so I don't know what it means to have a rhyme listed using Telugu script. Benwing2 (talk) 00:40, 8 May 2024 (UTC)Reply

Strongly agree. Theknightwho (talk) 11:40, 8 May 2024 (UTC)Reply

@Benwing2, Theknightwho Rajasekhar1961 has certainly put effort into creating CAT:Telugu rhymes. However, unless the definition of a rhyme in a Telugu or Dravidian context differs from

‘the second part of a syllable, from the vowel on, as opposed to the onset’

you are correct in pointing out that these do not appear to have been done correctly. From an orthographic perspective, the final consonant (or consonant cluster) followed by the final diacritic (or the inherent schwa) of a word written in Telugu script (which is a Southern Brahmic abugida) does not constitute a rhyme. The entries in CAT:Telugu rhymes categorise words by word-final syllables rather than rhymes because the onset is included.

A Telugu editor could probably rectify the words mentioned on the entries in CAT:Telugu rhymes. However, even if there is a user with the appropriate background to do so, it would be a lot of work, and it would be the equivalent of deleting the entries currently in CAT:Telugu rhymes and starting over again. Kutchkutch (talk) 11:13, 10 May 2024 (UTC)Reply
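The point about onsets can be checked roughly at the code-point level. The following is a minimal sketch, assuming Python's standard `unicodedata` module and assuming that a category key such as the one in Rhymes:Telugu/రం is a plain string of Telugu code points. The helper names and the consonant range U+0C15 to U+0C39 are illustrative assumptions (a few rarer consonant letters sit outside that range), not anything Module:rhymes actually does.

```python
import unicodedata

# Assumption: the main run of Telugu consonant letters, KA (U+0C15) to HA (U+0C39).
TELUGU_CONSONANTS = range(0x0C15, 0x0C3A)


def code_point_names(text):
    """Return the Unicode name of every code point in the string."""
    return [unicodedata.name(ch) for ch in text]


def starts_with_consonant(rhyme_key):
    """True if the key begins with a consonant letter, i.e. it includes an onset."""
    return bool(rhyme_key) and ord(rhyme_key[0]) in TELUGU_CONSONANTS


key = "\u0c30\u0c02"  # the two code points of the key in Rhymes:Telugu/రం
print(code_point_names(key))
print(starts_with_consonant(key))
```

A key that begins with a consonant letter covers a whole written syllable (onset included), not just the part from the vowel on, which is the mismatch described above. A check like this only flags suspect keys; it cannot produce the IPA rhyme.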

@Kutchkutch Thanks. User:Rajasekhar1961 can you comment on why you did this? If I don't hear from you in a few days I will go ahead and delete all the Telugu rhymes. Benwing2 (talk) 14:50, 10 May 2024 (UTC)Reply

Kwami is messing with translingual entries, again

Just want to make sure there are some eyes on Kwami, as they've been making mass edits to Translingual entries that seem... worrying. After being reverted by @Theknightwho and @Benwing2 for deleting the translingual section, Kwami has recently begun deleting all the definitions from the translingual section instead.

I reverted all (but one) of the single character edits they've made today. However, they've been editing hundreds of TL entries and I have no idea how many entries are affected, as I've been very busy recently and can't check.

I'm not sure how bad the situation is so I don't want to "call out" Kwami. Just want to make sure people are aware before it becomes out of hand, like the last time this was discussed on here. — Sameer ^{﴾مشارکت‌ها・بحث﴿} 23:54, 8 May 2024 (UTC)Reply

@Sameerhameedy Thank you. I have blocked him for a month this time; I am getting seriously sick of this. I think he has used up all his lives; next time we should consider a permablock. Benwing2 (talk) 00:38, 9 May 2024 (UTC)Reply

Thank you, I'm also a bit annoyed since Kwami has gotten so many warnings and continues to do the same action. Now, Kwami has indicated that they will actually start a discussion on this issue before acting. There's no way to know if Kwami will actually follow through on that statement, but hopefully they do, so we don't have to do this every month. — Sameer ^{﴾مشارکت‌ها・بحث﴿} 00:51, 9 May 2024 (UTC)Reply

Just to clarify, these weren't random articles. I went through the whole Latin Extended Additional block and replaced physical descriptions (e.g. "the letter N with a line below") with requests for definition. I didn't delete actual definitions that would tell the reader what the letter meant or what it was used for.

Sameer, the discussion is the next thread. kwami (talk) 06:16, 10 May 2024 (UTC)Reply

@Kwamikagami That is exactly the issue. You are continuing to fail to see that there is no consensus for doing what you did, after 10+ times that you've been asked to get consensus *BEFORE* doing mass changes. If you're not seeing this now, I doubt you will ever see it, and if you're not willing to defer to and respect consensus, you're in for a permablock. Benwing2 (talk) 08:01, 10 May 2024 (UTC)Reply

@Benwing2, Sameer was concerned that there may be many more such edits, so I clarified what edits I had made. That included the category of articles I had edited, and the kind of edits I had made on them. I thought they might find that helpful.

As to your point, I wonder how possible it is to get consensus to do anything here. Hopefully the discussion below will produce consensus. My hopes aren't high, given that previous discussions got nowhere, but you never know. kwami (talk) 09:06, 10 May 2024 (UTC)Reply

I've been at Wiktionary for almost 20 years and have never yet seen a Beer parlour discussion result in consensus, so my hopes aren't high either. —Mahāgaja · talk 09:16, 10 May 2024 (UTC)Reply

You can't have read many Beer Parlour discussions, then. Kwami is simply trying to convince themselves that what they're being asked to do is impossible, because they can't ever accept they're wrong about anything, ever. It's not complicated. Theknightwho (talk) 10:57, 10 May 2024 (UTC)Reply

@Mahagaja I have seen plenty of Beer Parlour discussions that result in consensus; not sure what you're referring to. Benwing2 (talk) 14:53, 10 May 2024 (UTC)Reply

I guess ironically there's no consensus if there's consensus? And while I think us Wiktionarians like to bicker and we often disagree over certain details and such, I do think there's enough cooperation, compromise, and agreement to say that plenty of threads end in consensus. Vininn126 (talk) 15:10, 10 May 2024 (UTC)

If I decide that "q" is not a proper English letter unless followed by "u" and I want to get rid of all the English entries with a "q" not followed by "u", there is no way that I can get consensus for that via any process. That doesn't mean that I can go ahead and remove the English entries for words like Qatar and Iraq or even BBQ (it's not a proper abbreviation) because the usual process doesn't work. It means I should find something else to do. The unwritten question underlying all of this is "how can I get my way when I'm right and I can't get people to say they agree with me". Yes, the process isn't perfect, and sometimes doesn't work, but rejecting it entirely won't fix it. Chuck Entz (talk) 16:16, 10 May 2024 (UTC)

That's why I'm here. The question is straightforward: do we have standards for what counts as a definition? If so, what are they? Where can I find them?

In this case, does a graphic description count as a definition? Quite a few editors have said they do not, but there seems to be difficulty in implementing that.

Also, should we have a translingual section without providing evidence of translingual use? Especially when there is no definition in that translingual section?

Do we have consensus that such things should be tagged with RFDef or RFD, and how should I respond if I tag them and someone goes through and deletes the tags without discussion because they don't like the extra work?

It's fine to say 'go to RFD', but why spend months doing that if it should be obvious from the outset that they're not going to pass? That's a waste of everyone's time. That's why I'd like some concrete standards to follow. I assume Wikt must have standards; if you could just show me where they are (I don't see anything in the help pages), I could add a link to my user page and refer to them when making edits. Then instead of arguing over every edit, I could point to the standards and show that I've been following them, or they could point to them and show that I've been violating them. I don't mean about the RFD process, but about the content of our articles. kwami (talk) 19:37, 10 May 2024 (UTC)Reply

An example to provoke thought is zebra. A definition as 'an equid with prominent black and white stripes' would be an accurate description, and would work even if they were not a clade. (An early cladistic study concluded that they were not - morphology is a poor guide to details of relationships.) For Unicode characters - and Unicode provides an important high level classification of glyphs - the general principle is that combining marks are distinguished on the basis of shape. Therefore, graphic descriptions are quite relevant for 'precomposed' characters.

If there's an objection to a claim of translinguality, then raise a request for verification.

If tags are deleted without reason, then complain to the Beer Parlour if you can't find a helpful admin.

What may seem obvious to you is not necessarily even true. There are some very obscure characters around, quite possibly restricted to expensive books.

Wiktionary has a general disdain for wikilawyering, so don't expect everything to be laid down. Wiktionary also seems quite bad at documenting things - I'd love a guide on the anatomy of a definition. --RichardW57m (talk) 12:56, 13 May 2024 (UTC)Reply

@RichardW57m For physical objects, a physical description is fine. And I think it would be fine if we had "the letter a with an acute accent, used for ...," where we went on to give its use or meaning. And a short description without definition would be fine under a 'description' section or even 'etymology', assuming it's accurate (many Unicode names are not, they're just labels that need to differ from all others). But these cases are like defining 'zebra' as "a word spelled Z-E-B-R-A", and placing it under 'translingual' because there are multiple languages that have such a word, even if they mean different things (maybe some languages use it only in the sense of a crosswalk, others only for the equid), and give it only the English pronunciation /ˈziːbrə/ because English is the most important language. (Yes, we have characters listed under 'translingual' even though we give them the pronunciation of a particular language.)

Many people have now said that the Unicode name of a character or emoji, or similar sum-of-parts description, is not appropriate on its own as the definition. The problem I've had is trying to implement that. I've been told to take it to RFD, but that generally doesn't work. It would be nice to have some agreement as to what counts as a definition. kwami (talk) 18:19, 13 May 2024 (UTC)Reply

"the letter a with an acute accent, used for ...,"

I don't know how many languages change the meaning of an accent per character. It would be much more efficient to say "the letter a with an acute accent" with the page acute accent explaining what the diacritic does. — SAMEER (؂・؄・؏) 18:49, 13 May 2024 (UTC)

I'd be fine if we had consensus on doing that, but would think we'd want something more. Such a description often wouldn't say anything more than the Unicode name, and conflicts with our sum-of-parts criterion. Plus, people often just use the Unicode name even when it's not an accurate description. Another problem is that it creates a blue link that makes it look like we have a definition when we don't have a functional one. For people like me who use red links as a guide to creating missing articles, that can be a problem. Also, not all Unicode characters actually exist; some are errors. And some are rare enough that giving a graphical description as a definition doesn't do much for the reader, who may still not know what the character is used for or what languages it appears in. kwami (talk) 19:10, 13 May 2024 (UTC)

"The problem I've had is trying to implement that. I've been told to take it to RFD, but that generally doesn't work." The small problem with this is that you didn't take things to RFD, though, and it won't become any more true just because you keep repeating it. It's very clear that Kwami will never, ever understand why their approach is wrong. Theknightwho (talk) 18:53, 13 May 2024 (UTC)

I've taken dozens, possibly hundreds, of articles to RFD, or at least tag them so that the people at RFD respond to them. I don't know why you keep denying that. You repeating things over and over doesn't make them true either. Repeated false accusations like this are one reason I have a hard time accepting that you act in good faith. kwami (talk) 19:01, 13 May 2024 (UTC)Reply

@Kwamikagami I've found one, plus this which isn't relevant. Are you referring to all those entries you mis-tagged with {{d}} (speedy deletion), which exists to avoid doing the RFD process for routine deletions? I've just remembered that, after I found this comment where you try to bullshit about that, as well. Good grief. Theknightwho (talk) 19:22, 13 May 2024 (UTC)Reply

Yes, the POV of anyone who disagrees with you is "bullshit", while your POV is "truth". Again, arguing in bad faith, as you habitually do.

I don't recall which abbreviation of which template I used for which article. Some of them were probably speedies. Some were RFD. Some later on were RFDef (which I know you think is somehow illegitimate, but I maintain is still a valid use of process). kwami (talk) 20:17, 13 May 2024 (UTC)Reply

@Kwamikagami Saying "I've taken dozens, possibly hundreds, of articles to RFD" after taking two is bullshit, yes. Theknightwho (talk) 20:23, 13 May 2024 (UTC)

@Kwamikagami Replacing a definition with {{rfdef}} is not a valid process. In general you should never delete content even if you don't like it. Benwing2 (talk) 20:40, 13 May 2024 (UTC)Reply

I understand that. I meant that rfdef itself is a valid process. kwami (talk) 20:48, 13 May 2024 (UTC)Reply

RFDef is not a process - it's just the request template {{rfdef}}. It can't be used instead of RFD, and what you've just said makes absolutely no sense when the only times you used RFDef were in an attempt to circumvent the RFD process. Theknightwho (talk) 20:55, 13 May 2024 (UTC)Reply

If RFD doesn't work, then RFDef is another possibility. If no-one can furnish a definition, then there's an argument that the entry should be deleted. How is that "circumventing" the process? Again, you attribute bad faith to anything you don't like, which simply shows bad faith on your part. kwami (talk) 21:01, 13 May 2024 (UTC)Reply

@Kwamikagami "If RFD doesn't work": you've only ever taken two things to RFD, and one of those wasn't an entry, so you cannot possibly make that claim. You also seem to be under the bizarre impression that you're entitled to delete entries in other ways if you don't get what you want out of the RFD process.

I'm out of patience with this complete and utter refusal to understand the problem, and it's pretty clear other people are as well. Theknightwho (talk) 21:08, 13 May 2024 (UTC)Reply

A few dozen articles were deleted, so somehow your count is off.

Not deleting entries in other ways, requesting completion or deletion in other ways. I didn't delete these entries I was blocked for. I replaced non-definitions with requests for definition -- and I won't do that again -- but the entries remained. kwami (talk) 22:02, 13 May 2024 (UTC)Reply

@Kwamikagami There's nothing wrong with my counting: you only brought two things to RFD. It's not difficult to understand. Theknightwho (talk) 22:13, 13 May 2024 (UTC)Reply

To clarify, it looks like @Kwamikagami only brought two articles to RFD, but tagged a bunch of articles for speedy deletion, which were deleted before the admins realized those speedy deletions were bogus. RFD is in general the correct process for requesting deletion of pages you believe ought to be deleted that don't meet the speedy deletion criteria; but IMO if you are going to request deletion of a large number of articles, you should not tag every article with RFD, just make a post in RFD with a title "all articles meeting such-and-such criteria" and give your reasons. Or alternatively, bring it up in the Beer Parlour if it's controversial and merits being seen more generally. Benwing2 (talk) 22:31, 13 May 2024 (UTC)Reply

Precisely. Theknightwho (talk) 22:32, 13 May 2024 (UTC)Reply

Okay, so that's what I'm doing: bringing it up at the Beer Parlour, as I was advised to do.

But if it's better at RFD than here, I can start a thread there.

As for "bogus", the opinion I got from other editors at the time was that, if an article had no real content, then it met the criteria and it should be deleted. That specifically included articles that consisted of nothing but the character box and Unicode name in the definition section. I didn't start this, but picked up from where I saw others acting. This has been happening for years, especially with emojis, where someone would go through and create batches of emoji articles defined simply as their Unicode names, then someone else would go through and delete them, then someone else would recreate them, etc. You can see that in their deletion histories. It's less common with non-emoji characters, but there's a history of this there as well. So not only did I have no reason to think this was inappropriate, I was told it was appropriate and was something Wikt needed to keep on top of. kwami (talk) 00:45, 14 May 2024 (UTC)

Yes, and those people would have expected you to take any entries like that to RFD, instead of unilaterally deleting them, as there needs to be consensus that they contain no content. How are you still failing to understand such a simple concept? Theknightwho (talk) 10:42, 14 May 2024 (UTC)Reply

And adding {{rfc|mul|Need meaning rather than graphical description}} to an existing entry is a valid process. --RichardW57m (talk) 14:12, 16 May 2024 (UTC)Reply

Thanks. kwami (talk) 19:41, 16 May 2024 (UTC)Reply

What he wrote was "I've taken dozens, possibly hundreds, of articles to RFD, or at least tag them so that the people at RFD respond to them". Bickering about numbers is not constructive. The issue is whether or not he (or anyone) should be wholesale deleting translingual definitions and it seems pretty clear that the answer is no. —Justin (koavf)❤T☮C☺M☯ 23:20, 13 May 2024 (UTC)Reply

I didn't see that as deleting definitions. I replaced Unicode names with requests for actual definitions. I won't do that again. kwami (talk) 00:47, 14 May 2024 (UTC)Reply

I appreciate that you're acting in good faith and trying to do what you think is best, but it's not clear to me that you're correct and I'm honestly very surprised that you keep on doing things that seem like big unilateral changes without consulting others first because this kind of complaint has come up repeatedly. —Justin (koavf)❤T☮C☺M☯ 00:59, 14 May 2024 (UTC)Reply

Yes, it has. I need to be more careful. I could've just tacked the RFDef tag on the end of the description. kwami (talk) 03:31, 14 May 2024 (UTC)Reply

@Kwamikagami That's not the right thing to do either. IMO you need to get consensus *BEFORE* making changes to large numbers of pages, even just adding {{rfdef}}; if you can't get that consensus, don't make the changes. Benwing2 (talk) 03:44, 14 May 2024 (UTC)Reply

Okay. We'll see how the question below pans out. kwami (talk) 03:45, 14 May 2024 (UTC)Reply

The letter 'a' with a grave accent is a translingual description. The particular meaning of the grave obviously varies from language to language, and sometimes it may merely be an arbitrary diacritic, perhaps even just a word diacritic as in French. Translingually, the accent part is often used as a tone mark, but I think it can also be a symbol used as an input to stress assignment rules (though I may be being confused by my own private invention). Perhaps we need a vote on whether precomposed characters can be dismissed as sums of parts.

We even have some interesting language assignment questions. For example, 'ṉ' denotes the alveolar nasal of Tamil and Malayalam, but I think it's arguable that it isn't a letter of those languages. I'm a bit bothered by a pair of Lithuanian diacritics which don't seem to be any part of standard Lithuanian. --RichardW57m (talk) 12:41, 14 May 2024 (UTC)Reply

Yes, we define 'ṉ' as the Latin transliteration of letters in Tamil and Malayalam scripts, and place that under a translingual heading rather than under Tamil or Malayalam headings. I think that's appropriate. For á, we don't have a translingual heading, as no-one has come up with a translingual use/definition. Not saying one doesn't exist; e.g. there's IPA [a] with high tone, but as you say that's SOP. Again, I think that approach is appropriate. The question then is whether we want to make this our general approach, and reserve the translingual section for definable translingual uses. (Perhaps if we had a translingual use of 'á', we might also give the SOP use in IPA for clarity, but that wouldn't be enough to create the translingual section in the first place.) kwami (talk) 18:12, 14 May 2024 (UTC)Reply

In favor of this. Vininn126 (talk) 09:41, 10 May 2024 (UTC)Reply

Do descriptions count as "definitions"?

I'm not being facetious here. This is a serious question for something I haven't understood for a long time.

For instance, in the article á, would "the letter a with an acute accent" be a valid definition? If so, should such descriptions be added to all letters? If not, should they be removed (perhaps placed under a "Description" heading instead)? And if not, and the only material for an article is such a non-definition, should the entry be tagged as needing a definition, or the article tagged for deletion for having no content?

I suspect that if I were to add a definition to cat as "the word spelled C-A-T", I would be blocked for vandalism. I don't see any meaningful difference between that and defining á as "the letter a with an acute accent". I've been told this is a straw-man argument, but I really don't understand what's appropriate in our entries if graphical descriptions are allowed as actual definitions.

The same applies to emojis, of course. Should an emoji of a face with tears be defined as "a face with tears", or should the definition be what it means and what it's used for? kwami (talk) 00:29, 9 May 2024 (UTC)Reply
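One way to see the redundancy argument is that a description of this kind can be derived mechanically from Unicode data. The following is a minimal sketch, assuming Python's standard `unicodedata` module; the helper name `describe` is hypothetical and chosen only for illustration.

```python
import unicodedata


def describe(ch):
    """Return the Unicode name of a character and the names of its NFD parts."""
    decomposed = unicodedata.normalize("NFD", ch)  # canonical decomposition
    return {
        "name": unicodedata.name(ch),
        "parts": [unicodedata.name(c) for c in decomposed],
    }


print(describe("\u00e1"))  # á
print(describe("\u1e49"))  # ṉ
```

For á, the name and the parts say the same thing as "the letter a with an acute accent". For ṉ, the character name says "line below" while the combining mark it decomposes to is named "macron below", which illustrates that Unicode names are labels and not always consistent descriptions. Neither output says anything about what the character means or which languages use it.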

@Kwamikagami I agree that "the letter a with an acute accent" is not a good definition. If á had a translingual section it should at least explain how the letter is typically used across languages. I assume it usually represents some kind of /a/? Ioaxxere (talk) 01:43, 9 May 2024 (UTC)Reply

A definition that depended on users understanding IPA, however, would be unsatisfactory. DCDuring (talk) 01:56, 9 May 2024 (UTC)Reply

Personally, I don't see the point of a translingual section, except for things like the IPA or IAST transliteration, and this particular case would only be sum-of-parts in such cases.

But my question is what should be done with articles that have such non-definitions. Since starting this discussion, I was blocked for pasting [rfdef] tags yesterday on a bunch of articles in place of such descriptions.

So,

since it is not a good definition, can it be removed?
should it be replaced with a request for definition, or should the empty section be deleted?

kwami (talk) 01:56, 9 May 2024 (UTC)Reply

@Kwamikagami: I think it's better to improve the definition rather than adding a ton of {{rfdef}}s as this creates lots of work for other editors. Ioaxxere (talk) 02:09, 9 May 2024 (UTC)Reply

I agree, and I've been doing that where I can. But how do I improve the definition when there is no definition? What is the translingual definition of a letter that does not have translingual use? What is the definition of a letter that has no evidence of any kind of use? What I've been tagging are cases where I can't find any definition to provide.

The reason I've been adding rfdef tags is that I'm not allowed to delete empty entries.

So, if there's an empty article or section, one that has no content except a character box, and no definition of what's supposedly being defined, what's the solution? Do we leave it as a joke, or do we try to improve it? If we want to improve it, how do we do that, when there's no available data to improve it with?

If someone added a bunch of articles, all with the definition being "it's a word", shouldn't we at the very least tag them as needing actual definitions, even if that creates work for people? kwami (talk) 02:17, 9 May 2024 (UTC)Reply

I've gone through and added definitions to hundreds of these articles. The ones I've tagged are ones where I can find no definition to give. It's a choice of adding a tag or leaving Wikt looking like a joke. kwami (talk) 02:24, 9 May 2024 (UTC)

Clearly, most definitions are simple descriptions, prototypically, for nouns, having a hypernym and differentia. The descriptions are also supposed to be useful to users. "The word spelled C-A-T" is not useful being redundant to the graphic representation of the headword itself, thus a straw-man. For a Latin letter with a diacritical mark it might be useful for some that the definition explained how the so-marked letter differs from the Latin character without the mark or those with other marks in each relevant character set.

A definition of a word naming an emoji might include a description as well as what the emoji is understood to mean, like a good definition of green light. The entry might also have the appropriate image, too, constituting an ostensive definition, redundant to the headword in the case of Latin characters diacritically marked. DCDuring (talk) 01:56, 9 May 2024 (UTC)Reply

> "The word spelled C-A-T" is not useful being redundant to the graphic representation of the headword itself"

But "the letter a with an acute accent" is equally redundant to the graphic representation of the headword itself. So no, it's not a straw-man, it's reductio ad absurdum.

Because our users include many people who are not familiar with diacritical marks on Latin letters we give them an explanation of what they are looking at. Those descriptions also help those who, like me, don't have great vision and can't necessarily discriminate among the various diacritical marks and don't necessarily know the names of those marks. DCDuring (talk) 12:29, 9 May 2024 (UTC)Reply

I have no problem with that. That's what the 'description' section is for. But it's not a definition.

We also have a pronunciation section for people who don't know how to pronounce a word. But again, the pronunciation of a word is not its definition.

In many cases I've moved the description to a description section, and tagged the definition section as needing a definition. But then people get annoyed that I'm creating work for them, because now they're expected to treat Wikt as an actual dictionary.

Much of the opposition to improving articles seems to center around it being more important for Wikt to be correctly formatted and to look good, than for it to actually contain any content or be useful as a dictionary. kwami (talk) 21:58, 9 May 2024 (UTC)Reply

Definitions of nouns are not descriptions of the word, but of the meaning of the word. Orthography and pronunciation belong in other sections: they are not the definition itself. Why should graphemes (incl. emojis) be different? Many have (graphical) description, etymology and pronunciation sections. Those cover the description of the letter as a mark on paper or as said. A definition concerns itself with meaning. If no meaning is provided, so the reader can't tell what the symbol is for, then we're not providing a functional definition. kwami (talk) 02:00, 9 May 2024 (UTC)Reply

BTW, we also have cases of 'translingual' sections with zero evidence of translingual use. Sometimes letters are specific to a particular language, yet they have a translingual section with no definition. All that does is push the actual definition down lower on the page, where you can't see it without scrolling. That is a minority situation, but we do have hundreds of articles with "the letter a with an acute accent"-type descriptions as their 'definition'. kwami (talk) 02:10, 9 May 2024 (UTC)Reply

I guess my question is, if it's not acceptable to add these fake definitions, and it's not acceptable to tag them for improvement, why is it not acceptable to delete them? kwami (talk) 02:29, 9 May 2024 (UTC)Reply

@Kwamikagami What's not acceptable is deleting them without going through RFV or RFD. Come on, we've said this many times by now. Theknightwho (talk) 02:36, 9 May 2024 (UTC)Reply

We have the similar precedent of ligatures, an explicitly permitted 'part of speech'. Now, for त्र (tra), we have the reasonable definition "In Devanagari an irregular ligature of त and र". For characters, we sometimes benefit from having the composition pointed out. Of course, it's rarely as useful as that of ģ (“g with cedilla”). --RichardW57m (talk) 14:51, 16 May 2024 (UTC)Reply

In the case of त्र, since we have a Translingual entry, what is the point of the Dhivehi, Hindi, and Marathi entries that don't say anything different? —Mahāgaja · talk 14:56, 16 May 2024 (UTC)Reply

I suspect it's that nastiest of translingual issues, pronunciation, which differs between languages. --RichardW57m (talk) 17:24, 16 May 2024 (UTC)Reply

I agree that in cases like त्र, a translingual entry makes the most sense. In many cases we will have language-specific material to add as well, but here even the pronunciations are sum-of-parts. Though the reader might not be sure of that if we don't spell it out. kwami (talk) 19:56, 16 May 2024 (UTC)Reply

Divergent pronunciations can be seen with the ligature ज्ञ (jña); we have documented the divergence. --RichardW57m (talk) 14:06, 23 May 2024 (UTC)Reply

That and ksh are special cases, and are unpredictable graphically as well as phonetically. Most consonants + r are just sum-of-parts /Cr/. But that's the opposite question, on whether we should add predictable language-specific entries. kwami (talk) 20:58, 23 May 2024 (UTC)

@Kwamikagami The claim is that त्र is also unpredictable graphically. In some writing systems, the corresponding ligature is also not SoP phonetically, though no examples of both simultaneously came to mind. --RichardW57m (talk) 14:18, 28 May 2024 (UTC)

That makes sense. kwami (talk) 19:01, 28 May 2024 (UTC)Reply

DCDuring, I kinda agree. People may not know what the diacritics are called, and I find it helpful that translingual sections show variations of the same character across all languages. Kwami has, in many cases, moved the character variations template to a specific language, which is problematic as it makes no sense to put translingual Latin character variations under, say, Latvian as Kwami decided to do here. Y'know, I doubt Latvian has every Unicode variation of 'g' and every Unicode character with a cedilla in its alphabet. — SAMEER (؂・؄・؏) 17:52, 16 May 2024 (UTC)

Our lack of explicit consensus licenses some aspects of the bad behavior being complained about. I'd favor the not-so-radical solution of keeping each letter (with diacritics) only under a Translingual header, with links to language-, language-family-, or script-specific appendices that cover the complete alphabets of each language etc. in which they are used. If there is language-specific content that doesn't fit in said appendices, I would try first to make it fit by expanding the Appendix and only then allow language-specific L2s for the letter. DCDuring (talk) 18:15, 16 May 2024 (UTC)Reply

off-topic thread on RFD

I have gone through RFV and RFD, many times. The result tends to be that ppl get annoyed because I'm making more work for them.

So you're saying that if I added a definition line saying "this is a word" to thousands of articles, you couldn't revert me without going through RFD? That seems implausible.

I put a bunch of empty articles and sections up for deletion, and someone started deleting them, but then there was an objection and the process stopped, never to continue because there was no clarity on what should be deleted. So, what are the criteria for deletion? When does some verbiage formatted like a definition count as a definition? That's what I'm trying to get clarity on. When should I bother putting something up for RFV or RFD, and when should I write it off as a lost cause? kwami (talk) 03:44, 9 May 2024 (UTC)Reply

@Kwamikagami You removed hundreds of entries without going through any process, and never restored them, and a few threads about a handful of entries doesn’t change that. If you still can’t understand this is why everyone got annoyed with you, you may as well get a permanent block right now, because I - and many others - are sick of your shit. Theknightwho (talk) 13:51, 9 May 2024 (UTC)Reply

@Kwamikagami the problem is that you edit like a poorly programmed bot. You just start making mass changes without thinking things through or discussing them properly. You're not always wrong, but when you are you do damage on a massive scale. I have no idea how you could think that large numbers of "translingual" entries using language codes and templates for specific languages was a good idea, for instance. Chuck Entz (talk) 14:51, 9 May 2024 (UTC)Reply

As usual, when I come here with a question of what we should do, I get no answer, only complaints that I don't come here to work out what to do. So why not answer my question when I do ask instead of constantly complaining that I don't ask? kwami (talk) 21:53, 9 May 2024 (UTC)Reply

@Kwamikagami Your question was "I guess my question is, if it's not acceptable to add these fake definitions, and it's not acceptable to tag them for improvement, why is it not acceptable to delete them?", which I did answer.

If you want a discussion on whether we should delete them, the forum for that is RFD, and if you wanted a discussion on whether we shouldn't add any more, then you asked the wrong question. Usually I'd be more understanding, but I won't be if you're going to keep trying to spin your out-of-policy mass removals as a reasonable course of action. Theknightwho (talk) 00:10, 10 May 2024 (UTC)Reply

So you won't collaborate, but complain if I don't collaborate.

I'm not claiming it's a reasonable course of action, I'm asking, and have repeatedly asked, what the consensus course of action is. What's so difficult about this? Should we do X, or should we do Y. It's a simple question, but the answers I repeatedly get are some people saying X, some saying Y, so we can do neither.

If I go to RFD, it's argued I shouldn't have done that because it wastes people's time, and that it's inappropriate to delete items that don't meet our criteria when we apparently can't agree on what our criteria are. Is that why I never get a coherent answer? kwami (talk) 00:35, 10 May 2024 (UTC)

@Kwamikagami Nobody argued that it wastes people's time to go to RFD. Theknightwho (talk) 01:34, 10 May 2024 (UTC)Reply

Someone just argued that I shouldn't RFDef because it creates a bunch of work for people. kwami (talk) 05:02, 10 May 2024 (UTC)Reply

@Kwamikagami RFD always refers to WT:Requests for deletion, which is the appropriate forum to nominate and discuss potential deletions. It has nothing to do with {{rfdef}}, which is for requests for definitions. And before you pull the "how could I have known?" card (as I'm familiar with your MO), I did say "If you want a discussion on whether we should delete them, the forum for that is RFD", and you've engaged in discussions there before. Theknightwho (talk) 05:15, 10 May 2024 (UTC)

I'm aware of the abbreviations. (Yes, I must be acting in bad faith. That's the only explanation for my not being satisfied with your non-answers.)

I've tagged entries for both RFD and RFDef, and people have objected to both. The most recent case was for RFDef. That's why I mentioned it as a recent example. If the most recent example had been for RFD, I would've mentioned that. Doesn't really matter, the response from (some) people is the same: don't tag bad articles for improvement, it increases their workload.

So far I've had one short answer to this question, by Ioaxxere. The rest has been mostly complaints. Really, I don't understand the complaining rather than explaining what the consensus is, or how we should approach if there is no consensus. kwami (talk) 05:25, 10 May 2024 (UTC)Reply

To summarise:

You: If I go to RFD, it's argued I shouldn't have done that because it wastes people's time
Me: Nobody argued that it wastes people's time to go to RFD
You: Someone just argued that I shouldn't RFDef because it creates a bunch of work for people
Me: RFD always refers to WT:Requests for deletion, which is the appropriate forum to nominate and discuss potential deletions. It has nothing to do with {{rfdef}}
You: I'm aware of the abbreviations. (Yes, I must be acting in bad faith. That's the only explanation for my not being satisfied with your non-answers.)

Are you for real? For months people have been saying you need to use the RFD process, so as an excuse this simply comes off as ridiculous. Theknightwho (talk) 05:30, 10 May 2024 (UTC)Reply

Can you not answer a fucking question? Must it always be sidestepping and innuendos and dodging responsibility?

Last time I asked, I was told the proper place for this discussion was the Beer Parlour. So here I am at the Beer Parlour. You say one thing, someone else says something else. Do your opinions take precedence over anyone else's?

The question isn't just about deletion, but what should be done with fake definitions. Should they be tagged, or deleted, or are they what we want for all articles? I could shuffle back and forth between here and RFD, but if people like you refuse to answer in good faith, that won't do any good. Why can I never get a simple answer to a simple question in this place? People ask why I don't get consensus before making a change. Well, this is why: a bunch of quarreling and refusing to engage in the question at hand, so that nothing gets accomplished. kwami (talk) 05:37, 10 May 2024 (UTC)Reply

@Kwamikagami To repeat myself: if you want to nominate existing articles for deletion, then they need to go through RFD. If you think we shouldn't have any entries like that at all, then let this discussion play out, and people will comment on it - it's obviously not a question I can unilaterally decide, and I don't have particularly strong opinions on it.

What I do have strong opinions on is the way you repeatedly try to make your messes everyone else's fault, but I have better things to be doing with my time right now. Theknightwho (talk) 05:43, 10 May 2024 (UTC)Reply

If you think I should let this discussion play out, well, that's what I'm here for.

I'm collapsing this thread as it doesn't address the question I posed. kwami (talk) 05:53, 10 May 2024 (UTC)Reply

English anagrams

English anagrams haven't been updated in a while. Could someone run a bot to update them? Maybe @Kiril kovachev, Benwing2 Ioaxxere (talk) 01:43, 9 May 2024 (UTC)Reply

@Ioaxxere I can try, but I'm not sure if I trust myself to do it properly. Specifically the part of which characters (like punctuation) should be removed when comparing two words. Kiril kovachev (talk・contribs) 12:41, 9 May 2024 (UTC)Reply

@Kiril kovachev: It doesn't matter too much, since the vast majority of English terms don't have any special characters. Punctuation (periods, commas, etc.) as well as different casing should definitely be ignored, but I have no preference with respect to diacritics. Ioaxxere (talk) 15:45, 9 May 2024 (UTC)Reply

@Ioaxxere Okay, I've updated the word list I am using, so it should be ready to run, but I've run it a bit to get some sample edits. Should it consider e.g. lork and kröl to be anagrams? Or Nº and no etc.? In the case of diacritics I also don't have much of a preference, and it can be configured to not remove them if we want. I can make it run as-is if this seems ok. Kiril kovachev (talk・contribs) 13:11, 15 May 2024 (UTC)
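
The comparison rule discussed above (ignore punctuation and casing, optionally ignore diacritics) could be sketched roughly as follows. This is only an illustration, not the bot's actual code; the names `anagram_key` and `group_anagrams` and the `strip_diacritics` switch are hypothetical.

```python
import unicodedata
from collections import defaultdict


def anagram_key(term, strip_diacritics=True):
    """Return a sorted-letter key; terms with equal keys count as anagrams."""
    text = term.casefold()  # ignore casing
    if strip_diacritics:
        # Canonical decomposition splits e.g. "ö" into "o" plus a combining mark,
        # and the combining marks are then dropped.
        text = unicodedata.normalize("NFD", text)
        text = "".join(ch for ch in text if not unicodedata.combining(ch))
    # Drop punctuation, spaces and other non-alphanumeric characters.
    letters = [ch for ch in text if ch.isalnum()]
    return "".join(sorted(letters))


def group_anagrams(terms, strip_diacritics=True):
    """Group terms by key and keep only groups with at least two members."""
    groups = defaultdict(set)
    for term in terms:
        key = anagram_key(term, strip_diacritics)
        if key:
            groups[key].add(term)
    return {key: sorted(members) for key, members in groups.items() if len(members) > 1}
```

With `strip_diacritics=True`, lork and kröl share a key; with `strip_diacritics=False` they do not. Canonical decomposition alone does not fold the ordinal sign in Nº to a plain o, so treating Nº and no as anagrams would need an additional decision, for example compatibility normalization.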

@Kiril kovachev: Looks good. Will the bot be running continuously or on a schedule, e.g. updates every 3 months? Ioaxxere (talk) 14:22, 15 May 2024 (UTC)Reply

It's based on what words are currently on Wiktionary (I got the parsed data from kaikki.org, which is updated ~every month), so I can make it run every time the words are substantially different. I can run it every 3 months if you like, as long as we don't spot any obvious problems. Kiril kovachev (talk・contribs) 16:24, 15 May 2024 (UTC)Reply

@Ioaxxere (forgot ping) Kiril kovachev (talk・contribs) 16:25, 15 May 2024 (UTC)Reply

Sounds good. Ioaxxere (talk) 16:32, 15 May 2024 (UTC)Reply

@Ioaxxere We had a bit of a stall, since there's a little problem with the code when the entry contains a "/" in it, but I'll be working on a fix these few days. Actually it shouldn't be too complicated so hopefully we'll be back soon :) I don't know how long this is gonna take though, as it was going for a few hours already before it even managed to crash. I'll let you know again when I figure out how long it'll be... Kiril kovachev (talk・contribs) 15:41, 22 May 2024 (UTC)Reply

becocked, and whether we want Trivia sections

Recently, Trump used the word becocked, which attracted some attention because it's an unusual word, and quite a lot of people thought he'd made it up (even though he didn't). Is this the kind of thing we want to note in trivia sections? To me, it seems like the kind of thing no-one will care about in a month, and that it adds pointless clutter. Pinging @Ioaxxere, who originally added it as a usage note, but later changed it to the little-used Trivia heading. Theknightwho (talk) 02:12, 9 May 2024 (UTC)Reply

His use of the term attracted some media coverage ([4] [5]) making it probably the most notable event in the history of becocked. Does a single sentence about this really add so much clutter? Ioaxxere (talk) 02:18, 9 May 2024 (UTC)Reply

Ask yourself this: in two years' time, if someone came across this in an entry, would they feel like this addition was the cringeworthy result of terminally online recency bias? Almost certainly yes. It's basically just celebrity gossip. Theknightwho (talk) 02:21, 9 May 2024 (UTC)Reply

"X said this word" should never go in Trivia/Useful Notes. It can go as a quote, however. CitationsFreak (talk) 03:08, 10 May 2024 (UTC)Reply

I agree completely. I think trivia sections should chiefly be used for things like noting that a word is thought to be the longest in a particular language, has no vowels, doesn’t rhyme with any other word—that sort of thing. — Sgconlaw (talk) 11:35, 10 May 2024 (UTC)Reply

I agree too. P U C – 19:24, 11 May 2024 (UTC)Reply

Manipuri vs Meitei language (moved to RFM)

Discussion moved to WT:RFM#Manipuri vs Meitei language.

Performing bulk edits for Bengali/Bangla

Discussion moved from Wiktionary talk:Beer parlour/2024/May.

I'm a NLP researcher who uses Wiktionary to collect pronunciation data. As part of this effort we have noticed various inconsistencies in phonemic transcription. For example,

1. According to various sources (Khan, 2010; Dasgupta, 2003; Ferguson & Chowdhuri, 1960; Chatterji, 1970), Bengali has only one voiceless glottal fricative /h/, so /ɦ/ > /h/. E.g.: অকৃতোদ্বাহ 'bachelor' /ɔ.kri.t̪od̪.ba.ɦo/ > /ɔ.kri.t̪od̪.ba.ho/. This IPA symbol is not correctly represented in the Wiktionary Bengali transliteration guide. Therefore, I propose to edit the guide.

2. The correct phonemic transcription (ref. Dasgupta, 2003; Ferguson & Chowdhuri, 1960; Chatterji, 1970) for affricates should include the tie-bar, so /tʃ, tʃʰ, dʒ, dʒʱ/ > /t͡ʃ, t͡ʃʰ, d͡ʒ, d͡ʒʱ/. E.g.: চরম 'extreme' /tʃɔɾom/ > /t͡ʃɔɾom/, ছায়াছবি 'film' /tʃʰae̯atʃʰbi/ > /t͡ʃʰae̯at͡ʃʰbi/, জল 'water' /dʒɔl/ > /d͡ʒɔl/, ঝিনুক 'sea shells' /dʒʱinuk/ > /d͡ʒʱinuk/. This tie-bar is not included in the Wiktionary Bengali transliteration guide. I propose to include this tie-bar for affricate symbols.

3. According to various sources (Khan, 2010; Dasgupta, 2003; Ferguson & Chowdhuri, 1960; Chatterji, 1970), Bengali doesn't have the palatal plosives /c/ and /ɟ/. Instead it has postalveolar affricates (ref. https://en.wiktionary.org/wiki/Wiktionary:Bengali_transliteration). Therefore, /c/ > /t͡ʃ/ and /ɟ/ > /d͡ʒ/. E.g.: অগোচর 'beyond one's knowledge' /ɔɡocɔr/ > /ɔɡot͡ʃɔr/, অগ্নিযুগ '(figurative) the age of revolution' /oɡniɟuɡ/ > /oɡnid͡ʒuɡ/.

Does there exist any tool or API that could allow us to apply bulk edits? If this sounds right, I will start to make corrections. Arundhatisgupta (talk) 16:06, 9 May 2024 (UTC)Reply

I relocated this post because it was in the wrong place. — Sgconlaw (talk) 16:20, 9 May 2024 (UTC)Reply

The IPA has long held that the tie bar is not necessary when transcribing languages that don't distinguish affricates from stop-fricative sequences. If Bengali doesn't distinguish /t͡ʃʰ/ from ?/tʃʰ/, then our current transcription convention is fine.

In describing the phonetics of a language, you want to be as precise as possible, so the ties are a good thing. But with a key like we have, they're not necessary.

The tie bars clutter a transcription and can make it more difficult to read. If we did implement them, it would probably be better to use the under-tie, ⟨t͜ʃʰ⟩. That's generally more legible because our eyes pick up details better at the top of a symbol, so the under-tie is less distracting. kwami (talk) 05:12, 10 May 2024 (UTC)Reply

While "the tie bar is not necessary", it is good practice to include it and most languages on Wiktionary do. I don't see why Bengali would be an exception. Thadh (talk) 11:36, 10 May 2024 (UTC)Reply

I agree with @Thadh.

@kwami It is not necessary for English either. Why did you include it in English? Also, there is no consistency. If you think it is not necessary then make sure that you maintain that consistency. E.g.: অগচ্ছিত 'not entrusted to anyone' /ɔɡot͡ʃt͡ʃʰit̪o/ has the tie bar but চরম 'extreme' /tʃɔɾom/ doesn't. What do you think about that? Arundhatisgupta (talk) 16:37, 10 May 2024 (UTC)

Whichever convention is chosen, it should be consistent, and should match the key. kwami (talk) 19:19, 10 May 2024 (UTC)Reply

Confusion will arise when /t/ and /ʃ/ occur together but do not form an affricate, e.g. কুৎসা 'slander' /kutʃa/ and বচসা 'contention' /bɔtʃoʃa/. Without a tie-bar they seem to have the same pronunciation for /tʃ/, but the correct pronunciations are /kutʃa/ and /bɔt͡ʃoʃa/. Arundhatisgupta (talk) 16:51, 10 May 2024 (UTC)

That can be handled as ⟨kut.ʃa⟩ and ⟨bɔtʃoʃa⟩ or as ⟨kutʃa⟩ and ⟨bɔt͜ʃoʃa⟩ -- or, for maximal clarity, as ⟨kut.ʃa⟩ and ⟨bɔt͜ʃoʃa⟩. Just as long as we're consistent, or people will get really confused. kwami (talk) 19:22, 10 May 2024 (UTC)Reply

I personally think there should be a tie bar and it should go above, which is the more common practice. Benwing2 (talk) 21:06, 10 May 2024 (UTC)Reply

@Kwamikagami 1. If you are introducing a syllable break (indicating with a dot), then it should be applied consistently for all words in Wiktionary.

2. According to Wikipedia, undertie is used to represent linking (absence of a break) in the International Phonetic Alphabet. E.g.: /vuz‿ave/ (Ref. https://en.wikipedia.org/wiki/Tie_(typography)#cite_note-6) Arundhatisgupta (talk) 21:34, 10 May 2024 (UTC)Reply

Linking is used to override the orthographic spaces we insert between words in transcription. In that example, the words are //vuz ave// but the pronunciation is /vu.za.ve/. The /za/ forms a single syllable. That tie is not the same thing as the 'slur' tie used for affricates, which comes from musical notation (slurred notes). kwami (talk) 21:56, 10 May 2024 (UTC)Reply

I think that there should be a tie bar and it should go above, which is the more common and established practice for phonemic/phonetic transcription. It is important to maintain consistency within the language and across Wiktionary.

Is there any objection regarding other inconsistencies mentioned in the proposal? Arundhatisgupta (talk) 09:52, 11 May 2024 (UTC)Reply

@Arundhatisgupta No objections from me although I don't know enough about Bengali phonology to say whether e.g. the use of palatal plosives or affricates is correct. IMO the best way to go about making these changes is either manually or through AWB or JWB, which let you quickly do semi-manual changes based on regexes. Benwing2 (talk) 18:15, 11 May 2024 (UTC)Reply
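
As a rough illustration of the regex-style, semi-manual approach mentioned above, the three proposed substitutions could be sketched as below. This is only a sketch, not a tested tool: the function names are hypothetical, it only touches entries that spell out the transcription in {{IPA|bn|...}}, and it assumes the template body contains nothing but IPA (any extra parameters would need to be excluded first).

```python
import re

TIE = "\u0361"  # combining double inverted breve, the tie bar placed above

# Proposed substitutions, applied in order. Plain "tʃ" cannot match an
# already tied "t͡ʃ" or a dental "t̪ʃ", because a combining character
# sits between the two letters in those sequences.
RULES = [
    ("ɦ", "h"),
    ("tʃ", "t" + TIE + "ʃ"),
    ("dʒ", "d" + TIE + "ʒ"),
    ("c", "t" + TIE + "ʃ"),
    ("ɟ", "d" + TIE + "ʒ"),
]

# Matches a manually specified Bengali transcription template.
IPA_TEMPLATE = re.compile(r"(\{\{IPA\|bn\|)(.*?)(\}\})")


def normalize_bn_ipa(ipa):
    """Apply the proposed substitutions to one transcription string."""
    for old, new in RULES:
        ipa = ipa.replace(old, new)
    return ipa


def fix_wikitext(text):
    """Rewrite only the transcriptions inside {{IPA|bn|...}} templates."""
    return IPA_TEMPLATE.sub(
        lambda m: m.group(1) + normalize_bn_ipa(m.group(2)) + m.group(3), text
    )
```

Each proposed change should still be reviewed by hand before saving, since a plain /tʃ/ that spans a syllable break would wrongly receive a tie bar under these rules.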

@Benwing2 Could you please add me to Wiktionary:AutoWikiBrowser/CheckPageJSON ? Arundhatisgupta (talk) 14:39, 14 May 2024 (UTC)Reply

@Benwing2 I tried to sign in AWB and it's saying that my username is not enabled to use AWB. Could you please help me? Arundhatisgupta (talk) 13:52, 22 May 2024 (UTC)Reply

@Benwing2 Thank you for adding me to CheckPageJSON. I am able to edit the IPA pronunciation for words where it is given as {{IPA|bn|/IPA/}}, for example গহির, but I'm not able to edit it where it is given as {{bn-IPA}}, for example ছায়াছবি or গহনা. Could you please suggest how I can edit those pages? If they are linked to a database, is there a way to access or update that database? Arundhatisgupta (talk) 12:54, 25 May 2024 (UTC)

@Arundhatisgupta Pages that use {{bn-IPA}} are backed by a module, which auto-generates the pronunciation. In general it's preferred to use {{bn-IPA}}, possibly with respelling, instead of directly specifying the pronunciation using {{IPA|bn|...}}, because it ensures more consistent results. If you want to change the way the module works, you have to edit the Lua code that implements the module, but before doing that you should get consensus from other Bengali editors that this is the right thing to do. Benwing2 (talk) 17:53, 25 May 2024 (UTC)Reply

@Benwing2 I want to do that. Please let me know if anyone has any objection to the changes that I proposed, especially 1 and 2. 71.168.227.237 16:59, 28 May 2024 (UTC)

You need to look around to see who the Bengali editors are, and ping them. Benwing2 (talk) 18:40, 28 May 2024 (UTC)Reply

@Benwing2 Where/how should I look for them? Arundhatisgupta (talk) 14:25, 30 May 2024 (UTC)Reply

@Arundhatisgupta Look at the history of various Bengali language pages and see who created and/or edited them. Benwing2 (talk) 22:54, 30 May 2024 (UTC)Reply

Pinging recently-active users from Category:User bn-N: do any of you have opinions on the changes proposed above, in particular changing /ɦ/ to /h/ in notations of Bengali pronunciation, adding tie bars, changing /c/ to /t͡ʃ/, and changing /ɟ/ to /d͡ʒ/? @Syzarn, Smartiphone7, Shaiwala, Sbb1413, Ash wki, AlZawad26. - -sche (discuss) 23:11, 30 May 2024 (UTC)Reply

Adding @Asm sultan, @Deepon,@শরদিন্দু ভট্টাচার্য্য, @Tanweer Morshed, @BrightSunMan,@Countincr, @Hermitage17, @Intellectual Bookworm, @Laser Victor 2017, @Lira Rakshit, @Lira Rakshit, @Sbb1413, @Tanweer Morshed, @Vyutpatti, @Wikispeedier Arundhatisgupta (talk) 01:26, 31 May 2024 (UTC)Reply

@Benwing2 As there is no objection/opinion, can I proceed with Lua module correction? How can I be an "autoconfirmed user" to make any changes in bn-IPA module? Arundhatisgupta (talk) 14:16, 3 June 2024 (UTC)Reply

Yes, you may proceed, and I apologize that we've made you wait only for no-one else with Bengali-specific knowledge to actually weigh in (I had hoped more people (or even any people) knowledgeable of the topic would be able to express informed opinions about it). Are you getting a message that you can't edit that module because of the protection, or is it just warning you that you are editing a protected page? (If it were actually stopping you from editing, you wouldn't see an "edit" button, just a "view source" button.) If you're being stopped by the protection, I'm not sure why... it's set to "autoconfirmed users" (not even "autopatrolled", just "autoconfirmed"), and my understanding based on Wiktionary:Autoconfirmed users is that accounts older than 4 days with at least 10 edits are automatically autoconfirmed; your account is years old and has thousands of edits, so if you actually can't edit the module because of the protection, I guess either Benwing or another 'crat could grant you the "confirmed" group (which I as a mere admin can't grant, but I'm guessing bureaucrats can?), or I or another admin can just temporarily unprotect the module. All of this is assuming you understand how the module works well enough to edit it, yes? If not, one of our technical editors can help. Btw, personally I am inclined to agree with Kwami that if there are cases of tʃ across a syllable break which are specifically not t͡ʃ, it would be useful to explicitly indicate this not only by giving t͡ʃ its tie, but by marking the break in t.ʃ. - -sche (discuss) 01:54, 4 June 2024 (UTC)Reply

Yes, I can turn on the "confirmed user" setting, but Special:UserRights/Arudhatisgupta says you're already an implicit member of Autoconfirmed users, so if somehow you're not able to edit the page, something else is going on and we should investigate. Other than that, I agree with User:-sche about adding both a tie bar and a syllable break in cases where the tie bar isn't appropriate. Benwing2 (talk) 03:29, 4 June 2024 (UTC)Reply

@-sche @Benwing2 It was warning me that I am editing a protected page. It would be great if I could get help from one of the technical editors, as I am not quite familiar with the module and I don't want to mess up. Regarding the corrections, I agree all around. Arundhatisgupta (talk) 14:37, 4 June 2024 (UTC)

@-sche and @Benwing2 Update: I updated the module for the proposed changes. I didn't add a syllable break because in Bengali the dental consonant is transcribed as [t̪]. Therefore, we don't have any conflict between [t͡ʃ] and [t̪ʃ], for example, বৎসর and বছর. Arundhatisgupta (talk) 15:31, 6 June 2024 (UTC)

@Arundhatisgupta OK, sounds good to me. Benwing2 (talk) 08:11, 9 June 2024 (UTC)Reply

How should we present Latin adjectives that inflect like nouns (or that are really appositive nouns?)

A few times now, I've been puzzled about how to handle showing the inflected forms of certain Latin third-declension adjectives that don't fit well into any of the usual adjective inflection patterns, because they show the endings typical for a noun instead. Currently, these seem to mostly be treated in our entries as third declension adjectives of "one ending", but I think there are some issues with the accuracy of this in terms of showing forms and usage.

A particularly clear case is certain rare words that are attested with adjectival function but that have the form of feminine nouns, such as silvicultrīx, -trīcis and Nīlōtis, -tidis (which have the forms respectively of Latin and Greek feminine agent nouns). The masculine counterparts would presumably be *silvicultor and *Nīlōtēs, but these do not to my knowledge occur, and in any case, we normally treat agent nouns as noun lemmas (distinct for masculine and feminine) rather than combining the masculine and feminine versions under one adjective lemma. Should we lemmatize such words as nouns and include a usage note saying that they're used appositively? Or should we put them as adjectives (as many dictionaries do) but include some kind of special headword and declension table coding to avoid showing masculine or neuter forms, which I think aren't accurate in this case? For example, Gaffiot marks silvicultrix as "adj. f".

Not quite as clearcut are cases like senex, iuvenis, mās that generally have the form of nouns, commonly function as either nouns or as adjectival or appositional modifiers of masculine or feminine nouns, but are extremely rare or unattested in the neuter. (I've found some neuter forms attested in some cases in New Latin.) Functionally, I think there isn't much difference between how mās and fēmina are used, but we treat mās as a noun or adjective and fēmina as only a noun. Urszag (talk) 02:49, 11 May 2024 (UTC)Reply

@Urszag We have a whole category Category:Latin first declension adjectives for words like amnicola and indigena that don't seem so different from the words you've cited, and there are unquestionably third-declension non-i-stem adjectives (e.g. vetus, concolor) that "show the endings typical for a noun", so I don't see an issue treating these as adjectives. Benwing2 (talk) 05:33, 11 May 2024 (UTC)Reply

Yes, we have that category for first-declension adjectives. The full inflection of those words as adjectives is actually a bit questionable also (there was an RFV that I closed based on New Latin examples, but I added some notes in Appendix:Latin_first_declension discussing how the neuter plural nominative/accusative/vocative forms in -a and dative/ablative forms in -īs are rather hypothetical and ambiguous, as they've often (at least since Priscian's time) been interpreted as belonging to a second-declension paradigm instead (e.g. that of indigenus)).

Third-declension non-i-stem adjectives such as vetus exist, but are rare (aside from comparative forms). When there are attested neuter forms distinct from the masculine/feminine forms (such as vetus in the accusative singular, or vetera in the nominative/accusative plural), this establishes that a word is formally distinct from an appositive noun (and also establishes whether the neuter plural ends in -ia or -a, which can't always be predicted from the forms of the ablative singular or genitive plural). But I think that such neuter forms are often unattested (except sometimes in very late periods of the language) and in that case it's arguably misleading to just present a single full declension table. E.g. I found iuvenia once in Medieval Latin and occasionally in New Latin, and a couple of New Latin cases of iuvena (both from the same author), but I think it's more misleading than not to present either of these as established or standard Latin forms: a late imperial-era grammarian says that this word simply lacks neuter plural forms. In cases like this, there's an existing parameter to mark an adjective as lacking neuter forms, so I ended up using that and mentioning other forms in usage notes. But in cases like silvicultrīx and Nīlōtis, I don't know how to best present the fact that they occur only as adjectival modifiers of feminine nouns: for now I've just removed the declension table from the second word (since several forms are unattested, or only attested in post-Classical texts, and the Greek origin makes it tricky to actually infer what missing forms would be), but for silvicultrīx it seems fairly clear that it would simply inflect like victrīx. If we continued to categorize these as adjectives, does it sound reasonable to establish a parametric way to mark them as feminine-only?--Urszag (talk) 06:33, 11 May 2024 (UTC)Reply

@Urszag Yes, I think so. We have done that for some other languages, e.g. French adjective headwords have an |onlyg= parameter that you can set to a gender (m or f), a number (s or p) or a gender-number combination (e.g. m-s, f-p). Now mind you, some of the terms that make use of this (e.g. enceinte) would IMO be better treated as conventional adjectives that are simply rare in other genders or numbers; there's even a usage note for enceinte that says

The masculine form enceint is occasionally used with regard to transgender men, for species with male pregnancy such as seahorses, as well as in metaphorical, jocular, or fantastic contexts.

And indeed you will find that in Spanish, the corresponding words like encinto, embarazado and preñado are given in the masculine, with quotes establishing that such usage does exist. But if the term is indeed unattested in some genders, I would definitely support adding a flag to suppress those genders in the declension table and make sure that the title next to the declension table reflects this. Benwing2 (talk) 06:53, 11 May 2024 (UTC)Reply
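
To make the idea of a gender-restricting flag concrete, here is a minimal sketch of how a value in the style of the French |onlyg= parameter described above (m, f, s, p, or a combination such as m-s or f-p) could be turned into the set of table cells to show. It is not the actual module code; the function name `allowed_cells` and the two-gender, two-number grid are assumptions made for illustration, and a Latin version would also need a neuter gender and cases.

```python
GENDERS = ("m", "f")
NUMBERS = ("s", "p")
ALL_CELLS = [(g, n) for g in GENDERS for n in NUMBERS]


def allowed_cells(onlyg=None):
    """Return the (gender, number) cells that a restricted table should display."""
    if not onlyg:
        return list(ALL_CELLS)
    gender = number = None
    for part in onlyg.split("-"):
        if part in GENDERS and gender is None:
            gender = part
        elif part in NUMBERS and number is None:
            number = part
        else:
            raise ValueError(f"unrecognized restriction: {onlyg!r}")
    # A cell is kept when it matches every restriction that was given.
    return [
        (g, n)
        for g, n in ALL_CELLS
        if gender in (None, g) and number in (None, n)
    ]
```

For example, `allowed_cells("f")` keeps only the feminine singular and plural cells, which is the behaviour a feminine-only flag for words like silvicultrīx would need; the table title could be derived from the same parsed restriction.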

Default font size for polytonic Greek

Does anyone else think the default font size for polytonic Greek should be increased? It looks small to me, especially in SBL Greek, which is the font with the highest priority in the CSS. Weylaway (talk) 19:55, 12 May 2024 (UTC)Reply

It seems OK to me; can you post screen shots showing how it looks for you? BTW I do notice when comparing polytonic Ἀριστοτέλης (Aristotélēs) to non-polytonic Αριστοτέλης (Aristotélis) that the latter seems relatively ugly because it uses a sans-serif font. Benwing2 (talk) 21:17, 12 May 2024 (UTC)Reply

FWIW, this is what it looks like for me (top is the level of zoom I usually use, bottom is 100% zoom). - -sche (discuss) 22:13, 12 May 2024 (UTC)Reply

Interesting; your polytonic looks sans-serif, while your non-polytonic looks more serif, which is the reverse of what I see. Benwing2 (talk) 22:19, 12 May 2024 (UTC)Reply

For non-polytonic, it's using Gentium for me, the second font in the list (because I don't have Athena, the first font in the list). For polytonic it was using DejaVu Sans (the third font in the list, because I didn't have SBL Greek or Athena). Now that I've downloaded SBL Greek, it looks like this, displaying polytonic in SBL Greek, which is heavily (IMO distractingly) serifed and slanted and handwritingesque. SBL Greek polytonic text is indeed smaller than other text, though not unreadably so IMO. (But I do think SBL Greek looks worse than other fonts in the list, so I might be tempted to move it down in the ranking... but perhaps it is first because it has the best diacritic support?) - -sche (discuss) 06:56, 13 May 2024 (UTC)Reply

@-sche Hmm. I checked using the Computed tab in Chrome and, if it's correct, my polytonic font is using Times (postscript name "Times-Roman"), which is way down the list, and my non-polytonic font is Arial Unicode MS, which is likewise way down the list. Maybe this is because I'm on a Mac, although I'm surprised there aren't more fonts installed by default. BTW here is what it looks like: [6] Benwing2 (talk) 07:25, 13 May 2024 (UTC)Reply

I personally think SBL Greek looks the best and I use a custom style to display it at 130% size. I was just thinking of learners who may find the default hard to read – in -sche's example the x-heights of the Greek letters are smaller than those of the Latin letters next to them. But clearly there is variation across operating systems and difference in personal preference, so maybe it's better for me just to use my custom style. Weylaway (talk) 17:46, 13 May 2024 (UTC)Reply

Thank you @Weylaway for your question. Greek (script Grek, polytonic and monotonic alike) looks miserable and small at en.wikt. I have no idea what the default font is for this site (as in sc=Latn), or what the designers wish their readers to view. The default looks much better. Or perhaps an equivalent making sure that the grave accent is shown (not a vertical accent). User:Sarri.greek/fonts#default. Ancient Greek inflectional tables, which should have a 110%, are even smaller. Even the prosody marks look better with normal default fonts. Thank you again for raising this. ‑‑Sarri.greek ^♫ I 21:38, 13 May 2024 (UTC)

Blottoism

Don't let him die forgotten. Talk:upput. I hope my gastric bad temper will also survive. True story: our librarian/researcher has asked me why our system contains some non-existent records. It's because Artefactual's API is wrong. But I've got half a day of billable debugging before I can prove it. Equinox ◑ 02:00, 13 May 2024 (UTC)Reply

Old Pskovian

I propose to add an etymological code for Old Pskovian (~zle-ops?), as part of Old Novgorodian (zle-ono) in the branch of East Slavic languages. Cases of mention of Old Pskovian. This is a dialect and variety of Old Novgorodian, which was in ancient Pskov and its environs (https://ru.wikipedia.org/wiki/Древнепсковский_диалект). What do you think @Thadh? AshFox (talk) 04:29, 13 May 2024 (UTC)Reply

Don't see any issue with this. If nobody opposes, I'll add it in a week or so. Thadh (talk) 19:23, 13 May 2024 (UTC)Reply

No objections. Benwing2 (talk) 20:32, 13 May 2024 (UTC)Reply

@Thadh it seems good, no one is against it. A week has passed. Please add Old Pskovian when you have free time. AshFox (talk) 09:55, 20 May 2024 (UTC)Reply

Should we split up multi-language pages?

Currently, a user trying to get to da#Zhuang on desktop has to:

Type "da" into the search bar.
Wait for the massive page to load (this could take a while on older devices or on slower connections).
Scroll for a very long time until reaching "Zhuang" in the table of contents.
Click it.

On mobile, the situation is even worse since in fact there is no table of contents.

Maybe a better option would be to have da function as a sort of disambiguation page which lists all of the available languages in a compact table. In this case, the user would quickly be able to locate and click "Zhuang" which would take them to da/Zhuang. Also, since da and da/Zhuang would both be very compact, the loading times would be practically instantaneous.

Doing this would also solve all of our Lua-related problems (at least for the near future). What do we think @Chuck Entz, Benwing2, Theknightwho? Ioaxxere (talk) 05:50, 13 May 2024 (UTC)

@Ioaxxere This has been proposed various times but it would be an enormous undertaking and would (of course) have some downsides, such as requiring more clicks to view anything and not so easily being able to see the similarities among different languages that share the same spelling. Maybe a less radical solution for the time being would be, as Chuck proposes, to move letter information out of letter pages into an Appendix or something. Benwing2 (talk) 05:58, 13 May 2024 (UTC)Reply

(Not saying I think we should split pages, but) something I suggested in past discussions which would address the "more clicks to view anything" problem is : if we split, transclude the subpages back onto the 'main' entry, so someone looking up e.g. the main sender page rather than sender/da still sees all the languages. Transclusion could be the default, and for the few pages with excessively many language sections where it wouldn't be feasible (particularly because I think transcluding a page causes it to count 2x against the PEIS limit? and causes any templates it transcludes to thus count 4x? and even Tim Starling has said that raising the PEIS limit is not something the devs will do), we could fall back on having a table like Ioaxxere suggests. BTW I think the usual proposal is to use language codes in naming the subpages, rather than language names, which may be long and contain untypable characters. Either way, we have to watch out for conflicts with pages that actually contain slashes, e.g. s/he.
In this case, I'm inclined to agree that moving the letters to alphabet appendices is a solution to most of the immediate problem. Let something like Appendix:Dutch alphabet give the names and pronunciations of all the Dutch letters in one place, rather than giving them on a, b, etc. Just have a ==Translingual== entry on a and maybe categorize all the Appendices that use a into a category like "Category:Alphabets that use Latin a" or something and then have a link in the Translingual entry to that category...? - -sche (discuss) 06:31, 13 May 2024 (UTC)Reply

@-sche: I'm very confused as to why letter entries are being mentioned. The initial page mentioned is da, which isn't even about a single letter. There's only one "letter name" entry on it, Tagalog da. mi is one of the worst pages, if not the worst, when it comes to this, so again, focusing on letters is not the way to go about it. This type of change was proposed in 2020 and was not passed then either. Also, as I mentioned on said vote, most of the bytes on a aren't from letter entries either. Let's focus on finding a solution that actually fixes the overarching problem, rather than throwing us into the issue of letters again. AG202 (talk) 14:13, 13 May 2024 (UTC)

Additionally, the notion of moving letter entries comes from a clear, whether intentional or not, Latin script language-bias. As I mentioned in the vote, entries like ㄴ (n) should not belong in an Appendix or Translingual just because some other languages on Wiktionary do their letter entries poorly. AG202 (talk) 15:24, 13 May 2024 (UTC)Reply

@AG202: It's definitely not a Latin-script bias. The same applies to Cyrillic, Greek, Perso-Arabic, Georgian... The fact that certain writing systems are slightly more complex and language-specific doesn't mean that all those that aren't deserve an entry for every language and every grapheme.

By the way, I don't see a Jeju entry for ㄴ, and something tells me that if you were to duplicate this content three times (Middle Korean, Korean, Jeju) you will not be such a fan of keeping the three entries on one page. Thadh (talk) 16:36, 13 May 2024 (UTC)Reply

I would be. Just as I am with every other letter entry. The only reason I haven't created them myself is because I haven't had the time to. Also, even if it's not from a Latin-script bias, I still do not think that several smaller language communities are being considered.

That being said, this still doesn't address my main point that this doesn't actually fix the problem. If you don't want letter entries that's fine, that's another conversation, but let's not pretend that it's going to fix this current lua memory issue. It's not even a solution to "most" of the immediate problem at pages like a, nor does it fix anything at all at pages like da or mi or la. AG202 (talk) 17:21, 13 May 2024 (UTC)Reply

For reference at a: there are 170 L2s, and 64 letter entries (with the header "Letter"). So not even half of the L2s there have letter entries. It's frankly overblown. That's not even considering the L2s that have significantly more content in their non-letter entries than in the letter entry, such as English a. AG202 (talk) 22:17, 13 May 2024 (UTC)

@AG202 Not sure it's overblown, since a is the article causing the most headache in terms of Lua and parser limits. a is currently at 1.887MB out of an allowed 2.097MB in post-expand include size, and removing all the letter entries would bring that down noticeably. People are also thinking ahead to the fact that there are 5,000+ languages that use the Latin script, and we can't possibly have an entry for the letter a in every such language; whereas the number of languages where a is a word (excluding those where it's the name of the letter a) is much more limited. (Note also that when I just previewed a, I got a CPU timeout. User:Surjection may have inadvertently made this worse by lite-ifying a bunch of the templates again; the preview showed 43 seconds of CPU time and 53 seconds of real time, vs. 23 seconds of CPU time and 30 seconds of real time when previewing a slightly earlier version not using the lite templates. YMMV though, as there is a lot of variation in the CPU times.) Benwing2 (talk) 22:44, 13 May 2024 (UTC)Reply

@Benwing2: "Removing all the letter entries would bring that down noticeably", can we actually get the numbers for this? Because when I tested that back in 2020, that wasn't the case. CC:@Surjection

"Whereas the number of languages where a is a word (excluding those where it's the name of the letter a) is much more limited": I also don't think that's the case. Again, looking at what's on the ground right now, there are significantly more non-letter entries that are taking up "space". There are only 64 L2s with letters or letter names, out of 170. Clearly the focus should be elsewhere.

Letter entries don't even take up that much space relatively; they don't have quotations like Sassarese a and they don't need usage notes like Serbo-Croatian a. I'm much more worried about 100 more L2s with non-Letter POSs as that's more realistic and takes up significantly more space, instead of a very rare possibility of 5000+ letter entries. Hell, the English entry at a has twelve etymologies outside of the letter entry, and is itself equivalent to several languages' letter entries.

Let's focus on actual long-term solutions, like the TOC option being discussed below, rather than taking out information that users like myself and others find useful. AG202 (talk) 23:18, 13 May 2024 (UTC)Reply

User:Thadh/a I've removed all noun senses for letter names (except for the Norwegian figurative ones) and letter senses. I don't know how to measure whether the page loads better, but at least there are no lua errors anymore. Thadh (talk) 13:47, 14 May 2024 (UTC)Reply

Thank you! Yeah looking at the page you've linked, removing the letters (and letter names which I thought were a separate issue) only removed 27225 bytes, which may seem like a lot, but that's out of 197738 bytes initially. That means that letters & letter names only account for ~14% of the bytes on the page, which is exactly what I was talking about. Letters on their own account for even less. We'd reach the max byte limit again in no time even if we barred letters from being added. (Also by my count only 14 L2s have solely a letter and/or letter name entry out of 170) AG202 (talk) 14:19, 14 May 2024 (UTC)Reply
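The arithmetic in the comment above can be checked with a few lines. This is only a sketch: the two byte counts are the ones quoted in the comment, and the variable names are illustrative.

```python
# Figures quoted in the comment above; the names are illustrative.
bytes_before = 197_738   # page size before the letter content was removed
bytes_removed = 27_225   # bytes removed with the letters and letter names

share = bytes_removed / bytes_before
bytes_after = bytes_before - bytes_removed

print(f"letters and letter names: {share:.1%} of the page")
print(f"remaining after removal: {bytes_after} bytes")
```

The share comes out just under 14%, which matches the "~14%" figure given above.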

«no table of contents»? What do you mean? Tollef Salemann (talk) 06:19, 13 May 2024 (UTC)Reply

Are the pages titled "Appendix: Variations of the letter [LETTER]" complete? Could they be made to include all the languages that use the given letter? If they could, that would eliminate one advantage of the current letter-page structure: comparison of letter use across languages. DCDuring (talk) 12:00, 13 May 2024 (UTC)

At the moment, the "Variations of" appendices are language-neutral. We could agree to change that, but I think a separate set of appendices would probably be the better approach. —Mahāgaja · talk 12:38, 13 May 2024 (UTC)Reply

That's not what those pages are for - those are to list confusable/similar terms so that we don't clutter {{also}} with massive lists at the top of a page. Theknightwho (talk) 12:39, 13 May 2024 (UTC)Reply

Right, I am wondering about extending their purpose to overcome a shortcoming of off-loading letters to language- or script-based appendices: that one loses the ability to compare across languages. If we need to create something additional to preserve the purity of their current purpose, I would not object. It just seems that their current narrow purpose could be broadened to make them more effective even at achieving their current purpose. DCDuring (talk) 13:52, 13 May 2024 (UTC)

I suppose, but I'm not sure how useful it'd be (especially with letters like "a"). It'd make more sense with less common letters, though. Theknightwho (talk) 14:01, 13 May 2024 (UTC)Reply

Based on the persistence of this problem over at least a decade, it seems that we are forced to use incremental solutions. Not all incremental solutions have to be technical. As was observed above, letters (and symbols) are not like the rest of our content, so perhaps we can use a different content model for them. If the different content model allows us to reduce the number of module-error pages, our attention will be led to a somewhat different set of violations, requiring or suggesting different partial solutions. A different content model for letters may lead to a better (more comprehensive) handling of letters. There are more than 60 Letter L3/4 headers on a. That does not seem a negligible amount of content to offload, but it seems likely to be smaller than the number of languages likely to use the letter a. DCDuring (talk) 17:19, 13 May 2024 (UTC)Reply

The problem isn't just individual letter pages, though. A page like [[an]] is also difficult to navigate around and could benefit from being split up. On the other hand, at [[bachall]] I find it very convenient to have the Old Irish entry and its homographic Irish and Scottish Gaelic descendants all on the same page. —Mahāgaja · talk 12:34, 13 May 2024 (UTC)Reply

@Benwing2: Well, the number of clicks is equal if you count the table of contents. Also, it seems like splitting the letter entries off doesn't address the Lua issue since letters make up only a small proportion of the content at a.

@Tollef Salemann: If you don't have a phone handy, go to https://en.m.wiktionary.org/wiki/da and resize your browser to make it narrow — the table of contents disappears.

@Mahagaja, -sche: Yes, there could be some kind of controller template on da that would automatically transclude the pages if the number of languages is under a certain reasonable value (say 5), otherwise display the disambiguation table. Ioaxxere (talk) 13:32, 13 May 2024 (UTC)Reply
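The "controller template" idea above boils down to one threshold check. A minimal sketch, assuming a hypothetical function name and return labels; the default of 5 is the example value suggested in the comment, not an agreed setting.

```python
def render_mode(language_count: int, max_transcluded: int = 5) -> str:
    """Pick how a split page such as "da" would present its language subpages.

    Hypothetical helper: "transclude" means the subpages are shown inline,
    "table" means a compact disambiguation table is shown instead.
    """
    if language_count < max_transcluded:
        return "transclude"
    return "table"


# A page with 3 languages stays inline; one with 91 gets the table.
print(render_mode(3))
print(render_mode(91))
```

Raising `max_transcluded` (for example to the 20 or so mentioned later in the thread) only changes the cut-off, not the logic.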

What I find convenient about having all three languages at [[bachall]] is not so much being able to read them all at once as being able to edit them all at once, and if they're transcluded from three separate pages named "bachall/ga", "bachall/sga", "bachall/gd" or the like, then that's not convenient anymore. —Mahāgaja · talk 14:30, 13 May 2024 (UTC)Reply

No, it doesn't disappear on my phone, nor on my neighbor's. We both use Apple phones (iPhone). What do you all mean by "it disappears"? It just becomes smaller, so you have to tap it to make it bigger, and then you can navigate easily, like on Wikipedia. Are you all talking about non-Apple devices? Tollef Salemann (talk) 17:44, 13 May 2024 (UTC)

How about only splitting up pages over a certain size? At the moment when I look up a short word I type "da#Zhuang" in the search bar to get me straight to the entry I need, but that's still annoying. —Caoimhin ceallach (talk) 13:52, 13 May 2024 (UTC)Reply

I don't like the idea of splitting pages. Is there any way to personalise the table of contents? Being able to collapse language names per letter of the alphabet would go a long way. Thadh (talk) 14:03, 13 May 2024 (UTC)Reply

@Thadh This may be possible with CSS. I know for example that User:Sarri.greek has been experimenting with different layouts for the TOC. If not, and you can create a clear plan for what functionality you'd like, the MediaWiki devs might be amenable (e.g. if you contact Tim Starling directly; he's the one who increased our memory and timeout limits). Benwing2 (talk) 20:32, 13 May 2024 (UTC)Reply

@Thadh See [7] for an example of what Sarri did. Benwing2 (talk) 21:16, 13 May 2024 (UTC)Reply

That looks good! Maybe this at least solves the issue of navigation. Thadh (talk) 21:17, 13 May 2024 (UTC)Reply

I agree - it does look good. Theknightwho (talk) 21:19, 13 May 2024 (UTC)Reply

Not bad. Is this intended as a default for certain types of pages (What kind?), opt-in, or only in custom JS/CSS? DCDuring (talk) 01:38, 14 May 2024 (UTC)Reply

@DCDuring I think we could clean it up a bit and use it for long pages where we'd otherwise compress the TOC by omitting subheadings. Benwing2 (talk) 01:59, 14 May 2024 (UTC)Reply

@Benwing2, Thadh, Theknightwho: I rewrote the template: {{minitoc}}. Maybe it can be automatically added to any entry with more than (say) ten languages. Ioaxxere (talk) 05:14, 14 May 2024 (UTC)Reply

@Ioaxxere Looks good to me but let's solicit more comment first. Also it would be great if there was a way, after you expand it, to further expand it to show the subheadings. Some people (maybe User:RichardW57?) have complained that with the shortened TOCs you can't so easily navigate to the subheadings of a particular language. Benwing2 (talk) 05:19, 14 May 2024 (UTC)

M @Ioaxxere, thank you for your Template:minitoc! and your help for [8], [9], [10], [11],[12], ... Also, perhaps variations for few (1-5) languages by L2 something like this? For related language periods like this? (Reconstruction of the magic word __TOC__ because it might be taken away from us in future skins like vector22? (e.g. discussion@el.wikt for modifications) Thank you, thank you! ‑‑Sarri.greek ^♫ I 05:36, 14 May 2024 (UTC)Reply

@Sarri.greek: Yes, a multi-column TOC is certainly possible. You can add the following into your common.css:

div.toc > ul {
    display: flex;
    flex-direction: column;
    gap: 0 20px;
    flex-wrap: wrap;
    overflow: auto;
    max-height: 30em; /* change max-height as desired */
}

div.toc {
    width: 100%;
}

I can't promise you that it'll look very good, though. Ioaxxere (talk) 06:08, 14 May 2024 (UTC)Reply

@Ioaxxere @Sarri.greek I've rewritten Module:minitoc somewhat to take advantage of the pre-computed list of L2s that is already calculated by Module:headword/page, since it can cope with a bunch of weird edge-cases that can't be dealt with by a simple Lua pattern (and it's also faster, since it means we don't need to parse the page again). Theknightwho (talk) 14:00, 14 May 2024 (UTC)Reply

Re complaining when TOCs are collapsed, FWIW I have also complained that when TOCs are collapsed you can't easily navigate to subsections of a given language section, but as long as we're only deploying that on entries with a truly excessive number of L2s, like a, I'll live with it. If we're deploying it on tons of pages, e.g. cat—11 L2s—I'm less happy. Maybe if we're only deploying it on mobile, that's better than also deploying it on desktop, OTOH someone was just complaining in another discussion that entries are hard to navigate because TOCs are collapsed or sometimes hidden(?) on mobile. So maybe the ideal would be to make it a gadget/pref, whether opt in or opt out, so people who wanted collapsed TOCs could get them—maybe even on all entries, if they wanted—and people who wanted uncollapsed TOCs on all entries could keep them. Or to make it possible to expand the collapsed TOCs (all at once or on a per-L2 basis) as mentioned above.
Not directly relevant to this specific concern, but relevant to the general topic is Wiktionary:Grease_pit/2021/June#Experience_on_mobile which also links a number of other prior discussions; see also some history at Wiktionary:Beer_parlour/2021/April#collapsed/minimized_language_headers. - -sche (discuss) 15:37, 14 May 2024 (UTC)Reply

I can understand why __NOTOC__ has been included, since it avoids having two TOCs with Vector, but with Vector 2022 it's a bit detrimental, since it means there's no longer a TOC in the left-hand sidebar, which can normally be used even if you're scrolled halfway down the page. Theknightwho (talk) 15:46, 14 May 2024 (UTC)

@Theknightwho: That's a good point. It's actually possible to specify by-skin behaviour by changing MediaWiki:Vector-2022.css and similar, so maybe we could use that to override __NOTOC__ in Vector 2022 since the TOC doesn't take up space in the document flow anymore. Ioaxxere (talk) 16:04, 14 May 2024 (UTC)Reply

Instead of moving the Zhuang entry for da to da/Zhuang, we could move it to Zhuang/da, and we could move all the Zhuang entries that way and add a specialised search bar searching only in entries starting with “Zhuang/”, like we do with the search bar on top of the beer parlour here. That would reduce the scrolling and the clicks for people interested in Zhuang. MuDavid 栘𩿠 (talk) 01:54, 14 May 2024 (UTC)Reply

Iff we split (which I am not saying I'm in favor of), I like the idea of putting the language first so people can make language-specific searches. But what do you think of using language codes rather than language names? Language names can be very long (e.g. "Southern Valley Yokuts") and can contain many hard-to-type characters (how likely is it that the average user can type "ǃXóõ"?)... although I concede we do name reconstructed entries using language names. - -sche (discuss) 02:06, 19 May 2024 (UTC)Reply

@-sche Personally I wouldn't use Reconstruction sections as an example of good UI design; I find it annoying to have to type out the whole language name (not to mention the word "Reconstruction"; is there an abbreviation for this namespace?). As for putting the lang name or code first, I see advantages and disadvantages. Assuming we have a page at the bare word that links to all the lang-specific pages, the advantage for experienced users is that the lang-specific pages won't so easily show up in autocomplete; but this may be a disadvantage to new users, who will see the lang-specific pages autocompleted if the lang follows but not if it precedes, and who are unlikely to be familiar with Wiktionary lang codes. Benwing2 (talk) 03:07, 19 May 2024 (UTC)

Re abbreviation: RC, e.g. RC:Proto-Germanic/-janą. (As you probably recall, people objected to past proposals to set up RC entries like regular entries i.e. put all languages that have a word *jab on one page, because reconstruction orthographies are language-specific and what j or a or b means in the notation for one language is different from what it means in another. I...can't say I find that persuasive, because what non-reconstructed orthographies mean by j, a, b also differs—Hmong uses b to represent a tone, vs other languages using it for /b/, /β/, etc—but...) - -sche (discuss) 03:39, 19 May 2024 (UTC)Reply

@-sche Thanks. Yeah I don't find that argument persuasive either for the reasons you give. Benwing2 (talk) 03:51, 19 May 2024 (UTC)Reply

I understand the logic behind separating languages in this case, but I agree that it's extremely annoying to search for and definitely unintuitive for new users. Maybe we should go as far as to treat reconstructed terms the same way as attested entries, i.e. have Reconstruction:Proto-Germanic/-janą and a term with the literal spelling *-janą on the same page or (under the proposal) page group. Ioaxxere (talk) 04:01, 19 May 2024 (UTC)

(@Benwing) If we display all the language-specific subpages on the bare entry (i.e. transclude the Zhuang da subpage onto da), then I am not concerned about whether new users see the subpage (whether "da/Zhuang" or "Zhuang/da") in the search bar when they type "da...", because I wouldn't expect such users to know/care about subpages, and I figure it's enough that they can type "da" and get to the "da" entry, where they can see the Zhuang content. However, it occurs to me that an advantage to "da/Zhuang"-style naming might be that whatever template we use at [[da]] could just pull and display all subpages of whatever page it's on (and we would only need special handling for a small number of pages, e.g. s/he is not a subpage of s), whereas if we use "Zhuang/da"-style naming, it seems like it would have to go through a list containing all of the thousands of possible languages we include, to check which exist for a given page. (No?) Would there be a difference in which approach is faster, Zhuang/da vs da/Zhuang...? I do think the ability to type "Zhuang/..." into the search bar, and thus narrow whatever you type next so that you're only searching in Zhuang, would be a benefit to that ordering (say I want to quickly check whether any Zhuang entries start with str-, I could type "Zhuang/str" into the search bar and see what results autocomplete suggests/finds), but not if there are drawbacks, like if it would complicate or slow down the 'transclude all subpages onto the main page' template. (And I'm still not actually in favour of splitting entries onto subpages at all, although if it came to a vote I don't know if I'd oppose or just abstain.) - -sche (discuss) 04:05, 19 May 2024 (UTC)

@-sche: No, the pages can't be transcluded since that would lead to the massive pages we were trying to avoid in the first place. More realistically there would be a {{minitoc}}-like navigation table which could dynamically transclude pages through JavaScript (of course, there would be a less-convenient alternative for users not running JavaScript). By the way, I prefer da/Zhuang over Zhuang/da since it makes it easier to autocomplete queries. A user only has to type "da/z" for da/Zhuang to be the only valid completion. Ioaxxere (talk) 04:29, 19 May 2024 (UTC)Reply
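The autocomplete argument for the two naming orders can be illustrated with plain prefix matching. This is a rough sketch: the title lists are made up for illustration and are not checked against the site, and real search autocomplete may rank or match differently.

```python
def completions(prefix: str, titles: list[str]) -> list[str]:
    """Return the titles that start with the typed prefix, ignoring case."""
    typed = prefix.casefold()
    return sorted(t for t in titles if t.casefold().startswith(typed))


# Hypothetical page titles under each naming scheme.
word_first = ["da", "da/Danish", "da/Swedish", "da/Zhuang", "dab", "dad"]
lang_first = ["Zhuang/da", "Zhuang/daeuj", "Zhuang/dah", "Zulu/da"]

# Word first: a short query pins down a single subpage.
print(completions("da/z", word_first))

# Language first: the same prefix idea restricts the search to one language.
print(completions("Zhuang/da", lang_first))
```

With word-first titles, "da/z" is enough to single out one page; with language-first titles, a prefix such as "Zhuang/" instead acts as a per-language filter, which is the benefit described earlier in the thread.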

@-sche I didn't even think of that; my assumption was that adding a new language would entail both adding a split lang-specific page and modifying the combined page to know about the language in question, which is definitely a drawback to the split-lang approach. But if we put the lang name or code last, yes, it should be possible to use the prefix-pages functionality to automatically find the split-lang pages. Benwing2 (talk) 04:29, 19 May 2024 (UTC)
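Finding the split-language pages by prefix, including the s/he conflict raised earlier, might look roughly like this. It is a sketch under assumptions: an in-memory list stands in for the site's prefix lookup, and the function name, titles, and exception set are all hypothetical.

```python
def language_subpages(base, titles, not_subpages=frozenset()):
    """List titles of the form "<base>/<language>".

    `not_subpages` holds real entries whose titles merely contain a slash,
    so they are not mistaken for language subpages.
    """
    prefix = base + "/"
    return sorted(
        t for t in titles
        if t.startswith(prefix) and t not in not_subpages
    )


# Hypothetical titles; "s/he" is an ordinary entry, not a subpage of "s".
titles = ["da", "da/Danish", "da/Zhuang", "s", "s/English", "s/he"]

print(language_subpages("da", titles))
print(language_subpages("s", titles, not_subpages={"s/he"}))
```

With word-first naming a single prefix query per page is enough, which is why no per-page list of languages would need to be maintained by hand.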

@Ioaxxere It should still be possible to transclude in most cases, e.g. only not transclude if there are more than say 20 or so languages. Benwing2 (talk) 04:31, 19 May 2024 (UTC)Reply

Yes, to clarify, because two ideas were discussed earlier and I get the sense Ioaxxere and I may be talking about different ideas(?)... I was saying it'd make sense to have a template automatically find any subpages and transclude them, rather than making users manually add new subpages to a list (since we already see how often users tag but neglect to list RFVs, RFDs, etc!), if we split all pages. But if we only split the large, Lua-memory-error-having or PEIS-limit-reaching pages that need splitting, Ioaxxere is right that it doesn't make sense to transclude anything. I haven't run the stats, but I would hazard a guess that the majority of pages on Wiktionary have only 1 L2 (maybe 2), so splitting all our millions of pages just because a few are too large to display does have obvious drawbacks, letting a tiny tail wag an enormous dog. It would be a lot less disruptive to only split the handful of pages that need it. - -sche (discuss) 06:34, 19 May 2024 (UTC)Reply

100% agreed. Benwing2 (talk) 07:25, 19 May 2024 (UTC)Reply

On the other hand, the overwhelming majority of pages for English lowercase 4-letter nouns starting with 'ta' have pages with entries for more than one language, and many of our inflection tables are littered with orange links. --RichardW57m (talk) 14:40, 23 May 2024 (UTC)Reply

Enabling categories for logged-out users

Tracked in Phabricator
Task T365323 Resolved

Currently, categories are hidden on mobile unless a user is logged in and has "advanced mode" enabled. I don't think there's any good reason to do this since categories are a pretty important part of the site. Apparently we need to get community consensus and then open a Phabricator request to set $wgMinervaShowCategories['base'] = true; Would you support this? Ioaxxere (talk) 14:16, 14 May 2024 (UTC)Reply

@Ioaxxere

Support I had always assumed there was some reason for not doing so already such as:

not making entries look too cluttered
categories are too technical for viewers of Wiktionary who do not edit
or there are technical difficulties involved

Kutchkutch (talk) 14:27, 14 May 2024 (UTC)Reply

Support. Binarystep (talk) 14:42, 14 May 2024 (UTC)Reply

I assume this would slow things down a bit for all users. How much? DCDuring (talk) 14:49, 14 May 2024 (UTC)Reply

Support. Benwing2 (talk) 14:55, 14 May 2024 (UTC)Reply

I don't think this would cause any noticeable slowdown, even on very large pages. Theknightwho (talk) 14:57, 14 May 2024 (UTC)Reply

Support — SAMEER (؂・؄・؏) 18:01, 14 May 2024 (UTC)Reply

Support, and I wish Wikipedia would follow suit but alas. lattermint (talk) 23:38, 14 May 2024 (UTC)Reply

Support - -sche (discuss) 01:32, 15 May 2024 (UTC)Reply

Support Fay Freak (talk) 01:55, 15 May 2024 (UTC)Reply

Strong support — SURJECTION ^{/ T / C / L /} 12:48, 15 May 2024 (UTC)Reply

Support CitationsFreak (talk) 16:45, 15 May 2024 (UTC)Reply

Support Vininn126 (talk) 17:23, 15 May 2024 (UTC)Reply

Support Theknightwho (talk) 17:26, 15 May 2024 (UTC)Reply

Strong support AG202 (talk) 06:14, 16 May 2024 (UTC)Reply

Strong support Thanks for proposing this. I too had always assumed there must have been some kind of technical/UX reason for not implementing this already, but none has been forthcoming. Voltaigne (talk) 07:59, 16 May 2024 (UTC)Reply

It seems the consensus is overwhelming. Has a Phabricator request been opened yet? — SURJECTION ^{/ T / C / L /} 19:45, 18 May 2024 (UTC)Reply

@Surjection: phab:T365323. Ioaxxere (talk) 01:29, 19 May 2024 (UTC)Reply

This has been merged into the next MediaWiki release, so it should start working relatively soon. Theknightwho (talk) 08:47, 21 May 2024 (UTC)Reply

It looks like it was just deployed an hour ago. It seems to be fully functional when testing it in private-browsing mode. SAMEER (؂・؄・؏) 20:22, 21 May 2024 (UTC)

Sign up for the language community meeting on May 31st, 16:00 UTC

Hello all,

The next language community meeting is scheduled in a few weeks - May 31st at 16:00 UTC. If you're interested, you can sign up on this wiki page.

This is a participant-driven meeting, where we share language-specific updates related to various projects, collectively discuss technical issues related to language wikis, and work together to find possible solutions. For example, in the last meeting, the topics included the machine translation service (MinT) and the languages and models it currently supports, localization efforts from the Kiwix team, and technical challenges with numerical sorting in files used on Bengali Wikisource.

Do you have any ideas for topics to share technical updates related to your project? Any problems that you would like to bring for discussion during the meeting? Do you need interpretation support from English to another language? Please reach out to me at ssethi(__AT__)wikimedia.org and add agenda items to the document here.

We look forward to your participation!

MediaWiki message delivery 21:23, 14 May 2024 (UTC)Reply

New TOC scheme

After thinking about our TOC issues a bit more I feel like it's impossible to have a one-size-fits-all system that's convenient for every user. Instead, I think we should have a different scheme for different skins. What do you think about this proposal? Pinging those who participated in the discussion above: @-sche, Benwing2, Theknightwho, DCDuring, Thadh.

Skin	1-4 L2s	5-9 L2s	10-19 L2s	20+ L2s
Vector (and other old skins)	default TOC	default TOC	default TOC	mini TOC
Vector 2022	default TOC	default TOC	Both	Both
Mobile	default TOC	mini TOC	mini TOC	mini TOC

Ioaxxere (talk) 14:31, 15 May 2024 (UTC)Reply
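The proposed scheme can be read as a lookup from skin and L2 count to a TOC style. A minimal sketch, assuming that each entry in the table applies from its column rightward until the next entry in that row; the skin keys and style labels are hypothetical names, and the thresholds are the ones in the table, which the discussion treats as still open to testing.

```python
from bisect import bisect_right

# Lower bound of each column in the table: 1-4, 5-9, 10-19, 20+.
BUCKET_STARTS = [1, 5, 10, 20]

# One row per skin, one style per column (assumed reading of the table).
SCHEME = {
    "vector": ["default", "default", "default", "mini"],
    "vector-2022": ["default", "default", "both", "both"],
    "mobile": ["default", "mini", "mini", "mini"],
}


def toc_style(skin: str, l2_count: int) -> str:
    """Return the TOC style for a page with `l2_count` language sections."""
    if l2_count < 1:
        raise ValueError("a page needs at least one L2 section")
    column = bisect_right(BUCKET_STARTS, l2_count) - 1
    return SCHEME[skin][column]


# A page with 13 L2s: unchanged on old Vector, both TOCs on Vector 2022.
print(toc_style("vector", 13), toc_style("vector-2022", 13), toc_style("mobile", 13))
```

Keeping the thresholds in one list makes it easy to try other cut-offs during a testing period.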

Testing would be good, especially if the system is to be imposed on IPs as a default, as we have very little information about how normal users use our entries, even basic facts like how many use English L2s only. I assume that other users will be able to override the defaults. DCDuring (talk) 14:56, 15 May 2024 (UTC)Reply

@DCDuring: Users would be able to override the table with some simple CSS settings (see User:Ioaxxere/common.css for an example). The code is admittedly ugly but this comes with the benefit of virtually complete control over the output. As for what normal users think, maybe this is a good opportunity to "connect with readers" @Vininn126? Ioaxxere (talk) 16:30, 15 May 2024 (UTC)Reply

We probably need a larger followership before we start making questionnaires. Vininn126 (talk) 10:29, 17 May 2024 (UTC)Reply

Sounds good to me in principle. Thadh (talk) 15:08, 15 May 2024 (UTC)Reply

@Ioaxxere Like @Thadh I would say "sounds good in principle"; I have no specific objections but I think there should be a period of testing before committing to particular numbers of L2's for controlling the TOC appearance. Benwing2 (talk) 00:04, 18 May 2024 (UTC)Reply

Could we have numberings? (too difficult to count without them), possibly 'show' the frame default? & trial examples also at pages like this one? (probably with not centered text)? Thank you ‑‑Sarri.greek ^♫ I 00:32, 18 May 2024 (UTC)Reply

@Benwing2: For testing, here are the current pages which use the template: a (169 L2s), da (91 L2s), rock (13 L2s), small (8 L2s), and fish (2 L2s, so no effect). The L2 numbers can be edited at any time here: Module:minitoc/styles.css. Ioaxxere (talk) 03:40, 18 May 2024 (UTC)Reply

Should entries be categorized according to speculative etymologies?

If a word’s etymology is sufficiently disputed or uncertain to merit an {{uncertain}}, should that word be categorized as coming from the proposed sources? For instance, should Vulgar Latin *tīrāre be categorized as coming from Old Persian, from Proto-Germanic, and from Greek according to the various competing theories? I’m inclined towards ‘no’, since we know for a fact that at least two of the three categorizations will inherently be wrong. (And in all likelihood all three of them are.) Yet I often find entries where this sort of categorization has been done anyway. Nicodene (talk) 03:39, 16 May 2024 (UTC)Reply

I agree — the entry being in a category called "terms derived from Y" misleadingly implies a level of certainty that doesn't exist. FYI, {{etymon}} lets you explicitly specify whether a derivation is "confident" or "uncertain" so maybe we could use that to generate better categories. Ioaxxere (talk) 05:02, 16 May 2024 (UTC)Reply

On the other hand, if one is looking for Latin words coming from Old Persian, I'd want it in that category rather than having to also look at the category of Latin words possibly coming from Old Persian. One usually has to at least look for the etymology section because of the possibility of Latin homographs. --RichardW57m (talk) 13:55, 16 May 2024 (UTC)Reply

That’s doable by searching for Latin terms of uncertain origin, along with the keyword ‘Persian’. I’d really rather not have such a word be part of the actual category in question since it almost certainly doesn’t belong. Nicodene (talk) 14:30, 16 May 2024 (UTC)Reply

So what is the search URI? Is it not obscure and prone to false detections? It's also a second search specification, and nothing like as simple to type as https://en.wiktionary.org/wiki/cat:Latin_terms_derived_from_Old_Persian. (I only need to type the scheme and underscores to make it look good on this page!) --RichardW57m (talk) 17:07, 16 May 2024 (UTC)Reply

This, and his argument can be inverted: He can search entries containing the category but not {{uncertain}}. Overall both and either is easier. 😃 Fay Freak (talk) 20:59, 16 May 2024 (UTC)Reply

Categories have no inherent truth value. They as well have the issue of containing surface analyses, affixations from previous chronolects: intended to have utility value. Fay Freak (talk) 20:59, 16 May 2024 (UTC)Reply

I don't see what's so confusing. The search is as follows: incategory:"Latin terms with unknown etymologies" "Old Persian". If an argument is to be made that the command in question is obscure, so is the notion of someone wanting to find Latin (~Proto-Romance) words of unlikely Persian origin. Nicodene (talk) 02:07, 23 May 2024 (UTC)Reply
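For anyone who wants a link rather than a typed query, the search quoted above can be turned into a URL. This is only a minimal sketch in Python: it assumes the usual MediaWiki `index.php?search=` entry point on en.wiktionary.org, and `search_url` is a made-up helper name, not part of any existing tool.

```python
from urllib.parse import urlencode

# Assumption: the standard MediaWiki search entry point on English Wiktionary.
BASE = "https://en.wiktionary.org/w/index.php"


def search_url(category: str, phrase: str) -> str:
    """Build a search URL for pages in `category` whose text contains `phrase`."""
    # Same query shape as the one quoted in the comment: incategory:"..." "..."
    query = f'incategory:"{category}" "{phrase}"'
    return BASE + "?" + urlencode({"search": query})


print(search_url("Latin terms with unknown etymologies", "Old Persian"))
```

The resulting link is longer than a plain category URL, but it can be bookmarked or pasted like one.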

"undo" has become "cin gbere le" on History tab


The "undo" button on the edit history is appearing for me as "cin gbere le". I don't know what language this is (if it is a language) but I guess something has gone wrong somewhere in the Wiktionary backend. Smurrayinchester (talk) 12:14, 16 May 2024 (UTC)Reply

Still says "undo" for me. —Mahāgaja · talk 12:15, 16 May 2024 (UTC)Reply

@Smurrayinchester: See w:Wikipedia:Village_pump_(technical)#'Undo'_button_now_says_'cin_gbere_le'. —Mahāgaja · talk 12:17, 16 May 2024 (UTC)Reply

Thanks! Smurrayinchester (talk) 13:08, 16 May 2024 (UTC)Reply

@Mahagaja: That link doesn't work (no such section). Any chance of a permalink? I don't have this problem but I am really curious what "cin gbere le" means. Equinox ◑ 03:30, 23 May 2024 (UTC)Reply

It's the Nupe language: translatewiki:MediaWiki:Editundo/nup. See the archive w:Wikipedia:Village pump (technical)/Archive 212#'Undo' button now says 'cin_gbere_le'. Vriullop (talk) 05:57, 23 May 2024 (UTC)

Creoles using "inh" template for words from lexifier language?


The whole time, I've been using the "der" template for Macanese words that derive from Portuguese, but I've seen some Kabuverdianu entries that use "inh" instead, as well as Solombala English entries that also use "inh" for words from Russian and English, and that's not even a creole. On the other hand, Haitian Creole uses "der" for words from French, and I believe Papiamentu also uses "der" for words from Portuguese and/or Spanish. So is there a specific preferred template for these kinds of things? Are creoles really considered to be "inheriting" words from their lexifier languages? Insaneguy1083 (talk) 17:56, 16 May 2024 (UTC)Reply

@Insaneguy1083: Old, unresolved issue: Wiktionary:Beer parlour/2018/May#Lexifier etymology template? Fay Freak (talk) 21:02, 16 May 2024 (UTC)Reply

Chinese: how should we display alternative readings?


(Notifying Atitarev, Benwing2, Fish bowl, Frigoris, Justinrleung, kc_kennylau, Mar vin kaiser, Michael Ly, ND381, RcAlex36, The dog2, Theknightwho, Tooironic, Wpi, 沈澄心, 恨国党非蠢即坏, LittleWhole): Currently we have a lot of ways to separate alternative readings (e.g. thâng / thóng). I have compiled the currently covered Chinese lects and displayed them here:

In the input, "/" and "," are used to separate alternative readings.
In the input, ";" and "/" are used to separate different sub-lects. (Hakka specifies them using "=" while Wu specifies them using ":".)
In the collapsed output (before you click "more"), ",", ";", and "/" are used to separate alternative readings.
In the expanded output (after you click "more"), ",", ";", and "/" are used to separate alternative readings. Different lects are also displayed in different sections. One thing that is consistent is that "," is used for IPA, because "/" would be confusing. (Note that in Hokkien, Pe̍h-ōe-jī and Tâi-lô use "/" but Phofsit Daibuun uses ",".)

While it would be harder (though not impossible) to unify the inputs, what is more realistic is to unify the outputs. Can (and should) we come to one standard for this? (Obligatory XKCD 927) --kc_kennylau (talk) 18:48, 18 May 2024 (UTC)Reply

I'm not a Chinese editor, but FWIW my instinct would be to use commas like e.g. {{alter}} and other "lists", but... do we anticipate ever needing to list alternative readings of a phrase that contains commas (like 一枝草，一點露)? Because commas would become confusing there. Of course, that is a general issue, also hitting {{alter}} et al (in all languages). If anyone has the energy to code this feature, maybe all these templates (not just for Chinese) that use commas could default to commas, but switch to separating items with semicolons when the lemma form or alt forms / alt readings contain commas? Or provide a parameter someone could set to make the item-separating commas switch to semicolons in such cases? - -sche (discuss) 19:12, 18 May 2024 (UTC)Reply

Yes, that situation does occur, in e.g. 潮州音樂——自己顧自己 which currently displays as ciu4 zau1 jam1 ngok6, zi6 gei2 gu3 zi6 gei2 / ciu4 zau1 jam1 ngok6, gi6 gi1 gu3 gi6 gi1 (with superscripts). --kc_kennylau (talk) 19:14, 18 May 2024 (UTC)Reply

Personally, I consider it suboptimal for the Chinese version and the transliteration/reading/etc to use different punctuation there (dash vs comma) and would prefer if the transliteration/reading also used a dash (or if the Chinese used a comma). (But as long as some Chinese entries like 一枝草，一點露 do use commas, I take the point that the situation does occur.) - -sche (discuss) 01:31, 19 May 2024 (UTC)Reply
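The comma-to-semicolon fallback suggested above could look roughly like this. It is a sketch only, written in Python rather than in the Lua the templates would actually use; `join_items` is a hypothetical helper, and treating both the ASCII comma and the fullwidth comma (，) as "commas" is an assumption.

```python
def join_items(items: list[str]) -> str:
    """Join list items with commas, falling back to semicolons
    when any item itself contains a comma."""
    commas = (",", "\uff0c")  # ASCII comma and fullwidth comma (assumed set)
    has_comma = any(c in item for item in items for c in commas)
    separator = "; " if has_comma else ", "
    return separator.join(items)


print(join_items(["thâng", "thóng"]))
print(join_items(["一枝草，一點露", "一枝草一點露"]))
```

The first call keeps the ordinary comma separator; the second switches to semicolons because one of the items contains a comma of its own.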

Coming from a predominantly Northern Wu editing background, I would prefer that what we do with |w= be adopted in all other lects. However, one thing I would like to ask about is ";", as it can be very confusing (see the previous Discord conversation about 吳). Currently I think there should be some sort of case-by-case arbitration regarding ";" usage rather than fully deprecating it, though if anyone thinks otherwise do speak out — nd381 (talk) 19:46, 18 May 2024 (UTC)

Yes, thank you for bringing that up, I think that would be very relevant to this thread. Basically, to summarize for other people, Wu currently covers more than 10 lects all in one parameter that corresponds to "Northern Wu" (aka "Taihu Wu"), and we still don't quite have a consensus on what to do under a situation such as:

Lect A: Reading P, Reading Q
Lect B: Reading Q, Reading R
Lect C: Reading P, Reading Q, Reading R.

What is clear is that in such a situation, the collapsed display would most certainly be P / Q / R, but anything beyond that is unclear:

Should the input be A:P,Q;B:Q,R;C:P,Q,R (group by lect), or A,C:P;A,B,C:Q;B,C:R (group by reading)? Or should we allow both?
- Grouping by lect would be inefficient if it happens that a lot of lects share the same reading P (and in fact the Wugniu romanization takes great care to prioritize compatibility between different lects).
- Grouping by reading would make it hard to see at first glance what readings a given lect has, which would arguably be important information (say for someone who is only learning a single lect).
- Allowing both inputs is also not ideal, because standardized inputs are easier to monitor and track (and bot).
I think it should also be noted that to my knowledge Hokkien seems to be grouping by reading.
Similarly there is an issue of how to group the expanded display.
And when it scales up, for say a word of 4 characters, we would definitely run into issues where the listed readings are all very similar, because say Lect A would have a slightly different reading of the first character, while Lect B would have a slightly different reading of the third character, and so on. (See 世界 for an example.)

--kc_kennylau (talk) 21:11, 18 May 2024 (UTC)Reply
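To make the two candidate input shapes concrete, here is a small Python sketch that reads the "group by lect" form and derives both the "group by reading" form and the collapsed display from it. The notation is the illustrative one used in the comment above (lects A, B, C and readings P, Q, R), not the real zh-pron parameter syntax, and all function names are hypothetical.

```python
from collections import defaultdict


def parse_by_lect(text: str) -> dict[str, list[str]]:
    """Parse 'A:P,Q;B:Q,R' into {'A': ['P', 'Q'], 'B': ['Q', 'R']}."""
    by_lect = {}
    for group in text.split(";"):
        lect, readings = group.split(":")
        by_lect[lect] = readings.split(",")
    return by_lect


def to_by_reading(by_lect: dict[str, list[str]]) -> str:
    """Regroup by reading, keeping readings in first-seen order."""
    lects_for = defaultdict(list)
    for lect, readings in by_lect.items():
        for reading in readings:
            lects_for[reading].append(lect)
    return ";".join(
        f"{','.join(lects)}:{reading}" for reading, lects in lects_for.items()
    )


def collapsed(by_lect: dict[str, list[str]]) -> str:
    """Collapsed display: each distinct reading once, in first-seen order."""
    seen = []
    for readings in by_lect.values():
        for reading in readings:
            if reading not in seen:
                seen.append(reading)
    return " / ".join(seen)


data = parse_by_lect("A:P,Q;B:Q,R;C:P,Q,R")
print(to_by_reading(data))
print(collapsed(data))
```

Since either grouping can be derived mechanically from the other in a case like this, one option would be to standardize a single input form and generate the other view for display.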

I think we should use slashes, and also put all readings on new lines (currently Mandarin gets new bullets for each reading, while for Cantonese everything is crammed into one line). New lines should nullify confusion with IPA slashes.

(Also, shilling User:Fish bowl/p/mul#Chinese for consideration. —Fish bowl (talk) 22:10, 18 May 2024 (UTC))

Chinese: Spaces around ellipsis in transliteration?


(Notifying Atitarev, Benwing2, Fish bowl, Frigoris, Justinrleung, kc_kennylau, Mar vin kaiser, Michael Ly, ND381, RcAlex36, The dog2, Theknightwho, Tooironic, Wpi, 沈澄心, 恨国党非蠢即坏, LittleWhole): Apologies for the double ping. 一面……一面…… has pinyin as yīmiàn ... yīmiàn ... (with spaces), while 關……屁事 has pinyin as guān...pìshì (without spaces). Both pinyin are currently created. One can see from the synonyms of these two pages respectively that their synonyms follow the same rule as the pages themselves. The Cantonese module has recently been updated to explicitly only allow the variant without spaces. What should the standard for this be? --kc_kennylau (talk) 18:53, 18 May 2024 (UTC)Reply

spaces look cleaner — nd381 (talk) 19:46, 18 May 2024 (UTC)Reply

also consider: one space, only on the right side of the ellipsis —Fish bowl (talk) 22:15, 18 May 2024 (UTC)Reply

Ditto. Anatoli T. (обсудить/вклад) 00:14, 19 May 2024 (UTC)

I prefer spaces on both sides of ellipses. — justin(r)leung { (t...) | c=› } 18:23, 20 May 2024 (UTC)

With spaces only after the ellipsis. – wpi (talk) 15:06, 23 May 2024 (UTC)Reply
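For comparison, the three spacing conventions raised in this thread can be produced from any existing transliteration with one substitution. This is a hedged Python sketch, not the logic of any actual module: `space_ellipsis` and its style names are invented for illustration, and it only handles the three-dot ASCII ellipsis used in the pinyin examples above.

```python
import re


def space_ellipsis(text: str, style: str = "both") -> str:
    """Normalize spacing around '...' in a transliteration.

    style: 'both'  -> spaces on both sides
           'right' -> a space only after the ellipsis
           'none'  -> no spaces
    """
    before, after = {
        "both": (" ", " "),
        "right": ("", " "),
        "none": ("", ""),
    }[style]
    # Replace each ellipsis, plus any whitespace around it, with the chosen spacing.
    result = re.sub(r"\s*\.\.\.\s*", before + "..." + after, text)
    # Drop the space that would otherwise trail a final ellipsis.
    return result.strip()


for style in ("both", "right", "none"):
    print(style, "->", space_ellipsis("guān...pìshì", style))
```

Whichever convention is chosen, a normalizer of this kind would let existing entries be converted by bot instead of by hand.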

Copyright of definitions


An organization with a partnership with Wikimedia Foundation wanted wiki editors to translate a definition of theirs. They did release the definition with an open copyright license. WMF's position is that sometimes this works.

They are currently getting more information but I wanted to ask here - what is copyright status of definitions, and what is the copyright status of translations? While WMF legal has a project for definitions this could be a good time to talk.

meta:Talk:International_Museum_Day_2024

Bluerasberry (talk) 18:56, 18 May 2024 (UTC)Reply

@Bluerasberry: if you look at the bottom of every page here, you will see the following sentence: "Definitions and other text are available under the Creative Commons Attribution-ShareAlike License; additional terms may apply." In addition, every time an editor submits text, it is also irrevocably released under the GNU Free Documentation License—there is also a message to that effect. Does that answer your question? — Sgconlaw (talk) 22:34, 18 May 2024 (UTC)Reply

@Sgconlaw Thanks, but no it does not.

The Wikimedia Foundation legal team has made a soft legal assertion that translations of definitions are not subject to copyright. In the current WMF project a definition claims a conventional copyright, but since copyright does not apply to translations of definitions, that definition can be directly translated into Wikimedia projects.

I get the Wiktionary policy. My surprise here is about the official WMF position of the copyright of translated definitions. If anyone wishes to press for clarity then getting a WMF commitment to legally back this theory for Wiktionary could be interesting. Bluerasberry (talk) 14:18, 19 May 2024 (UTC)Reply

@Bluerasberry: with respect to the WMF, that makes no sense. A definition is licensed under CC-BY-SA-4.0. This means people are free to "remix, transform, and build upon the material for any purpose, even commercially"—which would include a translation—but subject to properly attributing the material and distributing any new material based on the original material under the same licence as the original. Obviously, then, translations of CC-BY-SA-4.0 material cannot possibly be "not subject to copyright" but must be released under CC-BY-SA-4.0 as well. The position is the same under the (more or less superseded) GFDL—any modified version (which includes a translation) must be released "under precisely this License". — Sgconlaw (talk) 18:30, 19 May 2024 (UTC)Reply

@Sgconlaw At meta:International Museum Day 2024/Translation call, the WMF organized the translation of a definition as an outreach program. The talk page there gives a rationale why. They requested time to sort things out.

I share your perspective on all these things, and think that what you are saying is conventional but that this WMF outreach program is surprising. I appreciate the outreach and partnership. I wish that their programs matched Wikimedia editor practices.

The conversation there links to this conversation, and I think that is as much as I wish to react. Bluerasberry (talk) 15:46, 21 May 2024 (UTC)Reply

@Bluerasberry: I read the discussion at "meta:Talk:International Museum Day 2024" and now have a clearer understanding of what you are asking. You weren't specifically asking about the copyright status of definitions here at Wiktionary, which are clearly licensed under CC-BY-SA-4.0 and GFDL, and not free of copyright. You were referring to a definition of the word museum published by the International Council of Museums (ICOM) at https://icom.museum/en/resources/standards-guidelines/museum-definition/. It's not a very long definition, but I don't think it is short enough to be de minimis so it is plausible that it is subject to copyright. ICOM is based in France, so French copyright law would presumably apply. I don't know what that law says about WMF's arguments on "functional language" and fair use. If English copyright law applied, I'm not sure there would be any "functional language" copyright exception. If ICOM is collaborating with the WMF in a project, ideally it should just clarify that it is licensing the text in question under a free licence for the purpose of the project. — Sgconlaw (talk) 16:34, 21 May 2024 (UTC)Reply

I once published a Catalan-English dictionary, entirely copied from en.wiktionary. But I was at the time the third-biggest contributor to the site, and the ninth-biggest Catalan one, so figured it was fair game. I made 70 euros from it. P. Sovjunk (talk) 21:25, 19 May 2024 (UTC)Reply

That is fair. In that case you are going Wiki -> off wiki, selling free content in book form for a fee. In the case above, content is going off-wiki -> wiki. Bluerasberry (talk) 15:46, 21 May 2024 (UTC)Reply

User:Ioaxxere/minitoc.js


I've created a gadget that allows users to specify "preferred languages" for {{minitoc}}, similar to the preferred languages system for our translation tables. These preferred languages are linked to on the header with the goal of allowing users to navigate to languages they're interested in faster (as requested in e.g. [13]). If anyone wants to try it out, go to User:YourName/common.js and add the line importScript("User:Ioaxxere/minitoc.js");

If we adopt {{minitoc}}, could we add this script as a default gadget so it can be used by logged-out users? Pinging interface administrators who participated in the previous discussion: @-sche, Benwing2. Ioaxxere (talk) 02:16, 19 May 2024 (UTC)Reply

@Ioaxxere Hmm, I think this is useful. I note that after changing your preferred languages, you have to refresh the page for them to display; not sure if this is fixable. Benwing2 (talk) 04:41, 19 May 2024 (UTC)Reply

@Benwing2 That's strange — so what happens when you press "save"? Is there an error? Ioaxxere (talk) 05:00, 19 May 2024 (UTC)Reply

@Ioaxxere Hmm. I couldn't reproduce it, even after clearing my cookies, so I commented out the import, cleared my cookies again and uncommented the import, and now I don't get the functionality at all. Benwing2 (talk) 05:13, 19 May 2024 (UTC)Reply

BTW no JavaScript errors coming from your gadget, only from the PreviewPopup gadget and complaints about Wiktionary using third-party cookies to access enwiki and mediawiki.org. Benwing2 (talk) 05:14, 19 May 2024 (UTC)Reply

Infrastructure: Southern Pinghua vs. Nanning Pinghua


(@Benwing2) Recently we've added Nanning Pinghua to zh-pron (e.g. 捱更抵夜). Nanning Pinghua is a variety of Southern Pinghua, which is either a branch of Sinitic directly, or a sub-branch of the Yue branch. However, there is a problem with the categories. Currently zh-pron by default categorises them to Category:Southern Pinghua lemmas, but there is also a dialectal label {{lb|zh|Nanning Pinghua}} which categorises entries independently into Category:Nanning Pinghua (by complete accident I discovered that Category:Southern Pinghua is also populated as a label category), and these two categories are not directly connected:

Category:Southern Pinghua lemmas ← Category:Southern Pinghua language ← Category:Pinghua languages ← Category:Sinitic languages
Category:Nanning Pinghua ← Category:Pinghua Chinese ← Category:Dialectal Chinese ← … ← Category:Chinese language ← Category:Sinitic languages

If you ask me, I think "Pinghua languages" / "Pinghua Chinese" shouldn't exist at all, because Northern Pinghua and Southern Pinghua do not form a linguistic sub-branch by themselves. In User:Wpi/zh-dial-list we can also see that "Northern Pinghua" and "Southern Pinghua" are treated as their own branches in Module:zh/data/dial.

Actually, upon more exploration, it seems that this issue is not unique to Nanning Pinghua. There are also Category:Cantonese Chinese and Category:Cantonese language which are also not directly related, but they seem to have "solved" the issue by using a "See also" right under the header. --kc_kennylau (talk) 08:24, 19 May 2024 (UTC)Reply
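To illustrate what "not directly connected" means for the two chains listed above, here is a small Python sketch that walks one parent chain and reports the first category it shares with the other. The chains are copied from the comment (the "…" step is elided there and is kept as a placeholder here); the function name is hypothetical and nothing here queries the live category tree.

```python
# Parent chains as listed in the comment above; "…" marks the step elided there.
LEMMA_CHAIN = [
    "Category:Southern Pinghua lemmas",
    "Category:Southern Pinghua language",
    "Category:Pinghua languages",
    "Category:Sinitic languages",
]
LABEL_CHAIN = [
    "Category:Nanning Pinghua",
    "Category:Pinghua Chinese",
    "Category:Dialectal Chinese",
    "…",
    "Category:Chinese language",
    "Category:Sinitic languages",
]


def first_shared_category(chain_a, chain_b):
    """Return the first category in chain_a that also occurs in chain_b, or None."""
    others = set(chain_b)
    for category in chain_a:
        if category in others:
            return category
    return None


print(first_shared_category(LEMMA_CHAIN, LABEL_CHAIN))
```

With the chains as given, the two hierarchies only meet at Category:Sinitic languages, which is the disconnect described above.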

@Kc kennylau Yes, this is unfortunate and a known issue. I would like to eliminate the 'Foo Chinese' categories in favor of 'Foo lemmas', whenever possible. Benwing2 (talk) 19:09, 19 May 2024 (UTC)Reply

@Benwing2: Can we make "Nanning Pinghua" (the label) instead point to a dialect of "Southern Pinghua" (the language csp)? --kc_kennylau (talk) 20:48, 19 May 2024 (UTC)Reply

@Kc kennylau Sure. I didn't mean to imply that dialects are forced to categorize directly into lemma categories, but rather that we should eliminate the parallel hierarchy underneath Category:Dialectal Chinese in favor of a more conventional hierarchy that categorizes either directly into 'Foo lemmas' or into dialectal subcategories. So for example the label Cantonese should categorize into Category:Cantonese lemmas not into Category:Cantonese Chinese, and specific varieties of Cantonese go into their own categories, which are subcategories of Category:Regional Cantonese. So the label Nanning Pinghua would categorize into Category:Nanning Pinghua, which would be a subcategory of Category:Regional Southern Pinghua (which doesn't seem to exist yet). BTW there is an open WT:RFM topic under WT:RFM#Ramifying/filling out Yue Chinese on cleaning up the Yue branch, which is currently a hopeless mess. Benwing2 (talk) 21:02, 19 May 2024 (UTC)Reply

@Benwing2: I didn't forget about that discussion. In fact, we are discussing that amongst ourselves, and it is also a bit of a mess right now. --kc_kennylau (talk) 21:06, 19 May 2024 (UTC)Reply

Giving Appropriate Credit for Critical Semi-Anonymous Works


Here (diff), I used Wiktionary to highlight what is probably one of the earliest appearances of the word 'Haidong'. I use Foreign Broadcast Information Service translations frequently for some words which rarely appear elsewhere, or only appear decades later. I had recently seen some discussion about the appropriateness of the use of citations/quotations that do not name specific authors. Are semi-anonymous sources legitimate targets for a Wiktionary citation/quotation? They can be used to meet the goals listed at Wiktionary:Quotations#Choosing_quotations, namely to "Extend the time range that we have quotations for, or fill long time gaps;" and/or "Show the variety of genres, regions and registers that a term is used in." There is a critical area of cultural heritage for any word that is written semi-anonymously. Here, in the above cite, we have an anonymous propaganda author and an anonymous propaganda broadcaster (at "Xining Provincial Service"), with an anonymous propaganda transcriber and an anonymous propaganda translator (at Foreign Broadcast Information Service). It would be calamitous to the project of documenting actual usage if this citation/quotation were deleted on the basis that we don't know the name of the translator who would have been "creating" this English language loan word. (Please feel free to review the materials and find any relevant names. But I didn't see any.) Have I given appropriate credit to everyone involved in that text, as far as practically possible? Is that quotation a proper subject for a Wiktionary citation/quotation? Thank you. --Geographyinitiative (talk) 20:41, 19 May 2024 (UTC)

@Geographyinitiative I don't see any issue whatsoever in citing "semi-anonymous" sources, as you say, as long as you include whatever bibliographic information is available. Has anyone threatened to delete them? AFAIK these are well-accepted. Benwing2 (talk) 20:53, 19 May 2024 (UTC)Reply

I just want to give everybody appropriate credit, but the genre of "spy reports", which is very useful for some of the vocabulary I'm looking at, is super opaque. --Geographyinitiative (talk) 20:59, 19 May 2024 (UTC)Reply

Interslavic language


Now that the Interslavic language has an ISO 639-3 code, is it welcome on en.wikt? There is an online dictionary containing around 18,000 words ([14]) that can be used as a base for introducing it here. Wojsław Brożyna (talk) 01:27, 21 May 2024 (UTC)

Typically, constructed languages are in the appendix namespace, especially ones that don't have a long history with actual speakers, etc. In principle, I'd be okay with Interslavic in the appendix namespace. —Justin (koavf)❤T☮C☺M☯ 02:02, 21 May 2024 (UTC)Reply

See the recent Wiktionary:Beer_parlour/2024/April#CFI_for_constructed_languages. The general attitude is most editors don't want more mainspace conlangs, as Justin mentioned. Appendix seems fine. Vininn126 (talk) 07:31, 21 May 2024 (UTC)Reply

We would still need to add a language code for it, but I don't really see the issue with doing that if it's appendix-only. Theknightwho (talk) 08:46, 21 May 2024 (UTC)Reply

Thanks for the replies. I hope that the code will be quickly included on WT:LL. In the meantime: since the words themselves should be added in the appendix, is it acceptable to link them from the mainspace? For example, as descendants of Proto-Slavic words in the reconstruction namespace? --Wojsław Brożyna (talk) 14:00, 21 May 2024 (UTC)

No, it is not acceptable to link them from mainspace, with the exception of terms in mainspace languages that derive from that language: if a mainspace language has borrowed a word from Interslavic, then it can be linked, otherwise not (not as a translation, not as a descendant in mainspace or reconstruction space, etc.). I've now added the isv code as an appendix-only language code. — SURJECTION / T / C / L / 14:07, 21 May 2024 (UTC)

To be sure that I have understood properly: the reconstruction namespace is also treated as part of mainspace, yes? --Wojsław Brożyna (talk) 14:10, 21 May 2024 (UTC)

Essentially, yes. Theknightwho (talk) 14:11, 21 May 2024 (UTC)

To be perfectly honest, I don't really see the point. When I look at Category:Appendix-only constructed languages, I see mostly languages with little or no practical use. The only possible outcome of Interslavic in an appendix-only construction would be duplicating stuff that can already be found here and/or here, an additional risk being that we might end up with divergent versions. So it's not clear to me what the value would be. For the record, Interslavic is not some hobby language, and its community is at least ten times bigger than that of Ido and Interlingua, languages for which an exception has apparently been made. IJzeren Jan (talk) 14:51, 21 May 2024 (UTC)

@IJzeren Jan I think it was a mistake to include Ido and Interlingua in the mainspace; these should be moved to the Appendix, as we already did with Novial for example. They should not be considered precedents. Benwing2 (talk) 19:02, 21 May 2024 (UTC)Reply

We might want to eventually take this to a formal vote or perhaps a thread in WT:RFM. We have enough BP threads to do so. Vininn126 (talk) 19:05, 21 May 2024 (UTC)

I would much rather have a vote as to what the criteria for conlangs being included in mainspace are, which would conveniently decide this question as well. I personally favour "has or has had a native speaker", but it'll need to be more fleshed-out than that to be rigorous. Theknightwho (talk) 22:28, 21 May 2024 (UTC)

Some objective criterion would certainly be helpful. However, native speakers won't do, because auxiliary languages are not even meant to have native speakers. Even in the case of Esperanto it's not the ultimate fulfilment of its goals but merely a side effect of its usage. And note that even though Esperanto reportedly has about a thousand native speakers, it still does not have any monolingual native speakers. If anything, I'd argue that a criterion like "has or had a user community of at least 1,000 people" (or whatever other number you prefer) would be more suitable, although I'll admit such figures are not without issues either. IJzeren Jan (talk) 07:14, 22 May 2024 (UTC)Reply

That's assuming people want more conlangs; I'd say most editors don't based on the thread I linked above. Vininn126 (talk) 07:17, 22 May 2024 (UTC)Reply

You don't seem to get my point. First of all, native speakers are gravely overrated. Although Esperanto is probably the only constructed language with native speakers, it's not like they set any linguistic standards, and as far as there is any form of natural development at all, that's done by L2 speakers. Besides, there are many languages here that never had native speakers. Let me just mention Old Church Slavonic and Rumantsch Grischun. To go even further, it's questionable whether classical Latin ever had native speakers in the form it's being presented. Secondly, it's wrong to assume some kind of binary distinction between natural and constructed languages. Languages show various degrees of deliberate human intervention, which goes especially for artificially created standardisations (like Nynorsk, Rumantsch Grischun, Euskara Batua, Limba Sarda Comuna) and revived languages (like Modern Hebrew, revived Cornish). Finally, I don't really understand where that "allergy" to languages that were created at some point in history comes from. Nobody complains about some obscure language with very few speakers, so what's the problem with a few constructed languages with a lot of speakers? IJzeren Jan (talk) 10:56, 22 May 2024 (UTC)

I have a feeling you didn't read through the thread. Vininn126 (talk) 11:02, 22 May 2024 (UTC)Reply

Those first counterpoints are questionable. And secondly it is meaningful to have native speakers - they have a natural intuition for what should or shouldn't be correct. Vininn126 (talk) 11:06, 22 May 2024 (UTC)Reply

@IJzeren Jan: When we record languages like Latin or Old Church Slavonic, we don't simply record the written language, but we simultaneously record the spoken language associated with it: Vulgar Latin and Pre-Bulgaro-Macedonian. For auxlangs, you don't have a natural counterpart that is recorded together with the written one. Thadh (talk) 08:12, 23 May 2024 (UTC)Reply

In the meantime, do you think that it'd be worth it to have a vote to remove Interlingua, Ido, and Volapük? I honestly don't see a criterion for constructed languages being agreed upon in the near future, and as is, the only constructed language that'd fall under any criterion that's been mentioned is Esperanto anyway. AG202 (talk) 14:12, 22 May 2024 (UTC)

I'd rather do that only if a vote to have a general criterion fails. Otherwise, it might end up being a lot of work for nothing. Theknightwho (talk) 15:00, 22 May 2024 (UTC)Reply

I am fine with having a vote about Interlingua, Ido and Volapük in the near term, as I think User:AG202 is right that it will be difficult to find a criterion that everyone agrees on and I don't think it will be a waste of effort to have this vote. Benwing2 (talk) 18:02, 22 May 2024 (UTC)Reply

It should be simple enough to set the entry barrier for a given conlang as ‘there exists a consensus on Wiktionary to admit it’. I imagine only Esperanto will clear that hurdle. Nicodene (talk) 23:24, 22 May 2024 (UTC)Reply

We, more precisely, have to ask ourselves whether it would probably be child abuse if the conlang were taught to children as a native language. In Esperanto it wasn’t because the parents met and spoke it organically. In Interslavic it would be, because why don’t you teach the bairn your native harder Slavic language if you are proficient in it? No reasonable person … In Klingon the children also became weird. Were it to suffice that a native speaker has existed, one would set perverse incentives. I maintain my position that conlangers are suspect to suffer personality disorders. Fay Freak (talk) 12:40, 23 May 2024 (UTC)Reply

@Fay Freak: Is it child abuse if you raise your kid as a native speaker of both a natural tongue and some constructed language? I do agree that having a conlang as one’s sole native language is wrongheaded, but speaking a conlang as a first language would make someone a good Wiktionary editor working on that conlang, and is harmless as long as they continue speaking in a natural language. Inqilābī 16:59, 1 June 2024 (UTC)Reply

@Inqilābī: I have been clear enough. Probably. I don’t find an article about a neckbeard father who raised his child bilingual with an artlang which she then suppressed just right now, perhaps it was a Reddit story and they swept it under the carpet. Maybe this hypothetical is difficult at your age, since personality development is not finished, myself I would not have been able to follow this line of thought half a decade ago. Fay Freak (talk) 19:01, 1 June 2024 (UTC)Reply

@AG202: I support creating a vote to determine the status of those three languages although I'm undecided myself. Ioaxxere (talk) 22:00, 22 May 2024 (UTC)Reply

Are there easily accessible, objectively measurable activity stats for these conlangs? Such as the number of edits per month or the rate of adding new lemmas? Just to see the current situation. That said, any activity stats can be gamed and artificially propped up by certain individuals via tons of low-quality edits if this criterion were to become part of the official policy. --Ssvb (talk) 07:49, 23 May 2024 (UTC)

@Ioaxxere, @Benwing2, @Theknightwho, @Koavf, @Nicodene: Wiktionary:Votes/2024-06/CFI for mainspace constructed languages has been created. There's no set start date yet, so feel free to leave comments either here or (preferably) on the talk page. AG202 (talk) 22:19, 4 June 2024 (UTC)Reply

Feedback invited on Procedure for Sibling Project Lifecycle


You can find this message translated into additional languages on Meta-wiki. Please help translate to your language

Dear community members,

The Community Affairs Committee (CAC) of the Wikimedia Foundation Board of Trustees invites you to give feedback on a draft Procedure for Sibling Project Lifecycle. This draft Procedure outlines proposed steps and requirements for opening and closing Wikimedia Sibling Projects, and aims to ensure any newly approved projects are set up for success. This is separate from the procedures for opening or closing language versions of projects, which are handled by the Language Committee or the closing projects policy.

You can find the details on this page, as well as the ways to give your feedback from today until the end of the day on June 23, 2024, anywhere on Earth.

You can also share information about this with the interested project communities you work with or support, and you can also help us translate the procedure into more languages, so people can join the discussions in their own language.

On behalf of the CAC,

RamzyM (WMF) 02:26, 22 May 2024 (UTC)Reply

‘Surface analysis’: The end of an era?


As tired as we all are of the matter, the previous discussion ended in a possibly promising proposal: replacing all current usages of {{surf}} with a new template {{af+}}, a clone of {{af}} with the accompanying text ‘derivable from X + Y’.

It seems to me that this is the first proposed phrasing to feature all three of the following:

Precision: word-derivation is actually a ‘thing’ in linguistics, unlike ‘surface analysis’ (edit: and also unlike “X is equivalent to Y + Z”)
Comprehensibility: any educated person should be able to understand it
Compatibility: it works with all (valid) usages of {{surf}}

Thoughts? Nicodene (talk) 02:44, 23 May 2024 (UTC)Reply

Can we not just do this by a hard redirect of {{surf}} to {{af+}}? So doing would avoid notifications of change being sent out for the pages modified. --RichardW57m (talk) 10:27, 23 May 2024 (UTC)Reply

I do not like the idea of {{af+}} printing "derivable from". I would prefer a different name. Vininn126 (talk) 10:46, 23 May 2024 (UTC)Reply

Me neither, it is not explicit about mere non-diachronic derivability.

I avoid schematic phrasing anyway, for literary style: writing sometimes “analyzable as”, sometimes “equivalent to” etc. Fay Freak (talk) 12:29, 23 May 2024 (UTC)

@Fay Freak As far as I can tell it is inherently synchronic. I cannot think of a non-synchronic way to read the statement “quickly is derivable from quick + -ly”.

As for those other phrasings, they are not precise, which is what leads to silliness like “month: equivalent to moon + -th”.

@Vininn126 I understand that the proposed phrasing doesn’t actually contain the word that af stands for, but I’m not sure there is any that does which fits the scope of {{af}} / {{surf}}. We can’t well say **affixable from X + Y, and even if we could, it would be inaccurate for compounds like greenhouse, which fall within the current scope of {{af}} / {{surf}} but do not involve any actual affixes.

The mismatch between template name and description could be fixed another way: by renaming {{der(+)}} to {{ult(+)}} “ultimately from”. If anything that is actually a better description than “derived from”, such that many editors (myself included) have found themselves manually writing out “ultimately from {{der|…}}”. Once that is done, {{af}} can be renamed to {{der}} and {{surf}} to {{der+}} “derivable from”.

With bot assistance it would be fairly straightforward to implement this, I think. And the resulting system would be a good deal simpler/more transparent than our current one. Notably {{compound}} could finally be retired, since “derivable from” is applicable for compounds, while af is technically incorrect. Nicodene (talk) 19:52, 23 May 2024 (UTC)Reply

I'm fine with the concept; like I said, it's the naming I don't like. {{der+}} is also not ideal. {{ult}} is so far the best I've seen, but it feels clunky. Vininn126 (talk) 19:53, 23 May 2024 (UTC)Reply

Clunky in what way? Nicodene (talk) 04:18, 24 May 2024 (UTC)Reply

Well to be honest it's not less clunky than {{surf}}. Vininn126 (talk) 09:10, 24 May 2024 (UTC)Reply

I've got no strong opinion as to what it should be replaced with, but I too would like to get rid of this template; I never liked the wording. P U C – 15:04, 24 May 2024 (UTC)Reply

We should first agree on the wording, and then consider the best way to implement it. --Lambiam 16:57, 24 May 2024 (UTC)Reply

What about "analysable as"? I'm not sure it's a good idea to use derive as it's already doing a lot of heavy lifting—we already use it in etymologies to refer to a term being derived from other languages, as well as in the "Derived terms" heading (and it was pointed out in an earlier discussion that these two uses are already possibly inconsistent). — Sgconlaw (talk) 19:07, 24 May 2024 (UTC)Reply

As a bonus we could switch from {{surf}}'s up to {{anal}}. Vininn126 (talk) 19:10, 24 May 2024 (UTC)Reply

Yes, instead of just scratching the surface let's go deep with {{anal}}. P U C – 19:51, 24 May 2024 (UTC)Reply

Are we done with sniggering and drawing penises on the toilet stall door?

— Sgconlaw (talk) 22:23, 24 May 2024 (UTC)Reply

I have no idea what you're referring to. Vininn126 (talk) 22:23, 24 May 2024 (UTC)Reply

@Sgconlaw Vague formulations like ‘analysable as’ (or per @Sokkjo ‘equivalent to’) are as mentioned problematic in that they result in users claiming synchronic impossibilities like “month = moon + -th” or “again = on- + gain”.

The use of “derived” in derived terms sections is actually an argument in favour of this proposal, now that you mention it, as there’d be an increase in symmetry here. The derivation of quickly from quick is already mentioned on both entries; why not use matching language in both places?

As for the (unrelated) use of the template der, that is as mentioned fixable by renaming it to {{ult(imately)}}, which seems more descriptive really, and then renaming af/surf to der/der+. Nicodene (talk) 22:58, 24 May 2024 (UTC)Reply

@Nicodene: I would not be comfortable with changing {{der}}/{{der+}} to “ultimately”. We have been using “derived” in etymology sections to denote derivation from one source language to another, and now suddenly repurposing it to mean something else is, I feel, a step too far. Moreover, “ultimately” suggests to me the omission of intermediate steps of derivation to some remote source language like Proto-Indo-European. (For example, term X is derived from Old French, which is derived from Latin, which is ultimately from Proto-Indo-European (skipping over Proto-Italic).) — Sgconlaw (talk) 23:18, 24 May 2024 (UTC)Reply

(Edit: see below.) Nicodene (talk) 05:00, 25 May 2024 (UTC)Reply

Oppose The only verbiage I support is equivalent to in areas where people instead use {{surf}}. I also do not support users going around changing From {{af}} to {{af+}}, as creating {{af+}} would promote. -- Sokkjō 21:11, 24 May 2024 (UTC)Reply

@Sokkjo: I guess I'm OK with "equivalent to" as well if there's consensus for that. — Sgconlaw (talk) 22:23, 24 May 2024 (UTC)Reply

I would also prefer "equivalent to" over any specific template, and I'm using it regularly. Thadh (talk) 23:03, 24 May 2024 (UTC)Reply

(Edit: see below.) Nicodene (talk) 05:00, 25 May 2024 (UTC)Reply

Would this be a thread to take to a wider audience? Vininn126 (talk) 23:11, 24 May 2024 (UTC)Reply

Certainly. Nicodene (talk) 23:40, 24 May 2024 (UTC)Reply

Support Theknightwho (talk) 21:39, 24 May 2024 (UTC)Reply

@Theknightwho: thoughts on the wording? — Sgconlaw (talk) 22:23, 24 May 2024 (UTC)Reply

"Equivalent to" is fine, though @Nicodene raises a good point about the difference between long-range derivations which contain combinations of roots completely alien to a modern speaker (e.g. analysing health as whole + -th) versus readily apparent formations that just-so-happened to enter the language as ready-formed borrowings. The problem with "equivalent to" is that it could refer to either, so it would be helpful to establish an alternative to refer to one of them. Theknightwho (talk) 10:11, 25 May 2024 (UTC)Reply

I’m not aware of any serious linguistic source that would publish a comment like “health is equivalent to whole + -th”, or the same equation using any of the other phrases mentioned above. Personally I don’t see the point of it. If one really feels the need to do that sort of thing, one can simply spell it out in words: “…from Proto-Germanic *hailaz + *-iþō, which correspond to the modern English whole and -th.” I don’t see why this should need a template. Nicodene (talk) 11:52, 25 May 2024 (UTC)Reply

Here is my revised proposal.

At the moment we handle distant etymological relations like "surgical: ultimately from Ancient Greek χειρουργία" using the template {{derived}}. This is a bit strange given that our "derived terms" sections never feature these words, instead having only language-internal formations like cuteness (< cute + -ness), for which we do not use the template {{derived}} but rather {{affix}} and its various "children" like {{compound}}.

My solution is to replace the current template {{derived}} with {{etyl}} and then rename the current {{affix}} to {{derived}}. Hence, finally, {{derived}} will match "derived terms".

(Also we'll avoid the awkwardness of using a template called {{affix}} for compounds like greenhouse, where there is no affixation going on at all. And, perhaps, we can one day retire {{compound}} and other such templates, replacing {{blend|en|emotion|icon}} with {{der|en|emotion|icon|blend=1}} and so on. Cutting down the absolute jungle of ety templates to a more modest size.)

Following this the infamous {{surf}} can be renamed to {{der~}} with the displayed text changed to "derivable from". This is the only phrasing proposed thus far which is understandable to just about anybody yet precise enough to discourage nonsense like "health: analysable as whole + -th" or "husband: equivalent to house + bond".

Thoughts? (Pinging @Benwing2.) Nicodene (talk) 08:22, 25 May 2024 (UTC)Reply
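For illustration only, the kind of bot pass that such a consolidation would need can be sketched in Python. This is a minimal sketch, not an existing bot: it assumes the proposed (not yet existing) `blend=1` flag on a repurposed {{der}}, and it handles only simple, non-nested {{blend|…}} calls. The function name `convert_blend` is hypothetical.

```python
import re

# Matches a simple {{blend|...}} call with no nested templates inside it.
# Nested templates are deliberately left untouched by this sketch.
BLEND_RE = re.compile(r"\{\{blend\|([^{}]*)\}\}")


def convert_blend(wikitext: str) -> str:
    """Rewrite {{blend|...}} as {{der|...|blend=1}}.

    The target syntax is the one floated in the proposal above; it is an
    assumption, not something the current templates support.
    """
    return BLEND_RE.sub(lambda m: "{{der|" + m.group(1) + "|blend=1}}", wikitext)


print(convert_blend("{{blend|en|emotion|icon}}"))
```

A real run would additionally have to wait until the current uses of {{der}} had been migrated, since the same name would otherwise carry two meanings at once.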

Categories of child languages should also be subcategories of the parent language


Currently, when I would like to look at "Tagalog terms borrowed from Spanish", terms from category: "Tagalog terms borrowed from Mexican Spanish" do not show up. I would like to propose that "Tagalog terms borrowed from Mexican Spanish", "Tagalog terms borrowed from Early Modern Spanish" be a subcategory of "Tagalog terms borrowed from Spanish".

Of course, this should apply across the entire Wiktionary categorization:

Borrowed from Chinese

Subcategories:

- Borrowed from Mandarin
- Borrowed from Cantonese
- Borrowed from Hokkien

[…]

Similar to what is currently implemented in:

Derived from Latin
- Derived from Vulgar Latin, Ecclesiastical Latin, Medieval Latin […]

Seems to me it's already working in derived terms but not in borrowed terms. Ysrael214 (talk) 12:41, 23 May 2024 (UTC)Reply

Chinese isn't a good example here, as it works differently for reasons that aren't worth going into in this thread. I agree that it makes sense to subcategorise borrowings like this in general, though, but we shouldn't have categories like "Tagalog terms borrowed from West Iberian languages" and so on, even though Category:Tagalog terms derived from Spanish is in Category:Tagalog terms derived from West Iberian languages. Theknightwho (talk) 13:32, 23 May 2024 (UTC)Reply

I agree that "X terms borrowed/calqued/etc. from [language variety]" should be a subcategory of "X terms borrowed/calqued/etc. from [language]" (in a similar manner to "X terms derived from [language variety]" being a subcategory of "X terms derived from [language]"). Einstein2 (talk) 19:11, 23 May 2024 (UTC)Reply
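The rule agreed on above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `PARENT_LANGUAGE` table and the function `parent_category` are hypothetical, and the real variety-to-language data lives in Wiktionary's language modules rather than in a hand-written table. Because the table maps varieties to languages only, the lookup stops at the language and does not climb to families such as West Iberian.

```python
from typing import Optional

# Hypothetical variety -> parent language table, for illustration only.
PARENT_LANGUAGE = {
    "Mexican Spanish": "Spanish",
    "Early Modern Spanish": "Spanish",
    "Vulgar Latin": "Latin",
    "Medieval Latin": "Latin",
}


def parent_category(category: str) -> Optional[str]:
    """Return the category that `category` should sit under, or None."""
    for relation in ("borrowed from", "derived from"):
        marker = f" terms {relation} "
        if marker in category:
            target_lang, source = category.split(marker, 1)
            parent = PARENT_LANGUAGE.get(source)
            if parent is None:
                # Already a full language (or unknown): no variety-level parent.
                return None
            return f"{target_lang}{marker}{parent}"
    return None


print(parent_category("Tagalog terms borrowed from Mexican Spanish"))
```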

The pos= parameter


This is widely used in links as a way to give non-glosses (e.g. Jeju (island in South Korea)), because giving the definition as a gloss would be incorrect (e.g. Jeju (“island in South Korea”) is incorrect, because "Jeju" does not mean "island in South Korea" in general). @Fenakhay has decided today that this is "misuse" of a parameter, which is something that came up at the entry Dagelet. I have no idea why this is a problem, how this is misuse of anything, or what they are hoping to achieve by objecting to this, really, but given that this is clearly causing some difficulty, should we rename the parameter? Theknightwho (talk) 01:21, 25 May 2024 (UTC)

Never used the parameter myself, but it sounds confusing in the use that you describe. Support renaming it. CitationsFreak (talk) 03:01, 25 May 2024 (UTC)Reply

We could introduce |q=. It'd make sense to harmonize the "comment" parameter across different templates. Nicodene (talk) 04:23, 25 May 2024 (UTC)Reply

I don’t really see why it is a “mistake” to use the |t= parameter in cases like that, with the gloss enclosed in quotation marks. Don’t we define geographical places and other proper nouns (e.g., names of languages) without any special type of formatting anyway? If so, why should they be treated unlike other definitions here? — Sgconlaw (talk) 04:28, 25 May 2024 (UTC)Reply

@Sgconlaw Leaving aside place names specifically, there needs to be a way to express non-gloss definitions that doesn't use quote marks, since quote marks imply it's a gloss. Theknightwho (talk) 04:30, 25 May 2024 (UTC)Reply

Yes. I find it a bit jarring to encounter non-glosses placed in quotation marks as if they’re glosses. Not a terribly urgent thing, but still. Nicodene (talk) 13:29, 25 May 2024 (UTC)Reply

q= seems like a qualifier, rather than a non-gloss 'definition', so people would surely end up using it for things like "archaic" or "British English", and then it'd be inconsistent if {{m|...|q=...}} generated (like pos= does) an unitalicized qualifier while {{m|...}} {{q|...}} generated an italicized one... so I think we'd need to make it italicize for consistency... and then the question is, do we consider that "archaic" and "island in Korea" are or should be treated (and formatted) as the same type of information? (If so, then no problem, I guess!) OTOH, if we consider them different kinds of thing, and iff we don't want to keep using pos= for "island in Korea", another idea is ngd= or ng= (taking inspiration from {{ngd}}/{{ng}}). - -sche (discuss) 15:38, 25 May 2024 (UTC)

I'd prefer ng= for brevity, which is another one of its aliases: {{ng}}. Theknightwho (talk) 15:42, 25 May 2024 (UTC)Reply

Here and there I’ve found myself wanting to specify things like “archaic” or “with silent ⟨f⟩”. But I don’t mind either way. Nicodene (talk) 16:16, 25 May 2024 (UTC)Reply

If we want non-glosses we should have a parameter for that. I use the POS parameter a lot for designating part of speech. Vininn126 (talk) 14:02, 25 May 2024 (UTC)Reply

I agree that in links |pos= should be limited to specifying parts of speech and shouldn't be used for formatting purposes. I also have no objection to the introduction of a separate |ng= (non-gloss definition) parameter, with the text italicized in line with {{non-gloss definition}}. However, we need to ensure consistency between how glosses and non-glosses are indicated in entries and within links. As I mentioned above, at the moment we seem to treat definitions for geographical terms as glosses. For example, one sense of Jeju is "An island, province, and city in South Korea", and this is not enclosed within {{non-gloss definition}}. If this is so, then we should not treat this definition as a non-gloss within a link. — Sgconlaw (talk) 16:47, 25 May 2024 (UTC)Reply

@Sgconlaw To explain my thought process: "island in South Korea" is a truncated definition (given for the purpose of clarification) which unambiguously isn't a gloss, as "Jeju" is not a generic term for South Korean islands. Whether the output of {{place}} should be treated as a gloss or non-gloss is a separate question, really. Theknightwho (talk) 21:15, 25 May 2024 (UTC)Reply

I'm fine with a new param |ng= (although on a practical level it will take significant work to implement it everywhere; to future-proof this, we might want to put a list of pass-through link params somewhere convenient, like in Module:headword/data or Module:links/data, and rewrite the places that have pass-through link params to use the list instead of hardcoding the set of params). I'm not sure whether it makes sense to italicize it; keep in mind that currently |pos= is often used for arbitrary text like "all meanings" that isn't necessarily a non-gloss definition. Benwing2 (talk) 22:09, 25 May 2024 (UTC)

An edge case here is what exactly counts as a “part of speech”, especially with affixes. I’ve often used and seen pos used to give descriptions like “verb-forming affix”, “suffix forming agent nouns”, “diminutive suffix”, etc. These are not solely confined to part-of-speech information, but they are basically a concise way of stating what the part of speech is along with what the affix is used for.--Urszag (talk) 05:39, 26 May 2024 (UTC)Reply

If this proposal includes changing the formatting of what is currently output using the pos param in templates like {{m}}, then I must oppose, as we have used this extensively in JA entries as a way of including non-transliteration, non-translation information that is still within the parentheses, and reworking that will be a sizable PITA.

If this proposal is just about adding a parameter that will have identical output as pos currently does, I fail to see the point; quibbling that pos is "supposed" to mean "part of speech" seems rather silly to me. How params are used changes, much as how words are used also changes. That said, I would not be opposed to an additional parameter, provided that there is no compulsory need to rework existing content. ‑‑ Eiríkr Útlendi │^{Tala við mig} 20:05, 31 May 2024 (UTC)Reply

Don't you find the naming to be... inconvenient? Imagine if we took a {{{gloss}}} and used it to provide a transliteration or something, and several people did it cause they didn't have a translit parameter simply. Vininn126 (talk) 20:08, 31 May 2024 (UTC)Reply
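To make the two ideas in this thread concrete (a single shared list of pass-through link parameters, and a separate non-gloss parameter that is shown without quotation marks), here is a minimal Python sketch. It is not how Module:links works: the names `PASS_THROUGH_LINK_PARAMS`, `extract_link_params` and `format_annotations` are hypothetical, and `ng` is the proposed parameter, not an existing one.

```python
# Hypothetical central list; callers consult this instead of hardcoding names.
# "ng" is the parameter proposed in this thread and does not exist yet.
PASS_THROUGH_LINK_PARAMS = ("tr", "t", "pos", "ng")


def extract_link_params(args: dict) -> dict:
    """Keep only the arguments that should be passed through to a link."""
    return {k: v for k, v in args.items() if k in PASS_THROUGH_LINK_PARAMS}


def format_annotations(params: dict) -> str:
    """Render the parenthesised annotations that follow a link."""
    parts = []
    if "tr" in params:
        parts.append(params["tr"])
    if "t" in params:
        # Glosses are wrapped in quotation marks.
        parts.append("“" + params["t"] + "”")
    if "pos" in params:
        parts.append(params["pos"])
    if "ng" in params:
        # Non-gloss text is shown bare, so it does not read as a gloss.
        parts.append(params["ng"])
    return "(" + ", ".join(parts) + ")" if parts else ""


args = {"lang": "en", "ng": "island in South Korea"}
print("Jeju " + format_annotations(extract_link_params(args)))
```

Whether the non-gloss text should also be italicized is left open here, as it is in the discussion.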

Shouldn't Wiktionary:No personal attacks be official?


Is there any good reason why that draft policy isn't official? Should we go about starting the process to make it official? Purplebackpack89 15:19, 25 May 2024 (UTC)Reply

It's unnecessary. P U C – 15:30, 25 May 2024 (UTC)Reply

Why is it unnecessary? What good can come of allowing personal attacks? Purplebackpack89 19:36, 25 May 2024 (UTC)Reply

I didn't say we should allow personal attacks, I said having an official policy about it is unnecessary and I don't see what good is going to come of that. Common sense and common courtesy are enough; things might derail every once in a while but I see no need for all that time-intensive Wikipedia-style lawmaking. P U C – 22:07, 25 May 2024 (UTC)Reply

As I see it there are really only two alternatives: 1) enacting a Wikipedia-style policy, or 2) this place becoming or staying the Wild West where users are allowed to harass and bully editors off the project. You say common sense and courtesy are enough but it's been my experience that some of the supposedly most-trusted editors on this project see no need to be courteous to other editors. Purplebackpack89 22:17, 25 May 2024 (UTC)Reply

+1, I'd support it becoming official policy. AG202 (talk) 23:07, 25 May 2024 (UTC)Reply

They don’t need to see a need. I believe in them to strive towards the optimal results rather, quite different from that which is “needed”. Sometimes it is making their stance clear of how low they esteem the opinions of certain editors in certain respects. Not needed for you, since you don’t have insight either way. Or for me, since I know I am retarded. I even make personal attacks against myself, what does the policy proposal say about it? Does it get your nose out of joint? Why does it happen when your pseudonymous internet identity is disrespected at some particular point in time? It’s not like we harass and bully you. Look at Kiwifarms, that’s bullying. Consistency is key. Fay Freak (talk) 23:12, 25 May 2024 (UTC)Reply

The question is rather what good comes from a formal policy-page, save your personal stickling. The page has been added in February 2006, not changed since, when everyman has had made his experiences with the WWW for three years on 800 x 600 monitors via AOL CD-ROMs. Internet users have lost their virginities and even endured the MAGA era. There is extensive documentation how bad arguments and campaigns work over this medium; we call them out, and infer attributes of parties from their behaviour, sometimes also spanning decades, and vice versa, inasmuch as the factual argument is supported thereby.

Purplebackpack89 does not even understand the policy proposal he cites, which creates a false dichotomy by bidding to abstain from that but “discuss the facts”, which are quite different a thing in Wikipedia ~ early 2000s Richard Dawkins ideology of internet nerds: we also ignore “the facts”, as opposed to “the language”, and the audience having to make out its stance on it.

Behavioural addiction is a concept only paradigmized in this decade, our sensitivities of it, doomscrolling and other internet-related individual obsessive-compulsive behaviours conceptualized towards the end of the last. We should address people personally we suspect to have problematic behaviours; sometimes even to the point of remote diagnostics. If you read User talk:Surjection#what is wrong with you, or otherwise are attentive to the eccentricities added to Wiktionary and its discussions, you know that there is nothing left over, than perhaps refer to the psychiatrist; if there is data for this, an editor will get his personal homework. You see, some IP attacked me personally very heavily but I won. Nothing became better from a ban or something preventing a specific mode of discussion. Fay Freak (talk) 23:12, 25 May 2024 (UTC)Reply

I think we need a vote to make it official. CitationsFreak (talk) 15:55, 25 May 2024 (UTC)Reply

No objection if there's consensus to make it official. I agree it needs to be formally voted on. — Sgconlaw (talk) 16:38, 25 May 2024 (UTC)Reply

I have drafted the vote at Wiktionary:Votes/2024-05/Make No personal attacks an official policy Purplebackpack89 21:21, 25 May 2024 (UTC)Reply

agreed — SAMEER (؂・؄・؏) 01:01, 26 May 2024 (UTC)Reply

Agree with PUC: We don't have a policy forbidding murder either; does that mean murder is allowed on Wiktionary? No of course not. National laws prohibit murder, and basic human decency prohibits personal attacks. MuDavid 栘𩿠 (talk) 04:28, 26 May 2024 (UTC)Reply

How can one murder on wiktionary? Word0151 (talk) 21:23, 26 May 2024 (UTC)Reply

I agree with User:PUC that this policy seems unnecessary. It's not as though anyone is under the impression that personal attacks are allowed... Ioaxxere (talk) 04:57, 26 May 2024 (UTC)Reply

For folks that state that we don't need a policy, I'll point to Wiktionary:Beer parlour/2023/July § How to report a user? and how even though there was a consensus to take action against said user, there has yet to be any taken. CC: @bd2412 I would hope that we wouldn't need a policy like this, but it will at least force more action vs handwaving it away as we've seen happen here again and again. It doesn't look good on the community as is currently; we have too many instances of personal attacks, even in more public places like Beer parlour, for a userbase with this few active editors. (As seen by where this proposal came out of) AG202 (talk) 08:41, 26 May 2024 (UTC)

Reading back through that discussion, I'm truly disappointed that several admins and many active users said that something should be done and yet nothing was done. If that doesn't show why we need more explicit policy, I don't know what does. It just shows pure favoritism at this point, almost like an old guard. AG202 (talk) 08:50, 26 May 2024 (UTC)

@AG202 In that particular case, I don't see how an official policy would have made any difference, unless it mandated that specific things must happen under certain circumstances. The current proposal doesn't do that, and any that did would need to be formulated very carefully. Theknightwho (talk) 23:35, 26 May 2024 (UTC)Reply

I agree. We must have some way to punish those who break the rule, or it's worthless. CitationsFreak (talk) 02:43, 27 May 2024 (UTC)Reply

I support updating the policy to have an enforcement mechanism, but if I'm going to be real, I'm disillusioned with the whole thing. Even if we had a policy at that time, knowing how little enforcement goes on around here, I doubt anything would've been done anyways. ¯\_(ツ)_/¯ AG202 (talk) 17:43, 28 May 2024 (UTC)Reply

@AG202: Having an official policy should make it harder to sweep violations under the rug, at least in theory. That being said, most of the abusive behavior I've witnessed here came from the same user, so I wouldn't be surprised if it naturally becomes less of an issue from here on out. Binarystep (talk) 11:03, 29 May 2024 (UTC)Reply

I support this. I didn't know it wasn't policy already. —Caoimhin ceallach (talk) 17:04, 27 May 2024 (UTC)Reply

Wu lects


With current Yue subdivision discussions underway, I would like to ask for a parallel discussion regarding the exact same subject but for Wu. Similar discussions have already happened and the consensus is as follows:

wuu Wu Chinese 吳語
- wuu-nor Northern Wu 北部吳語 (alias: Taihu Wu 太湖片)
  - wuu-sha Shanghainese (Shanghai Wu) 上海小片
  - wuu-sji Sunan Wu (Suzhounese) 蘇嘉小片 (Sunan: 蘇南)
  - wuu-sdc Shadi Wu 沙地話
  - wuu-pil Piling Wu 毗陵小片
  - wuu-txh Tiaoxi Wu 苕溪小片 (Huzhounese 湖州小片)
  - wuu-lsx Linshao Wu 臨紹小片
  - wuu-nby Yongjiang Wu 甬江小片 (Ningbonese 寧波小片/明州小片)
  - wuu-hzn Hangzhounese 杭州話
- wuu-tzo Taizhounese 台州片 (Taizhou Wu)
- wuu-wzh Oujiang Wu 甌江片 (Wenzhounese 溫州片)
- wuu-jhw Wuzhou Wu 㜈州片 (Jinhua Wu 金華片)
- wuu-lsc Chuzhou Wu 處州片 (Lishui Wu 麗水片)
- wuu-sqx Xinqu Wu 信衢片 (信 Xin = Shangrao)
- wuu-xww Xuanzhou Wu 宣州片 (Western Wu 西部吳語)

Notes:

1. Shanghainese and Taizhounese adopted as their respective concepts encompass most if not all lects within the branch they describe. In particular, "Shanghainese" can also refer to urban-like suburban lects, or even all suburban lects in general, and there is no representative lect for the Wu spoken in Taizhou prefecture

2. Hangzhounese is an isolate within the Northern Wu family; it has significant Northern Mandarin influence, and some have even gone so far as to classify it as a Mandarin language

3. The subdivision of Inland Wu (much like that of some Yue areas) is highly contentious; the scheme adopted here is the one that the Wu editors stick to, and is essentially a slightly modified version of the 1987 Atlas subdivisions

4. Northern Wu will be a family code, like the potential future Yuehai code. Its further subdivision is purely practical as both active Wu editors tend to focus on Northern Wu, and is not a value judgement regarding the significance of Southern Wu sub-subbranches

5. Southern Wu is highly likely to be areal and thus will not have a corresponding code

If there are no objections, we shall follow through with this and implement these codes as soon as possible. @Musetta6729 (Notifying Atitarev, Benwing2, Fish bowl, Frigoris, Justinrleung, kc_kennylau, Mar vin kaiser, Michael Ly, RcAlex36, The dog2, Theknightwho, Tooironic, Wpi, 沈澄心, 恨国党非蠢即坏, LittleWhole): — nd381 (talk) 21:14, 25 May 2024 (UTC)Reply

@ND381 This is overall fine with me and seems suitably conservative and flat. My main concern is with the codes; I'd prefer to use codes that are less arbitrary and use the first three letters of the lect name whenever possible, otherwise using the first two letters of the first part along with the first letter of the second part. Benwing2 (talk) 22:03, 25 May 2024 (UTC)Reply

The codes already adhere, for the most part, to what you want.

Northern
- Shanghai
- (ISO 639-6 code for Sujiahu)
- Shadi, Chongming (Shadi is a place name)
- Piling (no representative lect)
- Tiaoxi, Huzhou (Tiaoxi is a place name)
- Linhang-Shaoxing (Linshao is an abbreviation of these two place names; Shaoxing is a prefecture whereas Linhang is a county)
- Ningbo, Yongjiang (spoken only in Ningbo prefecture and the tiny Zhoushan prefecture)
- Hangzhounese
(ISO 639-6 code for Taizhounese)
(ISO 639-6 code for Wenzhounese)
Jinhua, Wuzhou (spoken only in Jinhua prefecture)
Lishui, Chuzhou (spoken in Lishui prefecture)
Shangrao-Quzhou, Xinqu (Xinqu is an abbreviation of these two prefecture names; X included at the end to signify the historical abbreviation)
Xuanzhou Western Wu (Western Wu is a common name for the branch)

Some of these could have their letters shuffled around but for the most part they are already, in fact, two letters from one name and one letter from another name — nd381 (talk) 10:29, 26 May 2024 (UTC)Reply
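For comparison, the naming rule Benwing2 describes above can be written out as a short Python sketch. This is one reading of that rule, not an agreed procedure: the function `suggest_code` is hypothetical, and the `wuu` prefix is taken from the list earlier in this thread. Note that the rule as stated does not reproduce every code in the list (for instance, the ISO 639-6-based ones).

```python
import re


def suggest_code(lect_name: str, taken: set, prefix: str = "wuu") -> str:
    """Suggest a lect code from a name such as "Shanghai" or "Shadi Chongming".

    Rule as read here: try the first three letters of the name; if that is
    already taken, use the first two letters of the first part plus the
    first letter of the second part.
    """
    parts = [p for p in re.split(r"[\s,\-]+", lect_name.lower()) if p]
    if not parts:
        raise ValueError("empty lect name")

    candidates = [parts[0][:3]]
    if len(parts) >= 2:
        candidates.append(parts[0][:2] + parts[1][0])

    for suffix in candidates:
        code = f"{prefix}-{suffix}"
        if len(suffix) == 3 and code not in taken:
            return code
    raise ValueError(f"no free code for {lect_name!r}")


taken = set()
for name in ["Shanghai", "Shadi Chongming", "Piling"]:
    code = suggest_code(name, taken)
    taken.add(code)
    print(name, code)
```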

Luwian hieroforms template


Some Hieroglyphic Luwian lemmas have multiple spellings. Therefore I need a template similar to {{egy-hieroforms}} for Hieroglyphic Luwian for them. Can this be done? Antiquistik (talk) 08:49, 26 May 2024 (UTC)Reply

Latin pronunciations in English entries


There are some unadapted Latin borrowings in English that were given Latin pronunciations by Doremitzwr along with English ones. They are (at least): argumenta ad populum, argumentum ad populum, opere citato, operibus citatis, pactum de non petendo, simpliciter. It seems to me such pronunciations don't belong, so I'm going to delete them. I also found an Aramaic entry מלכתא with an Ashkenazi Hebrew pronunciation, which I suspect doesn't belong either, but I'm leaving it for now. Bringing this up here in case people have other thoughts. Benwing2 (talk) 09:18, 26 May 2024 (UTC)Reply

BTW the Latin pronunciation was the only one given for pactum de non petendo. Can someone supply an English pronunciation? Benwing2 (talk) 09:23, 26 May 2024 (UTC)Reply

Done. This is almost never said aloud in English, as far as I can tell. Theknightwho (talk) 21:47, 26 May 2024 (UTC)Reply

Guy has done so because they are not English, notwithstanding the header and templates. Save for simpliciter, every one of them should be moved to Translingual according to my correct custom of viewing the things. I mean you already admitted it for op. cit., so why is opere citato “English”? Both can have both English and classicizing Latin pronunciation under a Translingual header. The linked terms are used in German as well but German dictionaries would not include them and claim them to be German, less propense to see foreign terminology as integrated. Fay Freak (talk) 22:48, 26 May 2024 (UTC)

@Fay Freak You are probably right. Benwing2 (talk) 22:50, 26 May 2024 (UTC)Reply

I think that's a misunderstanding of what it means to be Translingual: if something is Translingual, then it very much can still be part of English, but it's convenient to avoid having tons of duplicate entries all with the same spelling. There's also the issue of differing pronunciations, which has come up before in relation to Translingual A4 (paper size) (which is part of everyday English in the UK, even though we only have a Translingual entry for it). Theknightwho (talk) 22:52, 26 May 2024 (UTC)

Just FYI, we can have Pronunciation sections in Translingual entries with the pronunciation in multiple languages. An example where this occurs with several languages is Homo sapiens. I just created {{IPA+}} to make this a bit easier; it prefixes the output with the langname, similar to {{m+}}. Benwing2 (talk) 23:49, 26 May 2024 (UTC)Reply

And the paper size, which we etymologize as from a German standard, is interestingly in German speech DIN A4 [dɪn ʔaː fiːɐ̯], as the regulatory authority is die DIN f [diː ˈdɪn]; A4-Papier is also a word, but we can form compounds even from a third language + section language via |langN=, not only translingual + section language. And yet this does not completely rule out that even this, DIN A4, is translingual.

It is a bit like with family names and other stuff, we had nasty votes about, that is not subjected to normal language rules by the community (Verkehrsanschauung, the jurist says). I ask whether something is intended to be translingual in the beginning: as certain now famous units, the meter, Kelvin and what not, which started in the monolingual scientific communities of certain countries.

For the bejel disease I added Latinate translingual translations in 2018 some of which were only ever found in German medicine works—because the common microbial identity of the diseases of various countries was only later recognized when hygiene ousted them.

As for pactum de non petendo, this is in turn one of the kind of terms that has been left over from the Middle Ages or so when they spake Latin in college, and ius commune and hence Latin as a law language was superseded only by the German Civil Code in 1900, with not every dogmatic term translated however. I probably have heard pactum de non petendo, but this is practically not so important so I bring forward more entropic examples like venire contra factum proprium also connected to § 242 BGB, and a long baggage around the law of unjust enrichment such as condictio indebiti, condictio ob rem, datio ob rem, datio obligandi causa, condictio ob turpem vel iniustam causam etc., which are mentioned and said to be found in a provision of the German Civil Code if the individual jurist likes so, by how much he is instilled by knowledge of the legal system’s historical background (the most illustrative example of this is the student book by Hans Josef Wieling on unjust enrichment last edition 2020), which also counterindicates all these terms being German or English.

Science and academia, whether debating arguments, for which it refers to known terminology about fallacies and cognitive biases, or just teaching, has to explain loan formations and intellectual history as contrasted from outside languages and debates, and hence carries forward terms and creates false impressions of regular inclusion of a term within a language.

Where in Arabic or Indian classrooms some scientific fields are difficult to discuss in the local native language at all, we are better in England and her former colonies and Germany to translate all education, but some Latin terms have been left over for centuries, and in a German psychology paper, not to speak of IT, there is a whole lot of untranslated English because you just got to know it anyway to get the grades.

It happens in all classes; practically you can’t understand this German drill I quoted at rambizzy without also being proficient in English. For law students they publish booklets like Latein für Jurastudierende; guess for some sciences you technically have to have some limited proficiency in a foreign language, morphology knowledge + basic stems, again indicative of these terms not being German.

Guess this is intuitive for me because I have atypical attention to details rather than to absorb every word-string as a social feedback integrating in the language community where it has been encountered; I don’t blame editors for not having thought around the corner so much when their language, English, has a low threshold of integrating foreign terms; and I had to illustrate the problem with German to mark the contrast of foreign vs. borrowed; and I know people don’t have the same propensity to systematize, or even the capacity, without both hyperpolyglottism and interdisciplinary excursions at play, even though it actually happens most closely to how I have described it. Fay Freak (talk) 00:17, 27 May 2024 (UTC)

I feel like it's the same level of difficulty for translating in non-European-language classrooms. People who want to discuss science topics without the words needed to do so coin them, either by borrowing them (as you say), calquing them, or describing what the term refers to.

Also, that German quote does not belong in an English entry. The song is a German song which uses code-switching. I don't think that a random English entry is the best place to demonstrate the fact. Maybe an appendix page? CitationsFreak (talk) 08:10, 27 May 2024 (UTC)

It would not be completely unreasonable to call all Latin-derived expressions that retain inflection used in learned and not-so-learned writings (eg, "scientific", medical, and legal Latin) Latin. But here our rules against including SoP expressions as well as the pronunciation differences work against it, despite the administrative simplicity that would result.

It would also not be unreasonable to call such expressions Translingual, once there is evidence of use embedded in running text in multiple (2, 5, 10?) languages. This violates few of our firm rules, I think, merely requiring that we allow pronunciations for multiple languages. A problem is the one of evidence of use in multiple languages. As a practical matter, it would be sufficient IMHO to start an entry for such an expression in whatever languages had sufficient evidence, typically including English. Once there were 2, 3, or more L2s, the L3s could be merged. Another demonstration of Translingual use might be single (ie, not triple) instances of use in a number of languages: eg, one use in Chinese, one in Japanese, one in Arabic, one in Spanish. DCDuring (talk) 14:48, 27 May 2024 (UTC)Reply

What concerns me is that this will make attestation requirements more complicated, since by merging them into one “language” it’s only necessary to find 3 uses in any language at all, which is not particularly difficult. For example, pactum de non petendo is comparatively hard to attest in English, since it’s not terminology used in English law or any of its descendants, from what I can tell (the equivalent term used in such situations being estoppel, at least in England). Edit: I forgot about Scots Law, but the general point still stands. Theknightwho (talk) 15:02, 27 May 2024 (UTC)Reply

IMO three uses, not all in the same language, should suffice to satisfy the CFI criterion. --Lambiam 16:33, 27 May 2024 (UTC)

Urgh. I've just spotted that Translingual isn't in the list of well-documented languages, which technically makes it a limited documentation language (i.e. only one citation is required). This obviously doesn't reflect actual practice, but we should probably correct that for clarity, particularly if there's consensus that we want to put more kinds of terms under the Translingual heading. Theknightwho (talk) 16:42, 27 May 2024 (UTC)

Stalking/harassment by User:Theknightwho

User:Theknightwho is an admin, but a controversial one. Last year, he faced a de-sysop vote. While he was allowed to retain his admin privileges, people on both sides expressed concern about his combative behavior, with comments such as, "he seems to not know when to stop and distance himself from a fight.", "Maybe Knight is a hothead, maybe Knight broke the rules" and "I have my issues with User:Theknightwho...and I agree with the statement that he's a hothead and argues way too much"

I'm afraid he's up to his questionable behavior again. I believe he is targeting me, stalking me, harassing me. And on top of that, some of the edits he made in targeting and harassing me...

On Saturday, he modified or undid my edits on three different pages in a fairly short amount of time. Per the "quacks like a duck" test, it is unlikely those
- I told him that those edits were inappropriate. He refused to see any problem
Later that day (it was either Saturday night or Sunday morning), he undid an edit I made to hot dog
- His edit was so bad it had to be modified only a few minutes later.
And today, he deleted seven redirects I created over the years. Some of them are acceptable redirects...for example, he deleted busted my neck, which as a conjugation of bust one's neck

What is to be done about Knight's stalking/harassment of me? Can somebody tell him to cut it out? Purplebackpack89 15:58, 27 May 2024 (UTC)Reply

Keeping an eye on a newly active (even returning) user, especially one making questionable edits, is not stalking or harassing. Vininn126 (talk) 16:12, 27 May 2024 (UTC)

It's acceptable to a point. Knight is beyond that point. He's following me around hither and yon after I told him it bothered me. It's likely he's following me around BECAUSE it bothers me. If the edits were that questionable, somebody else would've undone them. And please also address the fact that, in his haste, Knight himself has made questionable edits. Purplebackpack89 16:16, 27 May 2024 (UTC)

This accusation seems silly, to be honest. Vininn126 (talk) 16:20, 27 May 2024 (UTC)Reply

I reviewed your contributions, changed a small number of them, and deleted some (unused) redirects you created a long time ago because we don't tend to have redirects in mainspace. None of this is out of the ordinary, and it's something that lots of admins do as a form of quality control. It's not anything personal, and the overall aim is to guide you to being a better editor, which is why I tagged you with explanations of the changes I made when I edited don't tread on me.

Since then, you have posted on my talkpage multiple times trying to pick a fight ([15] [16] [17]), as well as that of another admin ([18]). I explained to you that I review the edits of lots of other users, and @Equinox also told you that it is perfectly normal for admins to review the edits of users who are potentially making mistakes. I'm not going out of my way to target you, I'm not drawing lots of attention to the changes, and I'm not being mean; what I mostly care about is making sure that entries are up to standard.

Accusing someone of "harassment and stalking" is quite a serious thing to do (and I'm not sure you've really grasped how serious an accusation that is), and quite honestly I'm mostly just baffled at this level of reaction to some pretty basic oversight. That being said, I'm not going to reply to this thread any further, because I don't think there's much point: you've made up your mind that everyone is out to get you, and that's just how it's going to be. Whatever your motivations here, I don't see this discussion going anywhere productive. Theknightwho (talk) 16:18, 27 May 2024 (UTC)Reply

Knight, where this needs to go is that you need to a) understand that the way you're going about this bothers me, b) admit that some of your edits themselves were problematic and c) disengage from me.

Even though making a claim of harassment is serious, I stand by it 110%. The fact is, AFTER I said that your "monitoring" bothered me, you went out and "monitored" my edits some more, even though a) some of the things you changed or deleted had been that way for months or years, and b) you KNEW that it was likely to bother me. Many other editors would have stepped away in that situation and it's surprising that you didn't. And, as witnessed from your behavior last year, this isn't the first time you've been combative like this Purplebackpack89 16:55, 27 May 2024 (UTC)Reply

So we shouldn't monitor editors and let them make questionable edits, like on sandwich? No thanks. Vininn126 (talk) 17:01, 27 May 2024 (UTC)

If the edits were that necessary, somebody other than Knight would make them. As for hot dog and sandwich, see below about why sandwich had to go Purplebackpack89 17:03, 27 May 2024 (UTC)Reply

That's ridiculous. Monitoring edits is monitoring edits, whether it's one editor or not. This is making a mountain out of a molehill. Memified classification can be presented neutrally in definitions; if you're not aware of that, I'm glad someone is keeping an eye on you. Vininn126 (talk) 17:06, 27 May 2024 (UTC)

There's two parts to this here. It's not only (a) should my edits be monitored, but (b) should Knight be the one doing it, and (b1) doing it in the manner he's doing. Just because my edits MIGHT need monitoring doesn't mean that it has to be Knight specifically doing it. Please also note another editor has expressed concerns with Knight's editing Purplebackpack89 20:16, 27 May 2024 (UTC)Reply

I saw. I also saw the "hounding" that was happening and I'd say it's an overstatement. Vininn126 (talk) 20:17, 27 May 2024 (UTC)Reply

I'm gratified to hear I'm not the only editor whom TKW stalks, harasses, and tries to intimidate. —Mahāgaja · talk 17:05, 27 May 2024 (UTC)Reply

Please don't make personal attacks - I have done no such thing, as you well know. If you have grievances, raise them, but what you've just said is a pretty clear-cut attempt to poison the well. Theknightwho (talk) 19:53, 27 May 2024 (UTC)Reply

Anyone who wants to see how you interact with me is welcome to peruse User talk:Mahagaja/Archive 28#Link templates in etymologies and User talk:Mahagaja#+ templates. In both cases, you went into my contribs to revert or re-edit my edits to bring them into conformance with your preferences, and in the first one you threatened to "report" me to the Beer Parlour for not editing the way you want me to. I consider that stalking, harassment, and an attempt at intimidation. —Mahāgaja · talk 20:17, 27 May 2024 (UTC)Reply

It's one thing to change a plus template to its non-plus equivalent, it's another to switch it to {{der}}. Vininn126 (talk) 20:19, 27 May 2024 (UTC)Reply

@Mahagaja could you link to this discussion, or at least give the approximate time it occurred?

@Vininn126 Irrelevant. This isn't about being right or wrong. That doesn't give an editor carte blanche to be unkind. Purplebackpack89 21:32, 27 May 2024 (UTC)Reply

@Purplebackpack89 how can you comment when you haven't even seen the conversation? Vininn126 (talk) 21:36, 27 May 2024 (UTC)Reply

@Purplebackpack89: The links are in my comment above. —Mahāgaja · talk 06:59, 28 May 2024 (UTC)Reply

@Mahagaja Alright, let's go over things, shall we? Aside from this thread, we've only properly interacted twice:

On the first occasion, what I took issue with was the fact you have been replacing many instances of {{inh}} with {{der}} purely because you're worried someone might change it to {{inh+}} ([19]). Leaving aside your view that {{inh+}} "is an abomination of a template that makes me angry every time I see it" (truly the Olympus Mons of all molehills), what you were doing - and may still be doing, for all I know - is genuinely detrimental to entries, because it reduces the accuracy of categorisation. This is something that I ([20]) and later Thadh ([21]) told you, and at no point (even now) have you made it apparent that you even understand this is what I (and others) are taking issue with, never mind say you'll stop doing it. Instead, you ignored me, carried on doing it anyway, then asserted that there's no policy saying you can't add {{der}} instead of {{inh}} ([22]) - which was never the issue - so I told you that I would start monitoring and reverting your edits, given that you'd made it very clear you had no intention to stop ([23]). This is not intimidation, harassment or stalking: it's a statement that the issue will be raised in a public forum for dispute resolution, and that harmful edits will be reverted; a very standard escalation of an issue in response to someone making it clear they have no intention of backing down. And let's be absolutely clear, here: the issue is not - and has never been - an issue of personal preference, because the display outputs of {{inh}} and {{der}} are identical. You are obviously a very intelligent person, so I find it extremely unlikely that you didn't understand precisely what the problem was, but evidently you didn't care: your worries about the plus templates took priority over the quality of entries, in your view, and that's all that mattered to you.
On the second occasion, Vininn126 reached out to you to try to compromise with you, since they added a plus template to a Lower Sorbian entry, which you then reverted. This wasn't a problem in and of itself, but I (and Vininn126) took issue with you saying "Even if I'm not doing a lot with Lower Sorbian at the moment, I did put in a lot of work on it when I created the entries, and it upsets me to see my hard work ruined." ([24]), which (a) was a pretty clear attempt to claim ownership, and (b) is just a completely ridiculous thing to say under the circumstances, really, which is what I said to you then ([25]); obviously this was not said in my capacity as admin. But... let's be frank, here: what is there in that thread which constitutes harassment, stalking or bullying? It's a firm conversation, where you gave as good as you got. No more; no less. Theknightwho (talk) 21:43, 27 May 2024 (UTC)Reply

You don't get it, do you, Knight? You say, "you gave as good as you got". NOTHING justifies you going low. NOTHING. And the fact that you're an admin now means you should be behaving BETTER, not worse. Purplebackpack89 23:36, 27 May 2024 (UTC)Reply

"A firm conversation" is not "going low". We're well beyond the looking-glass at this point, judging by the state of this thread. Theknightwho (talk) 23:47, 27 May 2024 (UTC)Reply

I'll pile on here as someone who didn't see this conversation before: replacing {{inh}} or {{bor}} with {{der}} just so people won't change it to another template later is something I can only describe with two words: abject stupidity. People who willingly make stupid edits and proudly proclaim that they're making stupid edits have no right to complain when people look over their edits to make sure they haven't done anything stupid. — SURJECTION / T / C / L / 16:24, 29 May 2024 (UTC)

ALL editors are entitled to dignity. They shouldn't be hounded or attacked. And Knight and Equinox have a rather sordid history in that regard. Purplebackpack89 16:26, 29 May 2024 (UTC)Reply

In this particular case, i.e. with regard to that template monitoring, it's safe to say that that did not happen. The complaint is not based in anything that happened. Vininn126 (talk) 16:31, 29 May 2024 (UTC)Reply

Just to clarify: stupid edits that the editor is expected to know are stupid, especially after multiple people have pointed out to them why they are stupid. — SURJECTION / T / C / L / 16:51, 29 May 2024 (UTC)

Sometimes you have a point (Equinox shouldn't have called you a moron the other day), but those reverts of Theknightwho's you talked about are unobjectionable. (entree instead of sandwich to describe a hot dog manages to be both obscure and ambiguous.) —Caoimhin ceallach (talk) 16:55, 27 May 2024 (UTC)Reply

Sandwich is actually more controversial than you might think: there's actually widespread disagreement about whether or not a hot dog is a sandwich. Entree probably wasn't perfect but Sandwich is questionable as well, albeit for different reasons Purplebackpack89 17:02, 27 May 2024 (UTC)Reply

@Purplebackpack89: TKW actually faced two desysop votes last year, the second being initiated by myself. Beyond that, I'm tired. There's a reason that I don't edit Wiktionary any more. WordyAndNerdy (talk) 22:43, 27 May 2024 (UTC)Reply

I also see Equinox is still grinding his ideological axe around here. Threw a transphobic insult at me out of the erroneous presumption I'm a trans woman. He ought to have remembered I'm a cisgender woman given he used misogynistic language in reference to me ("abused wife") last year. Whatever. This nonsense is the exact reason I don't contribute any more. Wiktionary collectively refuses to do anything about toxic editors and there's only so many times I can keep butting my head into the same wall. WordyAndNerdy (talk) 23:30, 27 May 2024 (UTC)

It's experiences like yours, @WordyAndNerdy that make it paramount to have real "no personal attacks" and "harassment" policies on this project Purplebackpack89 23:34, 27 May 2024 (UTC)Reply

Does collecting password-protected dossiers about Wiktionarians you don't like count as hounding or doxxing? WordyAndNerdy (talk) 15:32, 28 May 2024 (UTC)Reply

What does something Equinox did have to do with this thread? Theknightwho (talk) 15:48, 28 May 2024 (UTC)Reply

TKW's sudden unfamiliarity with indentation aside, can an uninvolved admin please look at this? Equinox releasing what appears to be a Pastebin doxx file is extremely concerning. WordyAndNerdy (talk) 17:13, 28 May 2024 (UTC)Reply

@WordyAndNerdy: This is going to be my first and hopefully only comment in this thread. If I'm going to be real, most folks here don't care. Compare the response to the announcement below, even some of the same users agreed that he should be sanctioned for his behavior and previous racist & anti-LGBTQ comments. It's disappointing but not surprising. The amount of negative experiences (especially for queer & BIPOC folks) the project is willing to overlook just because someone is active is baffling. It's an old guard in the truest sense, and that's why I'm trying to focus more on making sure that the work I do here is as silo-ed as possible, for better or for worse, though I understand why you wouldn't want to contribute at all. AG202 (talk) 17:40, 28 May 2024 (UTC)Reply

@WordyAndNerdy Yes it's concerning to me, although I can't read the files to know what's in them (they appear to be mostly screenshots of some sort?) and I'm not sure what can be done at this point since Equinox is already de-sysopped. In light of this and previous behavior, it might make sense to require a vote if Equinox decides to come back. Benwing2 (talk) 19:09, 28 May 2024 (UTC)Reply

Perhaps it would have been best to check they were password-protected before throwing around accusations of doxxing. Nobody has any idea what they are, and there is no way for any of us to find out. To be clear: I'm not saying it was a good idea - it's not, and it concerns me a lot - but WordyAndNerdy is not helping by fearmongering either. Let's try to keep our heads. Theknightwho (talk) 19:11, 28 May 2024 (UTC)Reply

@Benwing – Voluntary relinquishment of one's admin tools is not "de-sysoping." There has been no formal sanction against Equinox, nor does there seem to be any desire to take a critical look at the repeated, sustained policy transgressions on his part. Every few years the WMF launches another inquiry into the Wikimedia gender gap. The answer is quite simple, I'd hazard. How many women, POC, and LGBT people want to stick around in an environment that's become a toxic boys' club? Where it's seemingly acceptable for an admin in good standing to call people "abused wife" and the F-slur? Where pointing out a missing stair means being bullied, ostracised, and gaslit? "Information about Wiktionary's so-called Vanguard Party" isn't some incomprehensibly cryptic reference. Equinox has demonstrated a personal animus against users who called out his anti-LGBT axe-grinding on multiple occasions. WordyAndNerdy (talk) 21:21, 28 May 2024 (UTC)Reply

I agree that there is a problem with a lack of contributions from non-male, non-white etc. people, just like the huge imbalance in the wider software industry. And there seems little appetite to address problematic but productive contributors, partly because there are so few such people who are both competent and willing to put in the time and partly because many people are conflict-averse (I admit to being one of them). People have been blocked in the past for racist etc. behavior (e.g. I permablocked Dan Polansky for this), but it often takes much longer than it should. At the same time, do you think it makes sense to hold such an inquiry now for Equinox? It seems to me a bit like closing the barn door after the horse has escaped. This should clearly happen if/when Equinox decides to return (and you can see from Special:UserRights/Equinox that this happened once before, along with a de-sysopping for deleting the main page, followed by a later vote to restore his adminship), but I doubt there is an appetite to do anything now. Benwing2 (talk) 22:03, 28 May 2024 (UTC)Reply

I do wonder if we could change the culture here, make people less conflict-averse. (And also get some new faces in here.) CitationsFreak (talk) 22:23, 28 May 2024 (UTC)

I agree - I think much of the issue comes from having no structured conflict-resolution mechanism. That cuts both ways, of course: accusations made to harass other editors also go unpunished, which is a perennial problem we encounter as well. It's obviously no coincidence that WordyAndNerdy chose now - of all times - to reappear after many months. Theknightwho (talk) 22:36, 28 May 2024 (UTC)Reply

@Theknightwho What does the last sentence mean? Are you saying that User:WordyAndNerdy is somehow harassing Equinox by calling out examples of his prior bad behavior? That seems patently unfair, to say the least; she was on the receiving end of (what to me certainly looks like) abuse, and doesn't seem to have reciprocated the abuse. Benwing2 (talk) 05:39, 29 May 2024 (UTC)Reply

@Benwing2 I'll be completely clear: I think WordyAndNerdy saw this thread as an opportunity to start bashing me in a public forum again, which is why she picked this thread to make her first edit in over 8 months ([26]) and why 6 of the 12 edits she's now made have been here. She has had a grudge against me for a long time, which is no secret.

Let's take a step back for a second and actually look at what's going on here: Purplebackpack89 has made some very serious allegations with absolutely no basis in reality, Mahagaja saw this as an opportunity to do the same (and - surprise surprise - has mysteriously disappeared once I gave a thorough rundown of what actually happened), and WordyAndNerdy has decided to crawl out of the woodwork to pour fuel on the fire. If anything's harassment, it's threads like this, which I've had to put up with time and again because there are never any consequences for lying about other users, and some users have clearly figured that out. We're not Wikipedia, but I'm pretty certain all three users would have been sanctioned there for their conduct in this thread - potentially quite severely - since stalking, harassment, bullying and intimidation are all serious things to accuse someone of, and should not be done lightly.

I'm not going to defend Equinox's transphobic comments, but that happened after WordyAndNerdy commented in this thread, so let's not kid ourselves about why she was here in the first place or what her intentions were. Theknightwho (talk) 05:57, 29 May 2024 (UTC)Reply

@Theknightwho Thanks for the clarification; I do appreciate it. This deserves more discussion but I'm too tired now to write anything substantive, so it'll be tomorrow. Benwing2 (talk) 07:05, 29 May 2024 (UTC)Reply

Exaggerating? Maybe. It's not lying. If this were Wikipedia, you'd have never been made an admin. And you need to approach this with some introspection about how you interact with other people, because it should be obvious from this AND ELSEWHERE that several people don't like the way you interact with others. Purplebackpack89 16:39, 29 May 2024 (UTC)Reply

The examples you've taken are just correcting incorrect information. That's harassing? Vininn126 (talk) 16:43, 29 May 2024 (UTC)

I didn't want to comment further. But I see that TKW's modus operandi hasn't changed. Namely, he habitually responds to every complaint about his conduct as an admin as if it's an isolated incident to obscure the long-term pattern, then shifts to deflection when he can't deny his problematic conduct but steadfastly refuses to be held accountable for it. So I become a bully "crawl[ing] out of the woodwork" in service of a "grudge" instead of a past target of admin abuse coming forward to offer factual information to a current target. The thing TKW conveniently left out here is that my reversion at febfem was three minutes after my first BP post ([27][28]), was undoing a months-old drive-by edit by an IP rather than something Equinox did ([29]), and wasn't the only belated edit I made to entries on my watchlist ([30][31]). There was little reason for Equinox to intercede at febfem and absolutely no justification for the transphobic insult he threw at me. But TKW is seemingly invested in the narrative that only bad-faith actors criticise his conduct and that of his personal friend Equinox ([32][33]). That's why TKW apparently felt it necessary to state "but that happened after WordyAndNerdy commented in this thread" as if this somehow explains or justifies Equinox's abusive remark. It's easy to make sweeping and largely unevidenced accusations. It's a lot trickier to rebut bad-faith smears while also presenting a serious and painstakingly evidenced case of your own. That, I'd hazard, is what TKW banks on. That his targets get too worn out to pursue redress. That they stay silent for fear of retaliation. That Wiktionary's collective attention moves on before it examines his conduct too closely. He's played this game with me. He's done it with User:Huhu9001 and User:LlywelynII. And now he's doing it with PBP89. WordyAndNerdy (talk) 21:39, 29 May 2024 (UTC)

@Benwing2 This is the kind of comment I was talking about. It's largely irrelevant (I am not Equinox, and WAN's conduct after first commenting in this thread can't possibly explain away why she decided to post here in the first place), or simply untrue (I've condemned what Equinox did; I never tried to justify it). WAN has also tagged users she knows have personal issues with me, as a way to draw them into this thread, and used her own previous attempt to harass me (the desysop vote) as a justification for doing it again here, which is probably the thought-process that explains why this keeps coming up, quite frankly: you can't just keep re-trying someone over and over until you get the result you want, while saying "well, this has happened lots of times before so it must be true!"

It's textbook harassment, and it needs to stop. Theknightwho (talk) 23:34, 29 May 2024 (UTC)Reply

It is strange. User:Theknightwho has never been anything but civil and polite to me, and does all the right things: apologizes when he's wrong or makes a mistake, thanks me for contributions, etc. Yet, I see a pattern of fights happening between TKW and several other contributors that isn't the pattern for other admins. It's true that not everything User:WordyAndNerdy says is reasonable (e.g. just because TKW met Equinox once doesn't make them personal friends). It's also true that some of the contributors TKW has engaged in fights with are (IMO) either annoying, incompetent or generally hard to work with. But at the same time, TKW I think you need to ask yourself, is there a way I could have avoided low-level arguments with various users escalating into major personal conflicts? In any conflict or dispute, regardless of who's right or wrong, the onus is on both people to de-escalate; and this goes even if the other person isn't doing so. Not every provocation deserves a response; often, silence is golden, and will lead to the person who made the outburst ending up with egg on their face. Remember the Spanish proverb en boca cerrada no entran moscas. Benwing2 (talk) 00:43, 30 May 2024 (UTC)Reply

To be honest, TKW is starting to sound like how he thinks PB6 sounds. CitationsFreak (talk) 01:00, 30 May 2024 (UTC)Reply

I walked away last August as a form of de-escalation/self-care. That doesn't mean the matter was ever resolved. Nothing will ever change if TKW gets to keep shutting down any scrutiny of his pattern of problematic admin conduct by insisting it's "harassment" or a "grudge." This isn't the first time he's leveraged this get-out-of-jail-free card. He made similar arguments that I was acting out of a "personal grudge" and "harassing" him during last year's second desysop vote. I could point to instances of Equinox/TKW leaping to the other's defence in disputes. But Equinox's departure probably makes that point moot from an enforcement angle. I do agree with the suggestions made in this thread that 1) another RfA should be required if Equinox wants his tools back, 2) Wiktionary needs an official user conduct policy, and 3) Wiktionary needs a formal dispute resolution process. WordyAndNerdy (talk) 01:24, 30 May 2024 (UTC)

@Benwing2 I don't disagree with you, but it doesn't address the larger issue that there is no way to properly resolve disputes on Wiktionary. Theknightwho (talk) 01:31, 30 May 2024 (UTC)Reply

@Theknightwho That's not really a good enough response to what @Benwing2 said...you didn't admit any wrongdoing or room for improvement, instead trying to pawn off the issue on a lack of dispute resolution. You either DON'T seem to understand how to avoid escalation, or you don't care to. For example, when I expressed concern that you were targeting me, you could've stopped interacting with me and my edits for a while. You literally did the exact opposite, digging deeper and deeper into my edits, knowing full well that I was unlikely to be happy about that. You instead could've waited a while to dig into my edits, or let somebody else handle it. You need to remember that a) there is no deadline, and b) it's not necessary that any one editor make any one edit. Purplebackpack89 01:44, 30 May 2024 (UTC)

Maybe the obvious answer is he just sucks up to you. And it surely worked. You showed up in disputes involving him, instinctively assuming he is right before any investigation. -- Huhu9001 (talk) 03:58, 30 May 2024 (UTC)

Apparently en.wikt has some structural problems within its administrative design. Sysop rights here are virtually unrestrained. Admins have little motivation to control themselves or to confront their colleagues who share the same privilege. Victims of power abuses tend to leave, making it extremely difficult for desysop votes to succeed. And finally, anyone who at least tries to argue will soon be convicted of wikilawyering and made to shut up.

TKW is just another predictable result of en.wikt's failure to put admin rights in check. -- Huhu9001 (talk) 04:12, 30 May 2024 (UTC)Reply

I don't know about "another" since they've been far more immature, abrasively hostile, and abusive of power than any other admin I've ever run into on a Wiki project but the other admins (a) obviously dislike me more and have no interest in my opinion on the matter and (b) presumably either see some good they accomplish that the rest of us don't and/or use them as a designated enforcer for 'troublemakers'. Hard to think of another reason they've ignored the apparently ongoing issues with the guy. — LlywelynII 10:10, 30 May 2024 (UTC)Reply

Members of one social group usually have a preselected tendency to agree with each other unless there are strong reasons not to do so. Admins being "us", non-admins being "them" is enough to push them to downplay or justify abuse of sysop rights. What en.wikt displays here are typical symptoms of Internet forums with no restriction on their admins' behaviour.

I won't complain about those Internet forums because after all they can do whatever they want on their own private websites. But Wiktionary, which claims to be "the free dictionary", hmm. En.wiki is better in this aspect as it tried to design new policies or systems to alleviate this problem, while en.wikt just pretends everything is fine and TKW is the naughty boy everyone loves. -- Huhu9001 (talk) 00:40, 31 May 2024 (UTC)Reply

@WordyAndNerdy: Hey, I've no stake in the "TKW stalking/harassment" discussion (I mean, none of what Purple says looks like stalking or harassment, but that's all I care to say), but for context some might not have access to, Equinox posted this sometime in 2022 on the Discord server:

anyway. basically i've realised that most editors are shit. so i thought, what if we create a group, for only GOOD editors? i am going to call it the Vanguard Party. PM me if you want in.

He did not follow that with anything, it was kinda out of nowhere, so I have no idea what that was about or whether anything really happened, but given that's all he said about a "Vanguard Party" there ever, I don't think there's anything to fear except for a potential furtive circlejerk. (If this reads like there is no cabal, it's not, I'm really not one to defend Equinox's bullshit. I don't think he'd want a cabal with me in it, anyway.) Hythonia (talk) 22:10, 28 May 2024 (UTC)Reply

As an admin who has encountered unacceptable behavior on the part of other admins here and blocked for revdeled in the past, I have no problem blocking anyone for hate speech and have solicited that anyone who sees it, let me know. I also am not part of any cabal and am concerned about 1.) gross CHUDs ruining the Internet and 2.) encouraging marginalized populations to share free knowledge. —Justin (koavf)❤T☮C☺M☯ 22:34, 28 May 2024 (UTC)Reply

It's a joke, clearly. Nicodene (talk) 05:55, 29 May 2024 (UTC)Reply

Sure, my point was more so that even if it's not a joke, there's no need to ring the village bell in alarm. Hythonia (talk) 14:19, 29 May 2024 (UTC)Reply

I probably would have called it a sandwich too. I didn't know it was controversial, but I'm grateful to learn that it is. I see we're having quite some difficulty defining hot dog. We've gone from sandwich to entree, back to sandwich, to snack, dish, and finally food item now! I must say I am thoroughly amused. Who knew American cuisine was so elusive. —Caoimhin ceallach (talk) 23:06, 27 May 2024 (UTC)Reply

"Sandwich-style food item" could offer the best of both worlds. Includes "sandwich" early on in the definition for clarity. Remains agnostic on whether hot dogs qualify as sandwiches. WordyAndNerdy (talk) 23:57, 27 May 2024 (UTC)Reply

Fwiw, Christ, no. (Standard) hot dogs have absolutely no relation to sandwiches or sandwich-adjacent food items whatsoever, except among linguists and intolerable middle schoolers who misunderstand reductio ad absurdum arguments against poor definitions of sandwich and treat it like a "fun fact" instead.

Anyway, to @Caoimhin ceallach, American cuisine isn't usually subtle about much and this isn't either. Hot dogs just happen to be a sui generis thing that falls into a lot of cracks. You can treat it as a main course (American "entree"), snack, or side dish depending on the setting and some people—particularly in the upper class—might never associate it with some of those (or, in the upper middle class, feel obliged to pretend they never lived in a home where it was). Absolutely no one in the wild asks for a sandwich, gets a hot dog on a bun, and thinks the other person is sane and not trying to be obnoxious; but it is something that lexicographers needlessly struggle with because they don't want to take the time to define sandwich with all the cut-outs it actually has. — LlywelynII 10:33, 30 May 2024 (UTC)

Whether or not you consider a hot dog a sandwich, you have to withdraw from "hot dogs have absolutely no relation to sandwiches or sandwich-adjacent food items whatsoever". For starters, it fits our definition, to wit "dish or foodstuff where at least one piece, but typically two or more pieces, of bread serve(s) as the wrapper or container of some other food," perfectly. I am not expressing a view on the nature of a hot dog. I know it involves more than just coolly evaluating a definition. But what you said is absurd. —Caoimhin ceallach (talk) 10:54, 30 May 2024 (UTC)Reply

My position is a hot dog is enough NOT a sandwich that we CAN'T use sandwich in a dic def. Purplebackpack89 12:16, 30 May 2024 (UTC)Reply

TKW is not hounding. But I'm happy to take on that role, and hound the hell out of Purple! Denazz (talk) 22:47, 27 May 2024 (UTC)Reply

(I was going back and forth about whether to say anything, and find myself perhaps reassured to have been edit-conflicted by Benwing saying something somewhat similar.) I lament, not for the first time, that when disputes like this come up, it's usually two people who have each tended (in the specific case or in general) to behave suboptimally enough that it's hard to find consensus to take action to change anything, especially towards the more productive/powerful of the two. (Even in disputes involving me, like when I blocked Dan or AP295, it became clear in subsequent discussions that there are people who think I was misbehaving to do that.) Yes, TKW tends to think he's right and is combative (here's a less intense example from ~11 days ago). And as I remarked on Victar's talk page in a similar circumstance, (and as I see Benwing has observed above,) it does stand out just how many of the disputes like this that keep coming up TKW is part of. Yes, PBP's edits are often in need of being monitored and cleaned up (e.g. DONT TREAD ON ME). On a volunteer project, if no-one else is stepping up and saying 'I have time to monitor this', there does not seem to be appetite to tell the productive editor who is monitoring it (and doing myriad other useful things) to stop. (And some impatience when cleaning up edits from editors who keep making sub-par edits seems expectable; I myself lose patience sometimes, like here, where I undid an edit with a rude edit summary when I probably should, for formality's sake, have RFVed it.) There doesn't seem to be appetite for e.g. desysoping TKW (the votes on that had majorities opposed to desysoping). A temperament [TIL that word has an a and a whole extra syllable in it...] change also doesn't seem imminent. So I suspect we'll be having a discussion like this in another few months. I don't know what to do.
I do want to say, in my experience WAN and Mahagaja are both generally good editors (although I can't condone intentionally switching from a more specific to a less specific etymology template), and if they feel harassed (and I see why they do), I'm concerned. - -sche (discuss) 03:43, 30 May 2024 (UTC)Reply

Yes, I agree with you 100% in everything you say, both about User:WordyAndNerdy and User:Mahagaja being good editors and more generally. It is somewhat heartening that both WAN and TKW are wanting a more formal dispute resolution process; OTOH I don't want this process to work like it does on Wikipedia, where there seems to be a great deal of wikilawyering and formality, and as a result resolving disputes seems to take enormous energy on the part of all involved. That's why I am preaching voluntary de-escalation so that disputes don't come to a formal mediation process; but TKW's response to my last comment shows he doesn't really get it (yet, I hope). Benwing2 (talk) 03:54, 30 May 2024 (UTC)Reply

@Benwing2 Sometimes we do need to go over things in more detail, particularly when it boils down to whether someone is actually telling the truth. I do understand what you’re saying, and I don’t really know what else you expected me to say if you want me to be more hands-off in this discussion. Theknightwho (talk) 12:45, 30 May 2024 (UTC)Reply

"it boils down to whether someone is actually telling the truth."

Letting things get to the point where a dispute is about behavior is the mistake. Disputants should always ignore behavior in discussion and focus on the actual substance of how the substantive dispute relates to building a better dictionary. Behavior becoming the topic is a clear indication of a problem. When this happens the wise courses of action include taking time off (leaving (for a time) the last word to the other party), explaining one's reasoning on the substance at length based on agreed principles and values, etc. The foolish, destructive courses of action include continuing any discussion that is solely about behavior. I really hope that we do not end up with a legalistic approach to this kind of dispute. DCDuring (talk) 16:20, 30 May 2024 (UTC)

None of the mentioned actions taken by TKW, in my opinion, qualify as harassment or necessitate dragging this to BP. There's nothing wrong with going through a user's edits to fix mistakes you think they have made, even if you think doing so is excessive. Unless TKW was publicly shaming you for your mistakes and making posts making fun of you, I don't really see an issue. And since there has been no mention of TKW doing anything like that, I'm going to assume nothing like that occurred.

The only thing mentioned so far that I found genuinely concerning is the mentioned behavior of Equinox. I would support requiring a vote for adminship should he return. — SAMEER (؂・؄・؏) 06:31, 30 May 2024 (UTC)Reply

Let it be recorded that I have absolutely no confidence in User:Theknightwho. DonnanZ (talk) 10:14, 15 June 2024 (UTC)Reply

@Donnanz And I’ve got no confidence in you either :) Theknightwho (talk) 13:30, 15 June 2024 (UTC)Reply

That doesn't surprise me. However, it is you who is being discussed here. DonnanZ (talk) 13:42, 15 June 2024 (UTC)Reply

Also concerned about User:Denazz

Above, User:Denazz says "I'm happy to take on that role, and hound the hell out of Purple!" Also...

How is this behavior OK? Purplebackpack89 23:57, 27 May 2024 (UTC)Reply

User:Denazz is Wonderfool. Is there bad history between you two? Benwing2 (talk) 03:17, 28 May 2024 (UTC)Reply

I think he's trolling. CitationsFreak (talk) 03:57, 28 May 2024 (UTC)Reply

This is stupid. Ioaxxere (talk) 22:34, 28 May 2024 (UTC)Reply

Yeah, that was a dumb comment by Denazz, TBF. They were drunk-editing again. WF needs to improve their diplomacy Denazz (talk) 07:11, 30 May 2024 (UTC)Reply

Also this... Purplebackpack89 12:09, 30 May 2024 (UTC)Reply
That is not harassment. Vininn126 (talk) 12:15, 30 May 2024 (UTC)Reply

Adding etymology trees to English entries

Per the results of Wiktionary:Votes/2024-04/Allowing etymology trees on entries, I would like to apply these trees onto English entries for which the etymology tree has a depth of at least 3 (i.e., A -> B -> C) and a size (total number of terms) of at least 4. Ideally, as much as possible of this should be automated (probably necessitating help from @Benwing2, JeffDoozan). What do we think? Ioaxxere (talk) 19:39, 27 May 2024 (UTC)Reply
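To make the proposed threshold concrete, here is a minimal Python sketch of the eligibility check, assuming "depth" means the number of terms on the longest chain (so A -> B -> C has depth 3) and "size" means the total number of terms in the tree. The `EtymNode` class and the function names are hypothetical illustrations and are not part of the actual etymon module; a term reached by two different paths is counted once per path here.

```python
from dataclasses import dataclass, field


@dataclass
class EtymNode:
    """One term in an etymology tree (hypothetical structure)."""
    term: str
    parents: list["EtymNode"] = field(default_factory=list)


def depth(node: EtymNode) -> int:
    """Number of terms on the longest chain starting at this node."""
    return 1 + max((depth(p) for p in node.parents), default=0)


def size(node: EtymNode) -> int:
    """Total number of terms in the tree rooted at this node."""
    return 1 + sum(size(p) for p in node.parents)


def qualifies(node: EtymNode, min_depth: int = 3, min_size: int = 4) -> bool:
    """Apply the proposed rule: depth at least 3 and size at least 4."""
    return depth(node) >= min_depth and size(node) >= min_size


# A -> B -> C alone has depth 3 but only 3 terms, so it would not qualify.
chain3 = EtymNode("A", [EtymNode("B", [EtymNode("C")])])
# Adding a second source for A brings the size to 4.
branched = EtymNode("A", [EtymNode("B", [EtymNode("C")]), EtymNode("D")])
```

Under these assumptions `qualifies(chain3)` is false and `qualifies(branched)` is true, which shows that the two conditions are not redundant.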

Nowhere in that vote did it propose automation. Hard no support from me on that. -- Sokkjō 21:24, 28 May 2024 (UTC)Reply

@Ioaxxere I think it's too early to automate roll-out, as there's still the chance of issues cropping up once we start using it more widely. Theknightwho (talk) 11:22, 29 May 2024 (UTC)Reply

I'd say go ahead, but take it easy, so no automation. —Caoimhin ceallach (talk) 11:28, 29 May 2024 (UTC)Reply

Perhaps we can come back to this in a month or two once it's on more pages - I think we should test it on some high-traffic pages to see how readers react, and also we should try it manually on a few pages to see if there are any other bugs to work out. Once that's done, I think we can raise the idea of automating it again. Vininn126 (talk) 11:31, 29 May 2024 (UTC)Reply

@Vininn126: Will do. Ioaxxere (talk) 22:11, 29 May 2024 (UTC)Reply

see you later, baked potato

Guys! I joined in 2008 and I will finally disappear in 2024. It is time eh. I'll try to avoid any secret IP address edits. I will fully fuck off. And no grudges or bad feeling. I made a lot of great entries, I argued with a lot of people, but I don't hate anyone. I'm just not a "team worker". lol. In real life I am a self-employed computer programmer and I have a terrible attitude but on the other hand I really like speaking with users and learning what they want, and implementing it. I can be nice.

Remember the purpose of the wiki, it's about the created, not the creator. Whatever I wrote (whatever you write) will be fixed, improved, massaged, and fed into the AI that will control our hearts and livers some day. Anyway: been a pleasure. And I'm not dead or anything, you can always dig me up in some awful 8-bit retro computing hole. But I think 16 years was enough. I will no longer read an Italian restaurant menu like "IS THAT AN ENGLISH WORD THOUGH".

I've shifted or deleted most of my user page stuff. I have (of course) a lot of other materials, and dictionary-derived things, on my hard disk (wait! it's an SSD! that's not technically a hard disk -- okay okay I grew up with those blue 3.5" floppies), that may be of interest. I will stick 'em somewhere eventually -- probably not here because of "grey" (cough black) copyright areas etc. When I have cleaned up that data I will send it to User:Lingo Bingo Dingo @Lingo Bingo Dingo, yes you lucky girl. Someone I could trust.

Been a pleasure and genuinely think I have liked you all, even the ones I fought with regularly. Special kisses to WF. Bye. Equinox ◑ 23:04, 27 May 2024 (UTC)Reply

You've made a large number of helpful edits and those will always be appreciated. I hope it's all onwards and upwards for you. Let me know if there's anything I can do to help you exit. Should your admin rights be removed ASAP or should we wait until x weeks have passed? —Justin (koavf)❤T☮C☺M☯ 23:09, 27 May 2024 (UTC)Reply

I totally forgot. Please revoke 'em immediately! And thanks for your kind comments. Equinox ◑ 23:10, 27 May 2024 (UTC)Reply

Just to make sure this isn't missed, our bureaucrats are Benwing, Chuck Entz, EncycloPetey, Hippietrail, Paul G, Ruakh, SemperBlotto, and Surjection. No need to ping just yet (or spam mass-ping), but I'll contact someone if none of them manages to see this. Again, thanks to you, E. Be good and be well. —Justin (koavf)❤T☮C☺M☯ 23:14, 27 May 2024 (UTC)Reply

All the best! — Sgconlaw (talk) 23:16, 27 May 2024 (UTC)Reply

I have revoked User:Equinox's admin rights per user request, without prejudice. Benwing2 (talk) 05:40, 28 May 2024 (UTC)Reply

@Equinox All the best, and drop me an email or something so we can meet up in real life again at some point. Theknightwho (talk) 23:18, 27 May 2024 (UTC)Reply

o7. lattermint (talk) 23:52, 27 May 2024 (UTC)Reply

Enjoy TheGoodStuff.zip [34] This is most of what I have been working on, for the past months and years. Do me proud, Wiktionarians! Equinox ◑ 23:53, 27 May 2024 (UTC)Reply

Hmmm. Not sure how I feel about the password protected files in "information about Wiktionary's so-called Vanguard Party.zip", but I guess they are there and in that way for a reason, whatever it may be. Regardless, farewell and hope you remember to take care. —The Editor's Apprentice (talk) 11:09, 28 May 2024 (UTC)Reply

For the record, since the above link is set to expire in four days and the Wayback Machine is unable to get past the "one time" warning and archive the file, I've opted to retain a copy. If anyone, now or later, would like a copy, please leave me a message on my talk page or shoot me an email. Either way, I will find a way to get the file to you. Take care. —The Editor's Apprentice (talk) 22:22, 30 May 2024 (UTC)

choosing to seethe about trans people as a swansong is bretty funny desu —Fish bowl (talk) 03:21, 28 May 2024 (UTC)Reply

Farewell, legend. You will be missed. — justin(r)leung _{{ (t...) | c=› }} 05:47, 28 May 2024 (UTC)Reply

In a while, crocodile. Nicodene (talk) 06:29, 28 May 2024 (UTC)Reply

Best of luck, we'll truly miss you! Thadh (talk) 06:33, 28 May 2024 (UTC)Reply

Bis später, alligator. — Mnemosientje (t · c) 07:45, 28 May 2024 (UTC)Reply

I will no longer read an Italian restaurant menu like "IS THAT AN ENGLISH WORD THOUGH".

Was that your trigger for introspection? :-) anyways, I wish you good luck in all endeavors, including analyzing the emotions of football supporters. ;) Shoshin000 (talk) 07:57, 28 May 2024 (UTC)Reply

The dictionary wouldn't nearly be what it is today without your work. Thank you Vininn126 (talk) 08:14, 28 May 2024 (UTC)Reply

Thank you for your service! Allahverdi Verdizade (talk) 09:15, 28 May 2024 (UTC)Reply

i will miss you as well. it's a volunteer project, so it can be a lot of fun one day and an aggravating chore the next, so i hope to see you back someday, but i understand if you decide to burn this bridge. best wishes, —Soap— 10:14, 28 May 2024 (UTC)Reply

Goodbye Equinox! I thought you would one day achieve one million edits. Maybe Wonderfool is up for the task... Ioaxxere (talk) 13:03, 28 May 2024 (UTC)Reply

Paul, even though you often cockblocked me from adding jokes to Wiktionary mainspace, I will miss you. Vahag (talk) 13:41, 28 May 2024 (UTC)Reply

See you in the funny papers. o7 Binarystep (talk) 15:47, 28 May 2024 (UTC)Reply

Sorry to see you go, Equinox. I'll miss having you around! Now who's going to add entries for all the missing words I've collected! Am I going to have to do it myself?? Andrew Sheedy (talk) 17:26, 28 May 2024 (UTC)Reply

You and your valuable contributions will be missed.. you touched the hearts of many editors, and every article I ever created. o7 LunaEatsTuna (talk) 22:52, 28 May 2024 (UTC)Reply

Best of luck in whatever you'll be up to from now on :) I must say, your leaving's left me feeling a big loss, even though I've rarely (or never) talked to you, because I know how much your absence will be felt now that your crazy work ethic isn't with us anymore! I must say I took that you'd be here forever for granted. Have another wonderful 16 years... even better if you do get hooked again and come back. :) Kiril kovachev (talk・contribs) 20:20, 29 May 2024 (UTC)Reply

You've obviously made a lot of valuable contributions to Wikipedia...BUT...I am very concerned about the file you dropped on your way out... — This unsigned comment was added by Purplebackpack89 (talk • contribs) at 16:34, 29 May 2024 (UTC).Reply

Farewell, Equi, thanks for your massive contributions and enjoy your new "free" time. We shall miss our MVP (guess which sense is meant, HUH?!). ~~←₰-→~~ Lingo ^Bingo _Dingo (talk) 21:41, 30 May 2024 (UTC)Reply

I’m much saddened to see you go. You are the god of Wiktionary, and it’s been an honour to have interacted with you. Your contributions have inspired hundreds or thousands of editors to work tirelessly and ceaselessly. Given the astoundingly gigantic work you have accomplished, I will probably keep imagining that you would continue editing here for 16 decades more, haha. You are immortalized in the best lexicographical project on earth by dint of the greatest number of edits published on en.wikt by an individual (namely yourself). Fare you well. Happy retirement beloved Captain Eq.

P.S. Will make sure, when I am more active in future, to follow your example & edit/research 12 hours a day & sleep for the remaining duration! Inqilābī 14:08, 31 May 2024 (UTC)Reply

I've been avoiding commenting because, given what you chose to do right before you left, it seems ill-advised to admit I'm sad to see you go. Based on a few comments you made at different times, I got the sense that you thought I didn't like you?, but in fact (as I tried to convey by thanking you for various helpful edits and joking with you about your post about the police helicopters and such), while we disagreed on some things, and I can't condone the times you strayed into unacceptable conduct towards others (which you did need to be reprimanded for), I appreciated and appreciate your many contributions to the dictionary. I want to say that, because I did also block you on 2. June (for 1 minute, because you already left so I didn't feel the need to work out what longer length would actually be proportionate) to put in the record—because recent BP discussions have shown that it needs to be made clear—that the conduct I referred to in my block summary was/is unacceptable for an editor and unbecoming of an admin. I hope you enjoy your wikiretirement, and if you ever return, I hope it's to do what you mostly did, and what we should all aspire to mostly (or even exclusively, if we're aspiring!) do, make copious useful contributions to Wiktionary's coverage of languages. :) - -sche (discuss) 01:55, 4 June 2024 (UTC)Reply

Crikey, just saw this. You and I didn't cross paths all that much, but I've appreciated your attention to detail and passion for the project. Good luck in your further endeavors, and may they be at least as rewarding as bashing words together here! ‑‑ Eiríkr Útlendi │^{Tala við mig} 20:14, 6 June 2024 (UTC)Reply

How strange! We did meet a few years ago in Oxford. DonnanZ (talk) 23:33, 12 June 2024 (UTC)Reply

Wonderfool leaving too

To start a new, better life. Denazz (talk) 23:23, 27 May 2024 (UTC)Reply

Not exactly beating the "Equinox's secret identity" allegations... Binarystep (talk) 04:30, 28 May 2024 (UTC)Reply

They’re retiring together to a life of marital bliss in a lovely old cottage in the Midlands. Nicodene (talk) 21:47, 28 May 2024 (UTC)Reply

A new life without the kid? Vininn126 (talk) 08:14, 28 May 2024 (UTC)Reply

You've been back before, so maybe this isn't goodbye—anyway, all the best to you as well! — Sgconlaw (talk) 17:37, 28 May 2024 (UTC)Reply

I don't believe Denazz ever left, judging by the contributions at least :) Kiril kovachev (talk・contribs) 20:42, 29 May 2024 (UTC)Reply

@Kirik kovachev: ha ha … — Sgconlaw (talk) 12:25, 31 May 2024 (UTC)Reply

change the default caption of Template:audio to be "Audio"

The majority of uses of {{audio}} include an explicit caption that reads "Audio" or similar. I propose to make "Audio" the default caption and add support for accent qualifiers (à la {{a}}) so you can write e.g. {{audio|en|File.ogg|a=US}} to get a caption reading "Audio (US)" with appropriately linked "US". Benwing2 (talk) 23:16, 28 May 2024 (UTC)Reply
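The proposed defaulting behaviour can be sketched in a few lines of Python. This is only an illustration of the logic described above, not the template's real implementation (which lives in a Wiktionary module); the function name `audio_caption` and its parameters are assumptions, and the linking of accent labels such as "US" is left out.

```python
def audio_caption(caption=None, accents=()):
    """Build the displayed caption for an audio line (hypothetical helper).

    caption: explicit caption text, or None/empty to use the default.
    accents: accent qualifiers, e.g. ["US"], shown in parentheses.
    """
    base = caption if caption else "Audio"
    if accents:
        return f"{base} ({', '.join(accents)})"
    return base
```

With this sketch, `audio_caption()` gives "Audio", `audio_caption(accents=["US"])` gives "Audio (US)", and an explicit caption still overrides the default.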

No objection, provided there’s a way to override the default where required. — Sgconlaw (talk) 23:41, 28 May 2024 (UTC)Reply

Yes, this is just the default. The caption param will still work (and I can support the use of - for no caption if there is demand for this, although IMO there should never be a missing caption). Benwing2 (talk) 23:42, 28 May 2024 (UTC)Reply

Thanks. Actually I was wondering if there’s a way to use this template within {{WOTD}} so that a caption can be added in some manner, and perhaps even multiple audio files used. At present I recall that using {{audio}} within {{WOTD}} causes errors (though I haven’t checked). — Sgconlaw (talk) 23:48, 28 May 2024 (UTC)Reply

Yeah this template is crying out to be revamped. Not sure what the issue is with {{WOTD}}. Benwing2 (talk) 00:44, 29 May 2024 (UTC)Reply

Support, though I think there should be a colon for consistency with the other pronunciation templates. Binarystep (talk) 03:52, 29 May 2024 (UTC)Reply

Support Ioaxxere (talk) 22:11, 29 May 2024 (UTC)Reply

Ok, I have implemented this and revamped {{audio}}. There are now several new parameters to support accent and regular qualifiers, specifying the text of the snippet, specifying the IPA of the snippet, etc. The third param defaults to Audio and IMO should rarely be changed; use one of the new parameters to specify additional information. I will be cleaning up existing uses of |3= in a day or so (there are around 800,000 pages that transclude {{audio}}, of which about 600,000 actually call {{audio}} directly, so it takes about 35 hours just to download all the uses for offline cleanup). Benwing2 (talk) 06:45, 31 May 2024 (UTC)Reply

Symbol of le day?

Let me cook for a second. I am very biased because I absolutely love semiotics, specifically symbols like pictograms, emoji, road signage etc., but I reckon a Symbol of the Day (or similar) could be pretty interesting.

Wiktionary has over 43,815 translingual symbols and slightly more when we include symbols from other languages (most symbols are translingual), unfortunately none of which are eligible for Word of the Day (or the Foreign Word of the Day) because they both exclude symbols from being featured, citing that they "cause problems for readers who lack the required fonts". I think this applies mostly to emoji, which have different designs (that subsequently influence their meanings) on every platform/operating system. And, while there are indeed less supported symbols, Foreign Word of the Day has nevertheless featured language using scripts that are not supported on a number of devices, like Gothic, Nastaliq, cuneiform and Aghwan (which appears as boxes on my phone). Regardless, the solution to this, while unfortunate, is to just exclude emoji and the more unsupported symbols altogether (which usually, albeit not always, tend to be rarer anyways); or to use images if necessary. We already appear to do this for Egyptian hieroglyphs instead of using their Unicode equivalents, so it is possible.

But the reason I would like them featured is because symbols are awesome and I think that including symbols would certainly add value and probably intrigue a few of our readers. Symbols include stuff like pictograms, emoji, emoji combinations, ideograms, icons, emoticons, letters and punctuation (when considered in isolation) and general symbols concerning astronomy, mathematics, currency, musical notation, computing, electronics, alchemy, packaging etc. I know there are quite a lot of people who really like symbols like the infinity symbol ∞, pi π or the heart ♥, and stuff like the tick ✓, smiley :) and pointing hand ☞ are near-universal and have a lot of cool senses. There is also etymological value: not many readers will know that the universal recycling symbol ♻ was created in 1970 for a competition, or that the power button ⏻ is (possibly) a combination of the universal on ⏽ and off ⭘ buttons. There is also punctuation that is pretty interesting, like ; or ⸻ which is funnily long and has three interesting senses.

The only downside I can think of having this (which is a valid concern) would be creating unnecessary whitespace on the main page, especially if this was to be added below Foreign Word of the Day. Anyways, what are y'alls thoughts on this? I will not pursue this further if there exists no enthusiasm for it, BTW, hence why I am asking for opinions here before attempting to formally propose anything. :3 LunaEatsTuna (talk) 23:36, 28 May 2024 (UTC)Reply

Support. Binarystep (talk) 03:53, 29 May 2024 (UTC)Reply

It’s okay. They specifically exist for marketing. The attention emojis attract cross-finances underprivileged script encoding.

We can also make an occasional expansion of our English and Foreign Words of the Day by a Symbol of the Day, say every Sunday; you won’t like the routine of an actually daily featured symbol. Fay Freak (talk) 04:08, 29 May 2024 (UTC)Reply

Support Ioaxxere (talk) 22:11, 29 May 2024 (UTC)Reply

Maybe a weekly symbol? I'm not overly enthused about a daily symbol. We can always do it more frequently later on if there's sufficient interest. Andrew Sheedy (talk) 16:35, 31 May 2024 (UTC)Reply

That sounds reasonable enough! Fay Freak suggested this as well. I am happy to hear that there is support for this nonetheless :) LunaEatsTuna (talk) 23:44, 1 June 2024 (UTC)Reply

Out-of-process deletions of rfv-quote

I'm broadcasting this as I don't know the capabilities and cliques of the administrators. Could someone please stop the out-of-process deletions of the {{rfv-quote}} notice at Sanskrit अमृत (amṛta). --RichardW57m (talk) 14:26, 30 May 2024 (UTC)Reply

Richard, to begin with, your RFV tag was not valid. But I assumed good faith on your part, preferring to believe you were merely mistaken, in the way of the honest mistakes that everyone often makes. I tried explaining the matter to you; I and @Svartava took time to even show you evidence that the way of writing that you are calling illegitimate really exists. And you have flippantly ignored our points and have obstinately continued to stall this discussion through convulated reasoning. Were I to extend the same courtesy to you that you extended to us, I should have disregarded your concerns and blocked you from editing that page. Now, having dealt with you in the past, I know that trying to reason with you is futile as I know you will do anything but concede. Why are you decrying a mythical "clique" of administrators here at the Beer Parlor? Are you trying to mislead people who are not involved in this issue and have no knowledge of the subject matter? You are being disruptive at this point. -- 𝘗𝘶𝘭𝘪 𝘮𝘢𝘪𝘺𝘪^{(𝘵𝘢𝘭𝘬)} 14:51, 30 May 2024 (UTC)Reply

This has gone further than it should have. Please stop dragging this out, @RichardW57m; it is leading us nowhere, since you remain unconvinced even after the irrefutable proof already shown to you in the discussions. Let's not argue in vain and rather go back to contributing. Svartava (talk) 15:09, 30 May 2024 (UTC)

Inflections of PIE terms


Inflections of PIE terms seem to be created inconsistently. For example, Proto-Indo-European *h₁me (accusative of *éǵh₂) is a redlink whereas *n̥smé (accusative of *wéy) has its own entry. Some inflections can also be redirects, such as *íh₂. I think for consistency there should be a system for determining which entries should exist. Here are a few options:

Option 1: Always create the entry.
Option 2: Either create the entry or redirect, depending on how many attested descendants there are (not sure what a good number is).
Option 3: Always redirect.
Option 4: No inflection entries at all.

Pinging @Saph668. Ioaxxere (talk) 15:11, 30 May 2024 (UTC)

Technically, is it possible to put the etymon template on a redirect page to continue the tree? If so, I say option 2. I also think that we should handle cases and gender differently. -saph 🍏 15:45, 30 May 2024 (UTC)

@Saph668: Putting templates on a redirect page is (for lack of a better word) cursed. But I think it is technically possible. Ioaxxere (talk) 15:50, 30 May 2024 (UTC)

We can keep the tree from displaying by keeping |tree= blank, right? Or is it just that templates generally don't go on redirects? -saph 🍏 16:15, 30 May 2024 (UTC)

@Saph668: It's just that redirects are not meant to have any content. Because as soon as you add content, it might as well just be a proper entry, right? Ioaxxere (talk) 16:30, 30 May 2024 (UTC)

Then let's do full entries for gendered inflections (with etymon templates) and redirects for case inflections (with etymon template on the lemma). For the latter, I think you can choose to override the display text of the hyperlink to show the inflected form. -saph 🍏 17:02, 30 May 2024 (UTC)

Middle Chinese links in etymologies.


There are many Japanese entries with Middle Chinese links in their etymology sections. One example is 獅子#Japanese. It says "From Middle Chinese 獅子." but 獅子 is linked to [[獅子#Middle_Chinese]]. We don't have any Middle Chinese entries in our dictionary. Should we just link all Middle Chinese words to Chinese entries? 列维劳德 (talk) 14:43, 31 May 2024 (UTC)

We do have Middle Chinese, but everything Sinitic is grouped under the "Chinese" header. This is a special case, so it causes problems with link targets. It shouldn't be too hard to solve. Theknightwho (talk) 15:44, 31 May 2024 (UTC)

TBH in this case the MC term should be linked to with {{ltc-l}} which would correctly link to the #Chinese section. lattermint (talk) 16:59, 31 May 2024 (UTC)

@Lattermint It would be good to do away with language-specific link templates if we can, because they cause problems when they need to be used in places like etymologies instead of the proper templates. Theknightwho (talk) 20:26, 31 May 2024 (UTC)

@Lattermint @Theknightwho Agreed. It seems it should be easy to implement this by adding a language-specific flag redirect_links or something that specifies the lang code to which links to that language are redirected. Benwing2 (talk) 08:25, 6 June 2024 (UTC)

The actual wikicode at 獅子#Japanese was this:

{{der|ja|ltc|獅子|lit=lion + (diminutive suffix)|sort=しし|tr=ʃˠiɪ t͡sɨ^X}}

To resolve the linking issue, I'm changing that now to:

{{der|ja|ltc|-}} {{ltc-l|獅子|lit=lion + (diminutive suffix)}}

If you all could update {{der}} and {{inh}} to obviate the need for such workarounds (so {{der|ja|ltc|TERM}} links to [[TERM#Chinese]] as appropriate), that would certainly be helpful. ‑‑ Eiríkr Útlendi │^{Tala við mig} 20:11, 6 June 2024 (UTC)
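
For illustration only, here is a rough sketch of how the per-language redirect flag suggested above might behave. It is written in Python rather than as an actual module, and everything in it is an assumption: the `LANGUAGES` table, the `redirect_links` field name (borrowed from the suggestion above), and the helper functions `link_target` and `wikilink` are hypothetical and do not reflect how the real link modules are written.

```python
# Hypothetical per-language data. The "redirect_links" field is an assumed
# name taken from the suggestion in this thread, not an existing setting.
LANGUAGES = {
    "ja": {"name": "Japanese"},
    "zh": {"name": "Chinese"},
    "ltc": {"name": "Middle Chinese", "redirect_links": "zh"},
}


def link_target(term: str, lang_code: str) -> str:
    """Return the page-plus-section target for a term in the given language."""
    data = LANGUAGES[lang_code]
    # If the language redirects its links, use the target language's header;
    # otherwise fall back to the language's own header.
    section_code = data.get("redirect_links", lang_code)
    section = LANGUAGES[section_code]["name"].replace(" ", "_")
    return f"{term}#{section}"


def wikilink(term: str, lang_code: str) -> str:
    """Build a piped wikilink that displays the bare term."""
    return f"[[{link_target(term, lang_code)}|{term}]]"


if __name__ == "__main__":
    print(wikilink("獅子", "ltc"))  # Middle Chinese link lands in the Chinese section
    print(wikilink("獅子", "ja"))   # Japanese link is unaffected
```

With a table like this, a generic template call with the `ltc` code would resolve to the Chinese section without a language-specific link template, while languages without the flag keep their own section anchor.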

I think what is needed here is for the automatic MC transliteration in regular templates ({{m}}/{{der}}/{{cog}} and so on, not sure what the exact module responsible for that is) to be able to replicate the exact behavior of {{ltc-l}} which first and foremost includes using the Baxter transcription, as right now the automatic transcription is not reconstruction-agnostic and uses the Zhengzhang Shangfang reconstruction by default, which is the first reconstruction listed in the MC sections in Chinese entries, as seen here. Another thing is that right now the transcription is only going to be shown if it is present on the Chinese entry in question (i.e. enabled with |mc=y), regardless of the existence of ltc-pron module data for the individual characters. Plus there is currently no way to specify different rime readings using |id= with anything other than {{ltc-l}}. lattermint (talk) 20:46, 6 June 2024 (UTC)Reply

June 2024

How to resolve conflicts on Wiktionary


Thou wilt quarrel with a man that hath a hair more, or a hair less, in his beard, than thou hast[.] — William Shakespeare, Romeo and Juliet

Throughout the lifetime of this online dictionary, there have been plenty of conflicts between users. Some of this is unlikely to end any time soon, such as the fight between admins and vandals. This fight has a clear "good guy" and "bad guy", unlike some of the other fights we have had over the years. These morally-conflicting fights often turn into virtual bloodbaths, with people hurling vicious insults at each other, at each other's throats over who's harassing who or if some person should be banned. While these are important conversations to have, too often is the main point ignored in favor of calling people "idiots". This problem has been pointed out before [cf Beer Parlour July 2023 § "please reduce the heat"], but nothing seems to be done on the topic, and we seem to keep going in circles, never reaching the point where we can have civilized discussions.

I intend to change that. Please, leave your thoughts as to how we can avoid any future conflicts for good. CitationsFreak (talk) 04:13, 1 June 2024 (UTC)

In principle, there is no way to stop all conflict, of course, but a really good start would be an expectation of civility and admins enforcing that. —Justin (koavf)❤T☮C☺M☯ 06:38, 1 June 2024 (UTC)

Would that just be the way to end all heated conflicts? CitationsFreak (talk) 14:48, 1 June 2024 (UTC)

Clearly not all of us have the same standards for civility and for the need for civility in all interactions with others. Further, some don't seem to care much about feedback from others about their behavior. In some cases, people seem to get very annoyed that others might be potentially causing them to waste their precious time, taking them away from their sacred mission to improve Wiktionary, by their lights. I'm pretty sure that the folks whose behavior I most object to are supremely confident that they are right and that civility is for others who are not on the sacred mission they have defined. DCDuring (talk) 15:04, 1 June 2024 (UTC)Reply

Pobody's nerfect and no system is perfect, but it's a start. —Justin (koavf)❤T☮C☺M☯ 20:09, 1 June 2024 (UTC)

Conflicts are not always bad if people are arguing about something they both care about. I guess that the biggest problem is when people call each other bad words when arguing about some stupid stuff like a definition of a sand broom, without caring about anyone’s input. It feels like they are either too drunk or not drunk enough. However, luckily, it’s not happening so often. Tollef Salemann (talk) 15:38, 1 June 2024 (UTC)

I think @Theknightwho should try to make an effort to participate in drama less. I think we can be reasonable and agree that nothing good came out of engaging with Wiktionary:Beer parlour/2024/May#Stalking/harassment by User:Theknightwho which was a pretty obvious (yet successful) attempt to fan up drama. Ioaxxere (talk) 18:46, 1 June 2024 (UTC)

@Ioaxxere Sure, but there needs to be a way to resolve issues that doesn't just amount to ignoring them. Theknightwho (talk) 18:57, 1 June 2024 (UTC)

Yeah, Help:Dispute resolution doesn't offer much guidance except, hilariously, relax and do something more important and assume that they [the other user you're disputing about] are eccentric and will thus never be able to see eye to eye with you. Hardly worthy of a Nobel peace prize. P. Sovjunk (talk) 19:11, 1 June 2024 (UTC)

It's rather pithy... but I also sometimes feel this way looking at drama from the outside in. Vininn126 (talk) 19:49, 1 June 2024 (UTC)

@Theknightwho Sometimes stepping away from things IS the best course of action. You need to do that more often Purplebackpack89 06:02, 5 June 2024 (UTC)

@Purplebackpack89 Given the amount of friction you're creating, you actually might want to do the same. Benwing2 (talk) 06:15, 5 June 2024 (UTC)

@WordyAndNerdy: Do you have feedback here? I'd value it. —Justin (koavf)❤T☮C☺M☯ 20:10, 1 June 2024 (UTC)

It isn't feasible to "avoid any future conflicts for good." Such an approach will only worsen conflicts that inevitably emerge. Ignoring problems doesn't resolve them. It's like putting a lid on a pot and expecting it not to make a huge mess when it boils over. If TKW had been given guidance at an early stage, this issue may not have grown to this extent. Now there are at least five productive contributors (me, Huhu9001, LlywelynII, Mahāgaja, Purplebackpack89) who find his admin conduct to be a recurrent issue. Wiktionary desperately needs both formal dispute resolution processes and the willingness to enact them. This isn't about "fan[ning] up drama." Seeing it characterised as such with no pushback (except from TKW, to his credit) does little to reassure me that Wiktionary is interested in having difficult conversations as a community and making necessary systemic changes. I have noted that TKW hasn't been as combative as in past incidents. That gives me hope that there's room for course correction. But my continued participation here is contingent on resolving the current policy vacuum. We cannot have a repeat of an admin (Equinox) functionally being given carte blanche to be as hostile and combative as he pleases for years because he also makes valuable contributions. WordyAndNerdy (talk) 21:54, 1 June 2024 (UTC)

Huhu and Purple have both been criticized for not being productive. Mahāgaja's complaint has been addressed as their behavior was problematic. Please don't ignore these aspects in your diagnosis. Vininn126 (talk) 22:00, 1 June 2024 (UTC)

Also LlywelynII has been heavily criticized for being sloppy. The term "productive" is being used too loosely here. Vininn126 (talk) 22:01, 1 June 2024 (UTC)

Huhu9001 seems to do solid work in the Japanese language area and in template-space. My understanding is that the dispute between TKW and Huhu9001 arose over changes that TKW made to modules that ended up unintentionally breaking things. So, if LlywelynII can be faulted for "being sloppy," so can TKW. Purple has been around as long as I have. Wiktionary has shown habitually sloppy editors (Luciferwildcat) the door before. I wouldn't necessarily number Purple among them. In any case, the common denominator in these disputes is TKW, not anything any editor did to get on his radar. It also needs to be underscored that all of these disputes were unrelated. TKW has found himself at the centre of multiple heated disputes with unconnected editors working in different areas of this project. That isn't a coincidence. It's a sign of a pattern of escalating and personalising conflicts. WordyAndNerdy (talk) 22:19, 1 June 2024 (UTC)Reply

This only half-addresses the issues I raised with hand-waving. A user frequently (I admit, too frequently) addressing sloppiness in others' edits does not make their edits not sloppy. From Purple I have seen 10x more drama and the issue "is a hot-dog a sandwich", which I'd hardly call productive. Huhu has been criticized by others, as well, and is known to be abrasive in conversations. So no, it's not just knight there, it's also an uncooperative personality. I find your reply to be lacking. Vininn126 (talk) 22:23, 1 June 2024 (UTC)

Isn't this thread now devolving into precisely the kind of escalation that it was designed to stop? Theknightwho (talk) 22:46, 1 June 2024 (UTC)

This seems like a seeing-the-forest-for-the-trees situation at best. Whether a rank-and-file contributor is insisting hot dogs qualify as sandwiches (if subs are sandwiches, so are hot dogs, FWIW) is perpendicular to the issue of problematic admins. Huhu9001 having been "abrasive" at some point doesn't justify an admin becoming hostile in kind. It absolutely did not justify TKW implementing a blatantly retaliatory block against Huhu last year. Admins have more power than rank-and-file editors. They need to be held to a higher standard of conduct accordingly. They absolutely shouldn't take administrative action in disputes in which they are personally involved. Admins aren't frontier sheriffs. They shouldn't be making and enforcing policy at their own discretion. Power necessitates accountability and a certain level of restraint. What has been core policy on every other WMF project for decades shouldn't be weirdly controversial here. We shouldn't have a culture in which everyone nods along as an editor (not TKW, to be clear) with a history of inserting Daily Stormer quotes votes against an anti-harassment proposal with inane blather about "wokery" and the suggestion that PB89 seek "treatment for paranoia." This discussion is doing nothing to relieve my sense that Wiktionary loves being a boys' club. It really does seem that some users will be forgiven any trespass, however severe, while others, no matter how much good work they do, will be summarily dismissed and denigrated and blamed for inviting the hostility to which they've been subjected. WordyAndNerdy (talk) 02:44, 2 June 2024 (UTC)

1) An admin doing their job by addressing sloppy edits is a good thing.

2) Fayfreak’s point about “wokery” seems à propos given your throwing out questionable accusations of misogyny and now, apparently, Nazism. Apparently pointing out that a user is being a bit high maintenance means one must hate women. And pulling a random collection of usage examples from Google that happens to include some kind of far-right tabloid rubbish means you might as well be merrily goose-stepping and Heil Hitlering your way to the Reichstag. Nicodene (talk) 04:43, 2 June 2024 (UTC)Reply

Ugh, User:Nicodene, you are not doing yourself any favors with this post, and you are aptly illustrating User:WordyAndNerdy's point about Wiktionary being a boys' club. Benwing2 (talk) 04:53, 2 June 2024 (UTC)

Also, IMO Fay Freak is in a class of their own with their weird views (and contorted syntax). They know a ton about obscure languages but tend to go off on bizarre rants/tangents that are best ignored; I would not hold them up as an example to be emulated. Benwing2 (talk) 05:00, 2 June 2024 (UTC)

@Benwing2 Given that neither I nor WAN are happy about this, it seems fairly clear that the underlying issue is not that this is an old boys' club, but that there is no adequate way to resolve conflicts, because consequences are essentially arbitrary, and there is a culture of admins allowing things to peter out instead of actively drawing things to a close. WAN has concluded it's because of nepotism because she's only considering me (and now, apparently Fay Freak), but doesn't seem to realise that she's got away with quite a lot of disruptive behaviour herself, and it's not like people haven't noticed ([35]). Theknightwho (talk) 05:15, 2 June 2024 (UTC)Reply

Name one example of "disruptive behaviour" on my part. Since I'm allegedly guilty of so many you ought to be able to name one. Our clashes at shitgibbon and cupsona don't count. Neither of us behaved with the decorum we ought to in both instances. I've never deleted the main page. I don't habitually insert nonsense into entries. I don't add translations for languages I don't know. I think the most dust I've ever kicked up is over a user having a Patreon in 2015 and the weird resistance to accepting online cites in 2020-2022. And in both cases I just voiced my opinions and left for a time. Rather the opposite of "disruption," I'd say. Unless you're insinuating that not contributing is itself a form of disruption. In any case, you're deflecting again. WordyAndNerdy (talk) 05:42, 2 June 2024 (UTC)Reply

@Theknightwho I agree with all your points about the problems with Wiktionary (and I think Nicodene's comments were inappropriate). I do not think User:WordyAndNerdy's attempt to get you desysopped soon after Huhu9001's attempt was called for, and I said that at the time; but at the same time it's hard not to notice how multiple times, WAN has made a statement about something problematic in Wiktionary, and expressed a fear of getting subjected to denigration and hostility for expressing this, and someone then proceeded to come out and do exactly that.

As for a more systematic way of resolving conflicts, we definitely need that; but at the same time I don't think there's any appetite for a Wikipedia-style legalistic approach. IMO it has to be more mediation-based than arbitration-based, with arbitration-style "let's lay down the law" as a last resort. I think a good start would be maybe something like this: (1) a more clearly expressed code of conduct that clearly prohibits bigoted remarks, and gives examples of reasonable punishments for transgressions that admins (or bureaucrats if an admin is the transgressor) can make; (2) some sort of "appeal" process if one or the two sides (transgressor or transgressee) feels they're not getting fair treatment or their concerns aren't being heard or addressed. My hope is to avoid long, drawn out processes in the vast majority of cases, because IMO people here don't have the time or energy for this. Benwing2 (talk) 05:43, 2 June 2024 (UTC)Reply

I am in full support of your plan. CitationsFreak (talk) 05:47, 2 June 2024 (UTC)

@Benwing2 I'd like to avoid long, drawn out processes as well, but I'd prefer them over long, drawn out threads where everyone gets angry and nothing gets done. Theknightwho (talk) 06:09, 2 June 2024 (UTC)

I'd support this as well. AG202 (talk) 21:06, 2 June 2024 (UTC)

@User:Benwing2 I think that "bigoted remarks", problematic though they are, are not the source of all the bad behavior that policy needs to address. More common are uses of derogatory labeling of people as, e.g., idiots, morons, drama queens, even when cleverly or humorously worded. The emphasis in establishing a behavioral norm like "No personal attacks" has to be on personal. We may need a total ban on personal attacks (including accusations of Nazism, gender bias, etc). Enforcement of such a ban couldn't be on a hair-trigger, but it would point in the right direction. A single personal attack should require an apology or temporary block; multiple personal attacks, say, over the course of 12 months would earn longer blocks, etc. I'm not sure about how to enforce better behavior by admins and veteran users (and their bots, templates, and modules). DCDuring (talk) 19:42, 2 June 2024 (UTC)

Calling out someone for using racist/misogynistic/etc. language or linking to a neo-Nazi site in an entry isn't a "personal attack." Usually such call-outs are backed up by diffs demonstrating said behaviour. As a community we need to be able to discuss inappropriate conduct in order to effectively mitigate it. You nailed your colours to the mast long ago.[36][37][38] WordyAndNerdy (talk) 20:29, 2 June 2024 (UTC)Reply

Did I say it was? Whatever evil we attribute to such behavior would not justify attacking the person as a Nazi or an advocate of Nazism. We should be calling out the behavior, not the person, no matter what. I am proud to advocate freedom of expression, toleration, and universal coverage of English expressions in Wiktionary based on uniform standards of attestation and idiomaticity, regardless of the source or meaning. DCDuring (talk) 02:35, 3 June 2024 (UTC)Reply

You're off-the-mark on this front, I think, but I do respect you haven't been combative about it, and I do get the sense your take is born of principle. It's why I don't consider you a problem admin even if I regard your thinking as totemic of Wiktionary's systemic issues. Some of my reaction here may be that your initial comment was posted in "mostly unproductive." There's agreement that Nicodene's comments toward me in there crossed a line and -sche putting a lid on that is the main reason I've felt comfortable returning to this discussion.

Protecting individual freedom of expression shouldn't be a pressing concern on a crowd-sourced dictionary project. (Government censorship regimes OTOH can make our mission more difficult). No one's legal rights are infringed by a website setting standards on the type of speech permitted on the site itself. People still have a legal right to express their views on other platforms and in other contexts. Wiktionary is functionally a professional setting. Many employers maintain some type of code of conduct. Letting employees freely spout off their opinions will very likely create a hostile work environment. Our unvarnished thoughts aren't always helpful. I'm sure no one here wants to read my random thoughts on tax reform, ongoing military conflicts, etc. But sometimes uncomfortable conversations are necessary for change to occur or for problems to be rectified. We can't discuss individual user conduct issues if we can't name the specific problems some users present. It isn't a personal attack to characterise someone's speech as "racist" etc. If we treat it as such, all we'll be doing is ensuring that marginalised voices go unheard, as the majority is often resistant to putting its own biases under a microscope.

I also haven't advocated disallowing the inclusion of offensive terms. A lot of Category:English 4chan slang and Category:English incel slang is my work. I do think there are middle-ground interpretations of "Wiktionary is not censored." There is a lot of distance between having [slur] as an entry and including a quote featuring [slur] in some random entry like umbrella. The former is objectively documenting language as it exists. The latter is an unnecessary and inflammatory editorial choice. The Daily Stormer quote shoehorned into smash wasn't for a specifically neo-Nazi/white supremacist sense. It was for a sense that was a synonym of hottie (“attractive person”). This is why I long ago concluded that Fay is an edgelord. Edgelords don't necessarily personally endorse the views they express. For many it's about stirring up trouble for the lulz. But -sche seems to think Fay may be the real deal, and I do trust his expertise in this area. WordyAndNerdy (talk) 04:42, 3 June 2024 (UTC)

Expertise indeed, didn’t he graduate with a PhD in identifying Nazi lexicographers?

Frankly it looks like you’re bullying a person who may not be entirely neurotypical, wielding “they’re an X-ist!” as a cudgel to smash someone you dislike into submission. Nicodene (talk) 11:16, 3 June 2024 (UTC)Reply

@Nicodene: She also underrates that I am not a native speaker and was triangulating new definitions, that I wouldn’t know how specifically saucy or redpilled it is or not. Lacking intercultural competence in an international dictionary. Where would I have looked, amongst all bilingual and monolingual dictionaries, to find all bymeanings and implications, huh, WordyAndNerdy? All was gained inductively, the gold-standard of documenting language for a dictionary. For these movement-kind of words multiple people had to guess around because they were previously uncovered, cf. slam later written in Feb 2022 by me.

And then rather than trying to be edgy I wasn’t too happy to quote that guy so that’s why I hedged and balanced it with other quotes and Wikipedia links for author and publication where you already read “far-right, conspiracy theorist, neo-Nazi, white supremacist, misogynist, Islamophobic, antisemitic, and Holocaust denial”; that was all I could do, save not including it, which wasn’t compelling, since dictionaries nowadays are notoriously not SFW unless defined otherwise, but the quote read so easy and illustrative! And since I had not studied psychology consciously to guesstimate the tribalization programs hinging on references, it was barely possible to be concerned; I say concerned but not bothered because I have only a cognitive simulation of what happens in others, which I note is a bit extreme in WordyAndNerdy.

This day I read p. 63:

> Carter and her colleagues (2012) were interested in investigating the ability of children with ASD to make judgments about pictured social interactions. The pictured scenarios did not require the use of language and both the children with ASD and those with typical development accurately identified the situations that depicted inappropriate social interactions. However, the children with typical development had robust activation in their language processing network when performing the task; they appeared to be spontaneously verbally encoding the information from the scenes that they were viewing. In contrast, the children with ASD had activation in a network associated with the processing of social information but no significant use of neural resources in the language network. This result suggested that the children with ASD were not spontaneously encoding the information into a verbal form.

Well, when I read this social interaction by political content creators, nothing happens and I don’t connect scenes and don’t encode the author’s or my or my publishing platform’s eventual position. Seems like others do but the automated categorization is still likely to be toned down or correlated with better possibilities by reason, and some wokeness courses do away with this capability again, such that people graduate to see intersectional discrimination structures and victimizations everywhere, trouble as a business sector, for which people privately readapt whole personal identities, as it is defined to operate by means of identification. Fay Freak (talk) 13:48, 3 June 2024 (UTC)Reply

I won't dignify this with a response except to note that I have decades of first-hand personal experience of ASD and somehow manage not to use it as an excuse for questionable behaviour. WordyAndNerdy (talk) 21:05, 3 June 2024 (UTC)

What is your excuse, then? For things like making a personal attack on the same page where you’d voted to ban personal attacks. Nicodene (talk) 23:14, 3 June 2024 (UTC)

That's not a "personal attack" by any reasonable standard. It's a thing that actually happened, as is demonstrated by the diff. You need to stop dogging every comment I make. You've already been told that your previous comments toward me in this thread have been out of line. WordyAndNerdy (talk) 01:50, 4 June 2024 (UTC)

The very first sentence of "No personal attacks" reads "Comment on content, not on the contributor." The exact opposite of what you did.

In the list of examples of what constitutes a personal attack, we specifically find:

"Using someone's political affiliations as an ad hominem means of dismissing or discrediting their views."

I should like to add that in this case it's a matter of "imagined political affiliations", since FayFreak has never once that I have ever seen actually expressed the slightest whiff of believing in racial superiority or exterminating undesirables.

I'm curious by what standard you consider anything I have written "out of line" which would not apply just as well to what you have been writing yourself. Nicodene (talk) 02:04, 4 June 2024 (UTC)

Please, it's not worth your time or effort. AG202 (talk) 02:08, 4 June 2024 (UTC)

It’s you who is dogged. The same point stands, whether framed with this concept or not. You can also consider yourself as one of extreme, insane, unhealthy, not to forget wrong. Hyperfocus on the same fiddlestick for five years. Someone with the same neural preconditions as me I would know to tell to stop being autistic; it appears the same “revisiting past points” happens if you answer trauma. Like to reinstate the DMN they rub themselves off on railway tracks, though it be obviously disadvisable.

Other candidates with ASD seclude themselves, keep their interactions brief, out of concerns or actual anxiety of being blamed or missing to react on social information appropriately, depending on their verbal abilities. Looking at the stats, instead of your first-hand anecdotes we can’t revisit (unlike my story which I retell you as far as I remember), with the must-criteria for this diagnosis of impaired social cognition + repetitive and restricted behaviours, most fall just short of schizoid or obsessive-compulsive personality disorder, so they (and me) need to make an actual effort to sidestep avoidant reaction to social input arising from its restricted interpretation. It is artificial though, rather than people-pleasing, and I had to practice it years to see things differently, which includes discussing and defending controversial viewpoints favourable for certain outcomes even if I don’t feel strongly or anyhow at all about them. I am supposed to be controversial. It is sure there would be issues incomprehensible to you, with your inflexible paradigms, if they engaged in politics, which I now have to do as a daily business since graduating law school, which is an exceptional case which you probably haven’t experienced even from hearsay, and you can’t imagine how edgelordy it had to be, for my life. Stats say hubris is greatest among jurists in Germany as compared to all academics (you take your standards from other fields?), again you miss the language and culture barrier on top of the double empathy problem, by which a lot more questionable things appear anyhow if one speaks across continents, and I couldn’t expect to reap a stalker from reciting the Daily Stormer once: kidding of course, don’t get it twisted, it is what AG202 said, not worth your time or effort, though we interpret the result differently. I know you aren’t trying to stalk or concern-troll, I tried to interpret and put into different perspective, again, and enable you.
Extreme viewpoints which one contradicts are super necessary to set benchmarks, very different they appear in me from how you currently deal with them. Fay Freak (talk) 03:02, 4 June 2024 (UTC)Reply

@WordyAndNerdy: I wasn't planning to respond but I noticed you quoted me here. The reasons why I characterized User:Purplebackpack89's posts as an attempt to fan up drama:

Hyperbolic language ("stalking/harassment") which is frankly disrespectful to actual victims of stalking.
Referencing TKW's desysop vote, which has little to do with the current situation (and which also seems to argue against your premise that no action is taken against problematic editors — recall that Dan got indeffed mid-vote) and quoting random comments.
Inflammatory language, viz. "his edit was so bad", "it's likely he's following me around BECAUSE it bothers me.", etc., apparently intended to provoke TKW.

Ioaxxere (talk) 05:19, 3 June 2024 (UTC)

"Wikistalking" is old-school wiki-jargon. "Wikihounding" or simply "hounding" has seemingly replaced it. But it needs to be remembered that PB89 has been around since 2010. It's not unexpected for a veteran editor to sometimes use older jargon. Wikistalking/hounding has never been regarded as a one-for-one equivalent of real-life stalking or even cyberstalking. It's exactly as PB89 has expressed it: combing through someone's edit history, systematically undoing their edits, inserting yourself into unrelated disputes, etc. It might not be intended as antagonistic, but it understandably comes across as such. TKW should ideally seek to moderate his tone and conduct if he wishes to avoid finding himself at the centre of conflicts (he has improved since last year). And mentioning the desysop votes was absolutely relevant. This is not an isolated incident. It's a pattern of conduct. Ignoring past incidents won't do us any favours. WordyAndNerdy (talk) 05:44, 3 June 2024 (UTC)Reply

@WordyAndNerdy PB89 made an absolutely unfounded accusation of harassment towards me for a single RfD of one of his terms combined with a single comment I made about him in another RfD (which is located directly above the RfD I added), in response to a comment of his. This is not the first time he has made unfounded accusations of harassment (and not towards TKW; I reserve judgment on this matter as I haven't looked at it in detail to see what the circumstances were). PB89 seems to think he can shut down criticism of his (IMO often sloppy or ill-considered) edits with such accusations. I should also add, from statements made on his user page, he rejects some core Wiktionary principles such as SOP, and seems to have difficulty understanding why Wiktionary isn't just Wikipedia-lite; so it's not surprising to me that several users feel his edits deserve extra scrutiny. Benwing2 (talk) 05:57, 3 June 2024 (UTC)Reply

I'm going to quote myself from ten years ago (June 2014) because I found this while digging up the roadworn diff and it seems just as relevant today:

Expressing minority viewpoints or being the lone dissenter in an RfD discussion does not constitute disruption. The fact that we have a formal discussion process at all means that the deletion of entries isn't an open-and-shut policy-enforcement matter completely up to the discretion of administrators. It means that RfD is an open forum where people may put forward serious arguments for or against the inclusion of terms and have these arguments weighed on their merits. Sometimes arguments put forward will not align with majority opinion. Sometimes they'll challenge the soundness of our policies. That's good! We need more of that, not less. The exchange of ideas is what discussion is all about. On the issue of "drama," as an outsider who's watched these incidents transpire from the sidelines, I'm not going to disagree that PBP's behaviour has been problematic, or that it needs to change. But the passive tolerance of incivility on Wiktionary is the proverbial elephant in the room here. We don't have formal dispute resolution or mediation processes like Wikipedia, and when incivility occurs and someone gets upset, the general response, in my own experience, is getting told that occasional rudeness and hostility is par for the course and one should learn to deal with it. This is unacceptable. So if PBP has developed a flair for the dramatic, perhaps it's because Wiktionary, lacking any means for addressing civility concerns in a reasonable and orderly fashion, has left PBP no recourse but dramatics. PBP isn't the problem; PBP is a symptom of the problem. Is it really fair to punish someone for a problem that Wiktionary as a whole has helped to create?

WordyAndNerdy (talk) 06:07, 3 June 2024 (UTC)

@User:WordyandNerdy How would you categorize the types of incivility that are not personal attacks? Or are all types of incivility personal attacks, possibly veiled? I am wondering how to give shape to a civility policy beyond the most obvious. Attacking people's unstated (and possibly imagined) values, attitudes, beliefs, or motives is an example of problematic behavior, IMO. On occasion I have resorted to this, but I believe it to be undesirable in a wiki, as well as in many other environments. DCDuring (talk) 12:40, 3 June 2024 (UTC)

@DCDuring Two examples of this are rudeness and passive aggression. We all engage in them sometimes, but they can easily have a chilling effect on productive discourse. I'm not saying we should ban them (which would probably have a much bigger chilling effect), but I do think any civility policy needs to be more nuanced than simply banning overt personal attacks and leaving it at that. Theknightwho (talk) 13:57, 3 June 2024 (UTC)Reply

I am looking for categories of items that are relatively easy to characterize and which have a high likelihood of triggering escalation. Such categories can form the core of undesirable behavior which can be controlled. There are lots of types of incivility that are undesirable, but are hard to police. I think 'rudeness' and 'passive-aggression' are hard to define operationally. We can't start with them or let their existence prevent action on what might be relatively easy to control. My hope is that the basic lessons of the psychology of interpersonal relations can be productively applied here. DCDuring (talk) 14:07, 3 June 2024 (UTC)

Targeting a large number of edits by the same editor all at once is likely to make that editor feel targeted. You talk of basic psychology...basic psychology would suggest that, if a large number of edits (made in some cases over a period of years) are all targeted at once, that I would feel targeted! Anyone probably would! What's the solution here? Spread it out! Instead of targeting all my edits in a period of a few days, maybe take a couple months. Purplebackpack89 14:40, 3 June 2024 (UTC)Reply

I would argue that this discussion should not be a forum for airing personal grievances and settling scores. User talk pages are better for those purposes. When they fail, a mediator's assistance might be warranted. We could at least try to generalize to the matter of how, in an environment of volunteers, a patroller should select entries and edits for revision and how the patroller and 'targeted' patrollee should interact. DCDuring (talk) 15:03, 3 June 2024 (UTC)Reply

"When it rains, it pours." I'd say the airing of personal grievances here was inevitable. When there's been no remedy for problematic user conduct – and when discussions on the subject have fizzled out – the result is feeling unseen, unheard, and unvalued. Such an experience can naturally leave one with a sense of injustice. But I hope that everyone's gotten things out of their system now and we can focus on finding solutions.

I'd say that the question of how to categorise "types of incivility that are not personal attacks" depends on tone, context, and several other factors. If someone is generally in the right but is unnecessarily hasty or severe about it, I'd characterise that as "rude," "short," or "brusque." An example might be someone reverting a poorly-formatted but well-meaning edit by a newbie with "learn correct mark-up!" If they're unnecessarily harsh ("f***king learn mark-up!"), I'd describe that as "abrasive," "hostile," etc. If they assert their own superiority ("learning mark-up isn't hard!"), I'd call that "snide," "condescending," etc.

None of these present major issues in isolation. We're all human and we all err from time to time. It becomes a community problem when it's a pattern of behaviour. That said I don't think it will be necessary for any user conduct policy we create to classify types of incivility with this level of granularity. All of this could be covered by a general advisement to "please try to keep a cool head and remain respectful in discussions".

Where I think we would need to get into specifics is with statements that express antipathy toward characteristics typically covered by human-rights legislation. There's no reason for invective statements to target someone's race, religion, national origin, disability, sex, gender identity, sexual orientation, etc. Of course there'll be disagreement on what crosses the line. Calling someone a "dumb American" would be taking an unambiguous potshot at their nationality. But "'British people don't say 'elevator.' Why are so many Americans ignorant?" is arguably just an unhelpfully cranky statement of the fact English varies between countries. More nuanced incidents will warrant consideration on a case-by-case basis. But a blanket rule against what is generally deemed hate speech is necessary for a communal project like Wiktionary to function (and thrive!). WordyAndNerdy (talk) 22:41, 3 June 2024 (UTC)Reply

mostly unproductive

I support having actual bigotry be punished, not “one of the random usage quotes you found turns out to be from a far-right website”. For all I know one or more of the countless linguists I’ve cited over the years could have been a literal Nazi and I’d have had no idea whatsoever. Should I be put on trial too?

I get that you think I’m “spewing crap” and such but to me this is a genuine concern. I’ve seen more than one community turn unbearably toxic from this sort of thing. Nicodene (talk) 06:53, 2 June 2024 (UTC)Reply

I feel that you being on a far-right website would be obvious. CitationsFreak (talk) 07:03, 2 June 2024 (UTC)Reply

If you’re there to actually read the article or check out the website, yes. It’s never occurred to me to examine a site like that when I’m just there to grab a quick usage example that Google found for me. Now I will, though, out of terror of being accused of “Shoehorning neo-Nazi propaganda into random entries”. Nicodene (talk) 07:31, 2 June 2024 (UTC)Reply

You probably should read the articles, at least the parts that give context for the word. (Plus, FF literally said "[N]eo-Nazis are […] possible contributors – and [quoting them] shows that Wiktionary knows its onions." in Talk:smash, the talk page for the page where he added the Daily Stormer quote.) CitationsFreak (talk) 07:39, 2 June 2024 (UTC)Reply

I see. I don’t agree with hosting (even innocuous) quotes from such people. Nicodene (talk) 08:04, 2 June 2024 (UTC)Reply

No one innocently stumbles onto the Daily Stormer website off Google. Particularly not when Google is likely to filter results in compliance with local laws. WordyAndNerdy (talk) 17:40, 2 June 2024 (UTC)Reply

That is quite literally the only plausible explanation. How else do you imagine he found “smash” used in such a specific sense, at the same time on that site as various others, if not via a search engine? He’s a Nazi with superhuman memory or perhaps incredibly lucky? Nicodene (talk) 21:20, 2 June 2024 (UTC)Reply

I guess it works against political bias as such, not only Nazi linguists. It is not a good idea to rely on the linguistic works of Josef Stalin or Nikolai Marr. But that does not mean that all Soviet linguists are complete garbage, even if they sometimes write some Soviet propaganda in their works. There is a difference between quoting a story by H.P. Lovecraft or Gabriele D'Annunzio and quoting a Nazi propaganda paper: all of them have some political bias, but a Nazi propaganda paper is made for the purpose of propaganda, while a story about monsters or flowers has some other main goal. As Bogdanov-Malinovsky said, it is always possible to find political bias behind any text if you dig hard enough. As for me, when I choose quotes for Russian words, I find it hard to find any well-known Soviet or Russian author not involved in any political ideology at all (except maybe Pelevin or Nabokov). Tollef Salemann (talk) 12:57, 2 June 2024 (UTC)

I just don’t see it, I guess. E’s comment was catty, without a doubt, but how one goes from “he called me a drama queen” to “he hates women” I genuinely don’t understand. Or how one could seriously think FayFreak, of all people, to be far-right. Nicodene (talk) 05:15, 2 June 2024 (UTC)Reply

(e/c) I'm actually mystified in the other direction, how one could have avoided noticing it; his leanings are apparent (and even prove useful sometimes, e.g. that he knew where to look to aid with Talk:ᛦ), although I made clear early on that open Nazis / Nazism isn't welcome on this site. (Beyond that, it's as Benwing says, he makes useful contributions in many areas, and disruptively best-ignored tangents and misdefinitions [see e.g. the recent discussion of Tatbestand, or Talk:negligence per se] in other areas.) - -sche (discuss) 05:20, 2 June 2024 (UTC)Reply

I genuinely read (past and present tense) him as left-wing. Maybe I’m naïve. Nicodene (talk) 06:14, 2 June 2024 (UTC)Reply

@Nicodene: In the past I have not well known how one could read me as anything, social cognition impaired, you know. To jump on the bandwagon of any political wing I would have to go outside or at least expose myself to some community, otherwise my personal weltanschauung at any given time is decidedly idiosyncratic. Now I hit thirty engaging none but my computer screen, library books and professors, not having been on intimate terms with any peer, it is unclear how (in this alexithymia) one could picture, or essentialize, allegiance, anymore than that of a faceless cyberhacker, only because I randomly, eclectically, made my awareness of ideological trench wars, which I never experienced other than by second hand, noted by coloured language, which cannot be extirpated, as by dint of the idiom principle they constitute the metaphors you live by. I don’t live by them, they are just epistemic content providers with signalling hazards for me.

Parents have evinced fatal neglectfulness when the first books I got after elementary school, which was a waste of time as all succeeding one, was some serious crackpot stuff ordered from some physical malvertisement, after which, without any attachment and due to genuine enthusiasm for seeing things from unconventional perspectives, I went fully down all conspiracy rabbitholes and also got the hang of online extremism extensively, while being excluded from school for patterning some bully up (legal loophole in compulsory education, you just make them not want you pending further clarification, may also be rightful self-defence, doesn’t matter, teachers are only right themselves always), though to be fair that was the triggering event for an eventual upcoming diagnosis that would have to lay the foundations for special support (which never took place, had to self-learn), and reading books is good either way, from my experience. There you start general epistemology and language science, because there must be methods to parse and evaluate any information. Without methods I am a non-responder, there are no “such people”, only information and art.

Arguing around is underrated, that’s why I am a jurist, after arguemaxxing since that time, preferably with freethinkers of course. Some write to me though and assume I am lefty for their own convenience and I help them to solve their self-care questions their DMN, trying them to fit into society somewhere, burdens them with, you are not extra. Nazis get “each to their own” as well. They are easy! And what do you think happens if someone approaches me and wants to talk me into the message of Jesus? Normally people just dismiss him but I extend my walk until dusk in order to dissuade him and convert him to infidelity, while equating Jesus and Hitler, the mindsets of such proselytizers, climate activists and terrorists of all kind on multiple occasions. All ideologies are wrong. Their proponents just made unfortunate experiences and now everything they suffer is their bedlam governed by imaginary friends and icons. I can only consent with individual policies, never partisan.

You never fail to conjecture too much into people, so I still expand. It was doubly difficult to read the room if a school assistant (so an apparent pedagogue translates, UK equivalents seem to exist not, unless you substantiate the contrary) engenders an observer effect. Protection layers of abstraction are between me and any input, mirror neurons unknown. It still baffles me how much intertextual association can haunt people. That that much of a social feedback loop experience is for allists, that someone not distancing himself from a Nazi player (which is unmotivated, as laid out, in the way that I don’t distance myself from cows and gavials, it is a different species) is permanent, pervasive, and personal. And your “social circle” is actually infectious, since you respond to social confirmation! I mean, by analogy, if that’s an inappropriate image to you, given that for my configuration social behaviour is not open to intuition but only cognitive simulation, which expressed WordyAndNerdy impudently calls “inane blather”. Apparently being unemotional and putting things into perspective is inane. Fay Freak (talk) 11:05, 2 June 2024 (UTC)Reply

Good Lord. The Daily Stormer is a neo-Nazi website. There is absolutely no ambiguity on that front. Shoehorning neo-Nazi propaganda into random entries is detrimental to Wiktionary's reliability, whether it's as the result of 1) an edgelord trying to stir up trouble, 2) a free-speech absolutist making some self-sabotaging "Wiktionary is not censored" point, 3) or an actual neo-Nazi evangelising their real beliefs. I also did not make "questionable accusations of misogyny" against Equinox. I made one (1) reference to his use of "misogynistic language" in connection to me. I still have the screenshot of him referring to my Wiktionary history as the "tragic tale of the abused wife who comes back" in the Discord last year. I would've supplied that as evidence privately if this matter had ever reached a higher level. But Equinox's departure has rendered a need for that moot. That doesn't absolve Wiktionary of its deep-rooted systemic issues as a community. WordyAndNerdy (talk) 05:19, 2 June 2024 (UTC)Reply

It seems clear from the edit you’ve linked that FF used Google (or Bing or whatever) to gather quotes containing various senses of “smash”, one of which happened to be from that website. For the record, not all of us even know what the Daily Stormer is, and I sure wouldn’t have guessed correctly based on the two sentences Fayfreak got from it. I support removing that quote, to avoid directing any traffic to that site, but I can’t understand seeing so much malice in a simple oversight.

At least with E there is a level of actual malice. The comment was catty, needlessly personal - yes. But misogynistic? Would he have not been catty about it all if you were male? And apparently he thought you yourself were misogynistic for re-adding “cisgender” to the entry for febfem? I just don’t understand any of this. Nicodene (talk) 05:53, 2 June 2024 (UTC)Reply

Getting really sick of being gaslit and told I don't understand what misogyny is as a woman. I'll finish attesting an in-progress entry and then I'm done. I've given enough second chances to this site. WordyAndNerdy (talk) 05:59, 2 June 2024 (UTC)Reply

It's also clear that you didn't even read my comment because it's clear that the "misogynistic language" was a reference to a Discord comment and not the transphobic jibe at febfem. WordyAndNerdy (talk) 06:04, 2 June 2024 (UTC)

I have read your comment as well as his, and I mentioned that the latter was rude and explained how I read it. He was mocking your behaviour, as far as I can tell, not your gender. If it is misogynistic then you could just say how?

The edit summary he left on febfem jokingly says that you hate women. I don’t understand the rest of that at all. Nicodene (talk) 06:11, 2 June 2024 (UTC)Reply

@WordyAndNerdy Please (try to) ignore comments of this sort, if possible. As I said to TKW (using the equivalent Spanish proverb), a closed mouth catches no flies, and (IMO at least) it's best to give no air to people who spew crap. Benwing2 (talk) 06:08, 2 June 2024 (UTC)Reply

My input was specifically invited in this discussion. It's turned into a beatdown, exactly as I feared it would, because some people seemingly don't want to inspect Wiktionary's hostile, corrosive side too closely. There's a reason I only made two oblique references to my gender in my first ten years on Wiktionary. Suffice it to say that existing openly as a woman on the Internet is generally not conducive to positive experiences. This discussion has only served to reaffirm that. I've had others firmly object to my judgements/opinions/etc. before, but I've never been condescended to and gaslit by multiple users like this. I'm tired and disgusted and done. WordyAndNerdy (talk) 06:44, 2 June 2024 (UTC)Reply

“Suffice it to say that existing openly as a woman on the Internet is generally not conducive to positive experiences.” Don’t think so. Most places on the internet are generally not conducive to positive experiences, they would have to be specialized to be otherwise: minorities stick together. “Deep-rooted systemic issues as a community” exactly I see not; though there are ways to feel like it, using loaded language with spicy twang is much better than daily business outside. Wiktionary is goodt. You wouldn’t be here if it were unsafe, it’s not just hope for revenge on mirror-images of abusers, I hope.

You should by now know how men work; they structure their whole livelihoods, and by extension that of others, about enjoying one of their preferred sex two days a week (average) and thus behave differently depending on whether women are present or not, anxious if they are unsure; you are lucky not to have all the T and would wish to go back if you ever found yourself to wake up as a man. In this respect I understand Equinox as, at bottom, sincere in assuring himself that you are, in fact, cisgender. I am sorry for myself I have to talk about humans’ constant rutting now, I have never started this topic, but when we have resolved the whys of its arising it can go away. None of the male sex who does not attempt to expend exceptional empathy owns up to this actual concern without a level of cheekiness, unconscious of it himself. Don’t be condescended, it is an opportunity to set the hierarchies straight, possibly appearing smarter yourself! Cognitive reframing trumps both action-based coping and venting. Indeed I strive to beget positive attitudes and don’t say anything demotivating to people, gaslighting or not from my side (not that I would know; a heart for men and women both anyway). Fay Freak (talk) 07:45, 2 June 2024 (UTC)

@Vininn126 it is irrelevant whether a particular editor is perceived to be "productive" or "sloppy". That shouldn't be an excuse to be combative with them, or escalate things Purplebackpack89 15:51, 2 June 2024 (UTC)Reply

I've had limited but productive interaction with both TKW and WAN. I respect them both as editors and hope they can both find a way to continue editing. Contributing to Wiktionary is a particularly thankless endeavor and I imagine that, like many editors, each has received much less praise than they deserve for their efforts while being on the receiving end of a disproportionate amount of criticism. They, and other editors, have good reason to feel aggrieved and I think that we, as a community, could do a better job of shutting down bad behavior earlier and providing a forum to air grievances where the involved parties could get some perspective from uninvolved editors instead of feeling like they have to personally defend themselves against attacks. I would hope such a forum could provide actionable support for legitimate grievances, perspective for editors who feel slighted by innocuous remarks or edits, and a quick boot for anyone using it in bad faith. JeffDoozan (talk) 00:16, 2 June 2024 (UTC)

I'll be honest. I think too many people here have stopped actually building a dictionary. I don't like that. So I'll be absolutely clear as to my position once, and I sincerely hope that at least some of the people here that are trying to figure out how to emit as much aggression as possible onto unknowns on the internet will find a better hobby.

I didn't become an admin to enforce any rules on "civility" or the like. I simply don't care. I should probably start helping out with closing RFDs and RFVs more often (I have been pretty busy with real-life things, but right now I have a bit more time), but other than that I am a volunteer as much as anyone else on this website, and I don't come here to do busywork I wouldn't even do if I were paid.
- So, basically: If you need a nanny, this isn't a website for you.
- If you get called an idiot, or stupid: Tough luck, you making a BP post on that only proves this statement.
- Actual slurs are a different matter, and we shouldn't tolerate those in any shape or form. Use your head.
We aim to be a full dictionary. We are also a politically neutral dictionary.
- Yes, that means we have entries for slurs, Neonazi slang, communistic formation and whatnot.
- We shouldn't use politically loaded quotes unless necessary, but sometimes they are: 99% of literature written on the territory of modern Russia in languages other than Russian will be loaded with communistic messages, that doesn't mean we shouldn't quote them.
- If anything, quoting anything that shows a capitalistic or religious view (including the Bible even!) should be as problematic as neonazism or communism.
- If you can't handle us hosting such quotes when they are necessary, maybe lexicography isn't your thing.
- If you find something you think wasn't necessary, remember: assume good faith. That's like page one of our whole dictionary. I feel this rule that should be plastered all over the website is forgotten too easily in the last few years. The person adding a neonazistic quote isn't necessarily a neonazi themselves, they may just be lazy and have found this quote before any others. That's why I add communistic quotes for Ingrian, because that's most of the literature, and it's easier for me to just take a book and add quotes word for word than look through the entire corpus hoping to find a sentence where the word "religion" isn't followed by "is complete bollocks".
The recent amount of technical "fixes" has grown out of control.
- Entries go first, templates go second, and markup goes last.
- Going out to change any technical feature of a language you aren't personally in the process of adding entries for should be done only at the request/agreement of the ones that do edit it. In the best case, you will have to re-do these changes later on when an active editor appears, and in the worst case you will lose every single editor that is invested in working on this dictionary at all.
In the end, seriously, I would rather have an editor do constructive work and be a little rude than an editor doing nothing and be the nicest person in the world.
- I'd say 99% (yes, I like that number) of the languages in our dictionary are grossly underrepresented. To give an example: Just today Ingrian (which has an estimated 20 native speakers) surpassed the closely-related Estonian (which has an estimated 1.2 million native speakers) in terms of number of lemmas, and the situation in Africa and Southeast Asia is even worse.
- If an admin is monitoring your edits, it's because you apparently did something wrong. Doesn't mean you're a bad editor, just means you have room to grow. See what was changed and try applying that in the future.
- Now, if you continue to make the same mistakes over and over again, then you'll at some point get the message "Please stop, and if you don't you'll get a block", and at that point you should really stop. We cannot keep fixing your mistakes for you.
- To the admins monitoring: If you tell the people why you're going to monitor their edits, that will probably be more effective than just acting like you're not doing that, or only explaining it after they have completely freaked out.

Maybe let's stop trying to figure out who's right and wrong and start actually working on the dictionary? Does that sound like a plan? In that case, we don't need any conflict resolution, because nobody will offend anyone and nobody will get offended. Sounds like a win-win to me.

Because seriously, what in the world is keeping you from editing so much that you absolutely need me and a few dozen other editors to write this type of enormous text just to solve it? Thadh (talk) 15:59, 2 June 2024 (UTC)Reply

I'd have to strongly agree with a lot here. Maybe not everything, but a lot. I'd like to emphasize that the people who stir up the most mud also seem to do the least editing. Vininn126 (talk) 16:01, 2 June 2024 (UTC)

Here's my 2c, most of which has been said by me or others elsewhere.

Wiktionary tends to be dominated by a relatively small group of "guardians", such as Knightwho, Equinox and Fay
Some of those guardians (again, Knightwho, Equinox and Fay) have problems getting along with non-guardians
The guardians aren't that interested in holding each other accountable
Some of the guardians are OK with driving non-guardians from the project. At least one of them (rather foolishly) stated that publicly.
This is in conflict with one of the base principles of all Wikimedia projects: that anyone can edit them
With great power comes great responsibilities. In exchange for being awarded the blocking tool, admins should be expected to be held to a higher standard than non-admins
There is no deadline. Except for obvious vandalism, there's no need for minor tweaks to be done immediately, nor is there any need for them to be done by any one editor in particular
It's been pointed out quite a few times, by several different editors, that Knightwho has a problem with conflict and escalation (one example being that, when I felt harassed, he just went further and further back into my edits, rather than stepping away)
Remedies have been offered to KnightWho on how to avoid conflict, and he's ignored them

What does this mean in real terms?

De-escalation is a good and necessary thing
If the parties are unwilling to de-escalate, remedies like two-way interaction bans need to be available.

Purplebackpack89 15:44, 2 June 2024 (UTC)Reply

I am going to be perfectly frank. Someone shouldn't be an admin if they aren't willing to enforce user conduct standards. Civility is one of the five pillars on Wikipedia. There is no reason for a load-bearing policy to be entirely absent on Wiktionary except to preserve and enable a toxic culture. Any rank-and-file editor could theoretically do menial maintenance tasks such as closing RfVs. I had a short stint running Word of the Day back in 2012 and I was (and remain) a non-admin. The necessity of admins is not in doing maintenance tasks but in keeping the peace. With the ability to block disruptive users, they might be thought of as a wiki's police. Ideally, blocking shouldn't be the first line of defence. Problem users can be dealt with through guidance, de-escalation, interaction bans, mediation (if such a process existed here). When one of the few woman editors sticks her head above the parapet to speak on her negative experiences, she shouldn't receive gaslighting, condescension, and a stunningly weird and deeply discomfiting jeremiad about how men are too horny to work with women in response. It's impossible to have a serious conversation when this type of rank nonsense is tacitly allowed. Was this thread started to have a discussion about how Wiktionary can create dispute resolution processes? Or is it an exercise in hand-waving and navel-gazing ("Why can't everyone just get along?") without any actual commitment to examining Wiktionary's systemic issues and implementing badly-needed changes? The fact that a civility policy seems slated to be rejected by a landslide beggars belief. I honestly don't think anything is going to change without WMF intervention. The rot has spread too deep for Wiktionary to keep its own house. WordyAndNerdy (talk) 17:17, 2 June 2024 (UTC)

I did hope that it would lead to the former, myself. (In fact, I hoped that we would make a dispute resolution process.) CitationsFreak (talk) 18:08, 2 June 2024 (UTC)Reply

It won't. I went into this discussion skeptical, and it's affirmed every misgiving I had. Even the level heads in the room seem to be taking a hands-off approach. No one wants to be the one to button down and call for change. Tall poppies are smacked down; squeaky wheels are dismantled. Doesn't matter if they've got 14 years of solid work behind them. Preserving a cootie-free space for the boys' club is apparently more important than building a dictionary. Heaven forbid anyone be required to exercise personal restraint in what is functionally a professional setting. That's woke pinko free-speech suppression or something. WordyAndNerdy (talk) 18:47, 2 June 2024 (UTC)

@WordyAndNerdy I made a (bare-bones) proposal above, do you have any thoughts about that? User:CitationsFreak and User:Theknightwho are the only ones who made any comments about it so far. I am trying to find something that will both have some substance in it and work in practice (two aims that aren't easy to reconcile). Benwing2 (talk) 18:59, 2 June 2024 (UTC)Reply

Do you mean this? I'd considered the possibility of a semi-formal mediation process myself. But such a scheme would be just as easy to game as a more legalistic one. Too often subjective judgments inform individual perceptions of a situation. The scale will always be weighted in favour of those with power and the right connections. People are more willing to assume good faith of people they admire and/or consider friends. Which is why I believe an intermediate stage in the dispute-resolution process would be necessary. Problem users (including admins) could be restricted to 1RR and required to bring concerns to the BP to ensure uninvolved eyes assess the situation. We'd need to be comfortable with enforcement being applied asymmetrically in some cases. Sometimes both "sides" in a conflict aren't equally guilty of bad behaviour. An admin who is habitually hostile/antagonising isn't the same as a rank-and-file editor who reacts poorly in an isolated instance. That's a level of nuance more legalistic approaches are generally better at handling. WordyAndNerdy (talk) 19:39, 2 June 2024 (UTC)Reply

@WordyAndNerdy Thank you for your response. I think in general, edit wars should quickly be brought to the Beer parlour; if you get to the point that you've done 3 reverts (or even two), you should stop and bring the discussion to the BP. At least, this is what I've done and I have seen others do the same. We are generally less tolerant of edit warring than Wikipedia is. Maybe something like this can be put into a formal policy. I do agree that sometimes one person will be right and the other wrong, although it's not always apparent to outside admins. As an example, there was a dispute a few years ago between User:Saranamd (aka Tibidibi/Karaeng Matoaya) and B2V22BHARAT. Both users asserted the other was wrong and was edit warring; eventually it was clear that the latter user was in the wrong and was blocked for a week (causing them to leave), but it took a while to sort this out, esp. since there was no admin dedicated to the dispute. I agree in general that any process can be gamed, but having the process is better than not having one at all, and I think maybe a mediation process with a single uninvolved admin could be an intermediate step required before a full legalistic panel. I have read through such panels in Wikipedia, and they're exhausting just to read (much less to participate in, I'm sure). Such panels may be necessary in Wikipedia because they are often caused by underlying real-world political disputes (abortion and other US political issues; the Israeli-Palestinian conflict; a whole host of Eastern European conflicts; etc.). But in my experience these disputes are thankfully less relevant in Wiktionary, where the disputes instead are more on the personal level. I invite others to contribute suggestions regarding what should be considered actionable, what the steps are in the process, etc. Benwing2 (talk) 21:49, 2 June 2024 (UTC)

You're correct that people are often unaware of points of contention outside their own personal experience and knowledge base. That's why it seems integral for project Wiktionary to strive to both invite and sustain a diverse editor base in order to help counteract systemic bias. While I'd personally prefer a more structured ("legalistic") approach, any dispute-resolution process would be a vast improvement on none. WordyAndNerdy (talk) 22:59, 2 June 2024 (UTC)Reply

I am cautiously more hopeful; I read support on the vote page, even from oppose voters, for having a thought-out civility policy; the thing which the vote looks set to defeat is one editor's attempt to win a personal dispute by pushing through a page from 2006 seemingly without even reading/comprehending it enough to notice it still said one of the processes involved notifying Jimbo. I'd like to hope a guideline that doesn't posit "Head Boy of the boy's club should be notified", a modern civility policy written in 2024, is attainable. (I also think ensuring the policy / community has mechanisms for dealing with gaming is a valid and serious concern; on Wikipedia, my anecdotal count is that it seems like about half the trans editors who've dared edit trans topics there have gotten baited/gamed and censured/censored/banned; I think we do need to think about how to write a civility policy that doesn't empower the one or two people taking the stance that someone calling out / disliking Nazism is the one in the wrong.) - -sche (discuss) 18:33, 2 June 2024 (UTC)Reply

I'm also not aware of any openly trans or non-binary Wiktionarians. I'm sure there's a couple but how many want to hang around with all the trans-antagonistic soapboxing that goes on here? Our collection of trans-related terms has seemingly been built primarily by cis people. Imagine if all entries for a language were created exclusively by non-native speakers. How would that shape Wiktionary's coverage of that language in subtle ways? I mean, the general lack of AFAB editors on here is of genuine lexicographical concern. WordyAndNerdy (talk) 20:52, 2 June 2024 (UTC)Reply

While not the same issue, I feel the same way about racial issues. I've been called epithets by users/IPs and had to go on resource dives for showing that the most basic terms are actually offensive, see the history of all lives matter, specifically this edit, for an example. However, one thing I do think I've learned here, for better or worse, is that it's not worth it to get into spats even if you're in the right. It just bogs you down and puts a negative light on you. For myself, I just keep mental track of folks I've interacted with and act accordingly, such as with Equinox. Not worth it to argue anymore. That obviously doesn't work for everyone, and it's not easy, but it keeps me sane on this project, especially after 2022 with the discussions leading up to the creation of WT:DEROGATORY. I just hope that one day this project will be welcoming enough to where we can get actual coverage done for the languages that really need it. AG202 (talk) 21:05, 2 June 2024 (UTC)Reply

Same here. CitationsFreak (talk) 21:12, 2 June 2024 (UTC)

I'm not sure if I'd personally label all lives matter as "offensive." That phrase seems to be employed more as a silencing tactic than a provocation. One might argue it's the racial analogue of not all men. That kind of complexity can be difficult to condense into a context label. I might've offloaded it onto a usage note as happened at TERF. But I'm willing to accept that I've got a large blind spot here. It's definitely good to have a diverse editor pool for this reason. Not everyone is going to catch errors that result from their own limited experience and/or biases. As for continuing to edit despite it all, I'm not sure that's feasible for me, given it's clear I'm unwelcome here. There was a time when it took me more than a year to point out that an editor (not Equinox, to be clear) was habitually inserting inflammatory quotes from manosphere blogs into random entries. I don't have the patience for tying myself in knots trying to explain why that's a bad thing without referencing systemic oppression and prejudice anymore. WordyAndNerdy (talk) 21:54, 2 June 2024 (UTC)

@WordyAndNerdy I'd like to clarify that you are definitely not generally unwelcome. Yes, some contributors have essentially told you to fuck off, but I for one appreciate your contributions. E.g. you have added a lot of info about fandom ships, something I know next to nothing about; from reviewing your contributions, I also see stuff related to non-binary and other gender-non-conforming communities (if that is the right term), social-media memes and trends, and other stuff that's important for keeping Wiktionary up-to-date and representative of all (sub)cultures, not just the dominant one. Benwing2 (talk) 05:13, 3 June 2024 (UTC)Reply

Thank you for the kind words. One of the most gratifying things was randomly seeing "WIKTIONARY HAS SHIP NAMES???" in a tweet. Knowing my work is being referenced by people outside the fandom sphere is cool. WordyAndNerdy (talk) 06:13, 3 June 2024 (UTC)

I can think of at least two who have openly identified themselves; I'm sure -sche knows of more. I'm not sure however if either of the people I'm thinking of have contributed to trans-related entries. One used to be one of the most active contributors, esp. for bot-related work, but left for reasons that (I think) are at least partly unrelated to their trans status. The other is still active but has stayed away from this discussion. Benwing2 (talk) 21:20, 2 June 2024 (UTC)

Nor are they required to engage in this discussion in an "any marginalised individual in a group is required to serve as a spokesperson" kind of way. I just think it would be nice to have more LGBT editors onboard to help counteract systemic bias. As rewarding as it has been documenting trans-related coinages on Wiktionary, it can feel like talking over actual trans people or treating them as anthropological curiosities at times. WordyAndNerdy (talk) 22:14, 2 June 2024 (UTC)

If Wiktionary really is a "boys' club", may I suggest you take the first step to improve this state of affairs by de-sysopping yourself, having been one of the boys in charge for years now? "Walk the walk", as they say.

For the record I don't buy it. A perennially catty user (Equinox) being catty to yet another person is not because they're a woman, it's because they're just another person. FayFreak is not a Nazi whatsoever, he's a "free speech" champion. You disagree with him, I disagree with him as well – the difference is you see burning malice where I see a kind of optimistic naïveté. Nicodene (talk) 22:15, 2 June 2024 (UTC)Reply

What part of "I was (and remain) a non-admin" do you not understand? Would be really nice if you actually followed this discussion instead of shadowboxing against things that no one said. WordyAndNerdy (talk) 23:11, 2 June 2024 (UTC)

What part of my replying to -sche, not you, do you not understand? Nicodene (talk) 23:19, 2 June 2024 (UTC)

Then use @ to make it clear to whom you're speaking because this thread is playing fast and loose with indentation. WordyAndNerdy (talk) 23:23, 2 June 2024 (UTC)

Your hostile remarks toward -sche are also completely unwarranted. Maybe sit this one out if you're just gonna throw peanuts from the gallery. WordyAndNerdy (talk) 23:26, 2 June 2024 (UTC)

Basic reading comprehension on your part is not my responsibility. How "[you've been] one of the boys in charge for years now" could possibly be construed as being about you is beyond me.

I don't think what I've said (and I stand by it) comes anywhere near frivolously accusing someone of Nazism. If you'd like to apply your own apparent standards for hostility to yourself and "sit this one out", I'll be happy to follow suit. Nicodene (talk) 23:34, 2 June 2024 (UTC)Reply

Can we please de-escalate here? —Justin (koavf)❤T☮C☺M☯ 23:42, 2 June 2024 (UTC)

Feel free to start a de-sysop vote for me, but something tells me your idea of what an admin should or shouldn't want or have to do is not the community consensus. Thadh (talk) 18:36, 2 June 2024 (UTC)

If we subtracted all of the statements in this discussion that themselves were about individual persons' values, attitudes, and beliefs, including defensive reactions, we would have a very short discussion indeed. I don't see that most of the discussion here is contributing to the topic-creator's concerns or even to an improvement of that statement of concerns. DCDuring (talk) 12:40, 3 June 2024 (UTC)Reply
I completely agree with you. Theknightwho (talk) 13:59, 3 June 2024 (UTC)

how to identify locations in audio snippets of minority languages?

I am cleaning up the captions of audio snippets, and I've come across an issue that needs discussion. Sometimes if the audio file refers to the location where the language in question is a minority language, the file identifies the location using the minority language's preferred name instead of the common English name (which is usually based on the majority language). Examples:

There are 1,179 snippets for Palestinian Arabic as spoken in Lod, Israel, which identify it using the Arabic name al-Lidd.
The audio for the Northern Kurdish term emerîkî comes from Van in Turkey but originally identified it using the Kurdish name Wan. (In this case I changed it to Van before the wider issue became apparent.)
There are 5-6 Northern Kurdish terms from Diyarbakır that identify the location as Diyarbakir (note the two i's in the spelling), using the Kurdish form of the same name, and one that identifies it as Amed, using the normal Kurdish name. (Note, in this case, the form Diyarbakır is a Turkified name adopted in 1937; the older form in Turkish was Diyarbekir, from Arabic.)

I'm sure there are others, but these are the most politically fraught ones I've come across. The questions are:

Should we use the common name, as Wikipedia does (the above cities are found under Lod, Diyarbakır and Van, Turkey) or defer to the minority language's name?
If we defer to the minority language's name, do we do this only in certain cases (e.g. ones that are politically fraught)? (I bring this up because e.g. Navajo names of places tend to be radically different from the corresponding English ones, cf. Window Rock, Arizona vs. Navajo Tségháhoodzání and I think it would be confusing to use the Navajo names.)
What about accent marks not typically found in the common English name? E.g. there are hundreds of Vietnamese audio snippets that currently use the spellings Hà Nội and Hồ Chí Minh City, which I've changed to Hanoi and Ho Chi Minh City in accordance with the common English names.

Benwing2 (talk) 04:22, 2 June 2024 (UTC)

~~This is the sort of thing that AI should be good at doing. —Justin (koavf)❤T☮C☺M☯ 04:29, 2 June 2024 (UTC)~~

@Koavf I don't get what you're saying at all. Maybe you're misunderstanding my questions? Benwing2 (talk) 04:49, 2 June 2024 (UTC)

I can be ignored here. Sorry. —Justin (koavf)❤T☮C☺M☯ 04:58, 2 June 2024 (UTC)

For Navajo and other Native languages, my gut reaction is: if the entries currently use Navajo names, then either just continue to use the Native name, or list both ("Tségháhoodzání / Window Rock" or vice versa). Perhaps not in that specific case, but in the case of some other Native placenames, the nearest semi-applicable English name may have different scope/boundaries (or it may be unclear where the Native placename was, although this is probably not going to be a problem with audio files), so retaining the Native name seems useful. Slashing both would be a lot to type, but this might be mitigated if the template/module drew on T:a-et-al and so e.g. "Tségháhoodzání", "Window Rock", and optionally some even shorter name like ~"nv-TG", could all be aliases...? Pinging User:Eirikr for your thoughts.
For Palestinian Arabic, renaming cities to Israeli names indeed feels way too loaded, and for my part I would not support it. (If we have audio samples from "Bakhmut, Ukraine", does there come a point at which it's been occupied long enough that we change them to "Artyomovsk, Russia"? Ehhh...) For diacritic differences like the Vietnamese examples, I'd be inclined to use the common English form; that seems like another place where it could be useful if the template/module could know Hà Nội was an alias of Hanoi and display "Hanoi" when given the input "Hà Nội". - -sche (discuss) 06:06, 2 June 2024 (UTC)Reply

@-sche The template does use {{a}} for this purpose so a lot could be done with aliases, although I'm not sure it would make sense to have slashed names in most circumstances. (The Navajo example I brought up is theoretical in any case; AFAICT none of the Navajo audio files identify any place name at all, although many say "Audio (NV)", which I am tempted to delete because it seems to convey no useful info. Similar issues occur with "Audio (AF)" for Afrikaans, "Audio (CS)" for Czech, "Audio (KN)" for Khiamniungan Naga [KN is the country code for St. Kitts and Nevis, which is nowhere near India :) ...], and "Audio (BCL)" for Bikol Central = lang code bcl.) The issue with Lod, as with all Israeli/Palestinian issues, is very complex and fraught; the reason I brought up this example in particular is that Lod is not internationally considered occupied and AFAICT the term "Lod" does not have the sort of political baggage associated e.g. with terms like Judea and Samaria, so it may not be parallel with the case of Bakhmut or with cities in Gaza and the West Bank, which unquestionably should use Arabic language names. Maybe a more parallel example is Lviv, formerly a Polish city known as Lvov; if we somehow had Polish audio from this city, it might make sense to use a slashed form Lviv/Lvov, and similarly here maybe Lod/al-Lidd? Same thing might apply to Jerusalem/al-Quds? (The status of this city is even more convoluted and intractable but since the common name in English is "Jerusalem" and most readers won't be familiar with "al-Quds", I think it would be confusing to only say "al-Quds".) For that matter, maybe this approach is tenable also for the Northern Kurdish terms I mention above. Benwing2 (talk) 06:39, 2 June 2024 (UTC)Reply

OK, I seem to have reversed myself from what I said at top. Benwing2 (talk) 06:40, 2 June 2024 (UTC)
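
The alias idea raised above (a template or module that knows Hà Nội is an alias of Hanoi and displays the latter) could look roughly like the following. This is only a sketch in Python, not how {{a}} or any existing module works: the table name, the function names and the "nv-TG" shortcut are hypothetical, and the display forms chosen here are placeholders for whatever the discussion settles on.

```python
import unicodedata

def _key(name: str) -> str:
    # Normalize to NFC so that precomposed and decomposed spellings
    # of the same accented name hit the same table entry.
    return unicodedata.normalize("NFC", name.strip())

# Hypothetical alias table: caption input -> form shown to the reader.
# The slashed Navajo form is one option floated in the thread, not a decision.
_RAW_ALIASES = {
    "Hà Nội": "Hanoi",
    "Hồ Chí Minh City": "Ho Chi Minh City",
    "Tségháhoodzání": "Tségháhoodzání / Window Rock",
    "Window Rock": "Tségháhoodzání / Window Rock",
    "nv-TG": "Tségháhoodzání / Window Rock",
}
ALIASES = {_key(k): v for k, v in _RAW_ALIASES.items()}

def display_location(name: str) -> str:
    # Unknown names pass through unchanged, so nothing is renamed
    # unless an editor has explicitly added an alias for it.
    return ALIASES.get(_key(name), name)

print(display_location("Hà Nội"))
print(display_location("nv-TG"))
print(display_location("Lod"))
```

Keeping the mapping as an explicit table rather than stripping diacritics automatically means each contested name remains an editorial choice, which seems to be what the questions above are really about.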

Lvov is the Russian name. The Polish name is Lwów. There is surely some English dialectological study of Palestinian Arabic, where the Jerusalem dialect has some name. If it is called Al-Quds, I would rather go for using Al-Quds, because that is how this dialect is known in the English books about the Palestinian dialects. But nobody's gonna refer to the Moscow dialect of Russian as "Moskva", because the English books on Russian dialectology surely use "Moscow" as the name of this dialect. On Diyarbakir, we should check what some English books on Kurdish dialects call this dialect. Tollef Salemann (talk) 19:33, 2 June 2024 (UTC)

@Tollef Salemann Thanks, my mistake. If you know of any books dedicated to Palestinian or Kurdish dialects, feel free to list them. I would guess that the more well-known a place is, the more likely the common name will be used (as you note with Moscow vs. Moskva, etc.). Benwing2 (talk) 21:51, 2 June 2024 (UTC)

Thanks for the ping, but I'm not sure I have any useful input here. Cheers! ‑‑ Eiríkr Útlendi │Tala við mig 22:08, 5 June 2024 (UTC)

We should use whatever the literature does, which will probably be the language's own name. Thadh (talk) 07:32, 2 June 2024 (UTC)

@Thadh I actually suspect it will vary greatly depending on the individual author. It's hard for me to believe there will be any discernible standard here. But I may be wrong. Benwing2 (talk) 08:28, 2 June 2024 (UTC)

Of course it will vary, but there will probably be an overall tendency to prefer native words over local words or the other way around. Thadh (talk) 08:32, 2 June 2024 (UTC)

Agree with Thadh. Now we need to find all the English books about Palestinian and Kurdish dialects. Tollef Salemann (talk) 19:36, 2 June 2024 (UTC)

Dealing with controversial quotes

In a bid to end the discord concerning the addition of quotes that disseminate objectionable political etc. views, I would like to draw everyone’s attention to a recent discussion in which User:Geographyinitiative said he favors adding controversial quotations in the Citations namespace, which he deems a safe haven for such quotes which may not be suitable for adding in the dictionary entry. I, on the other hand, held the opinion that we could consider adding a note of disclaimer stating that Wiktionary does not endorse any of the views expressed in any quotes and they are for educational purposes alone (in this case however there’s the problem of cluttering up the dictionary page, so the note probably could be put in the mainpage?). Alternatively, as a marriage of the twain ideas, we could as well resort to adding every controversial or inappropriate quote soever in the Citations namespace along with the said note of disclaimer put at the top of the Citations page using a template.

I think any of these ideas will be an attractive option if some people get so triggered by quotes bearing controversial POVs. Just my tuppence, thank you. Inqilābī 22:05, 2 June 2024 (UTC)

I thank Inqilābī for the above comment, and I will say that I do not anticipate there is any negative outcome from this discussion from my view. I am fine with any note of disclaimer as proposed. Even if every Citations page I have worked on were deleted, I'm still okay. However, one among many uses for the Citations page seems to be to catalogue "fringe" material in a way that people can see it without it being right on the entry. There are other reasons for a Citations page. But I consider it one of the uses. For instance, the users here like to analyze some wild racist words from dangerous evil blogs. That material seems so vile and repulsive to me that no note of disclaimer could fix it. But there should be some venue for the material given the "descriptivist" stance of the dictionary, so Wiktionary "throws it in the hole" (the Citations page) so you can consult that if needed. There are numerous other uses for Citations pages including: a place for inter-sense citations or citations of uncertain sense (the 1966 and 1975 citations for Citations:transgender), a place for re-organizing senses or analyzing contexts, a place for cites of little importance or value for the entry proper, a place for words with only two acceptable cites so far (Citations:intercessionate), a staging area for a potential future entry if conditions permit (Citations:Pinghai), etc etc. The Citations page doesn't have to meet the standards of the entry proper, and stuff is less likely to be deleted there. But I tell you, some of the soul-scarring shit I've seen on the Citations page could NOT be solved by any note of disclaimer. It would HAVE to be deleted from the entry proper, regardless of anything, IMO. The Citations pages create distance between some of the most evil authors' evilest sentences I've ever seen and Wiktionary's entries, while simultaneously remaining true to the purist descriptivist mission.
Wiktionary will not be allowed to exist if it puts those sentences on the entry proper. --Geographyinitiative (talk) 22:18, 2 June 2024 (UTC)

Thank you for the elaborate reply Geographyinitiative. Just for the record, the main reason I wrote this post is due to disputes involving other editors, and not because of my RFD nomination that day. I would also like to maintain that I do not advocate deleting every Citation page, I understand your reasoning. Now if other editors overwhelmingly agree that such quotes can be thrown and kept secure in the Citations bin, then my suggestion of a disclaimer can be ignored. Inqilābī 22:34, 2 June 2024 (UTC)Reply

The "citations page containment zone" idea was floated back in 2022 and was not well-received for all its merits. WordyAndNerdy (talk) 23:22, 2 June 2024 (UTC)

IMO, quotes espousing controversial or bigoted viewpoints should be limited to terms that are themselves associated with such viewpoints. If we stick to this, it shouldn't be necessary to have a cordon sanitaire like putting them in the Citations page, because the terms themselves will normally have (or certainly should have) labels indicating that they are controversial, offensive, etc., which clues the reader into the fact that the quotes (which are hidden by default) may express such viewpoints. Benwing2 (talk) 23:44, 2 June 2024 (UTC)Reply

Does this include things like using a quote from a racist speech on a word that is related to racism? CitationsFreak (talk) 23:53, 2 June 2024 (UTC)

I would think so in general. What is the example you're thinking of? What to me isn't appropriate is e.g. User:WordyAndNerdy's example of an incel-type quote added to the word roadworn, since there's nothing about this term that relates specifically to the incel community or any other controversial viewpoints. Benwing2 (talk) 23:59, 2 June 2024 (UTC)Reply

I do not have a good grasp of policy. I'm just trying to 1) protect Wiktionary while 2) allowing the purist descriptivist mission to flourish. So my view does create a cowardly "semi-censored" and "self-censored" aspect to the project. It's not a good solution. But we exist in a society, and I guarantee Wiktionary could be snapped like a twig if it crossed the wrong lines. One device we can use to assuage people is say "hey it's not on the entry". Basically this applies to "fringe" content, so you just have to judge it for yourself. --Geographyinitiative (talk) 00:14, 3 June 2024 (UTC)Reply

Fundamentally, it boils down to "don't use a controversial quote unless you absolutely have to"

Don't use controversial quotes to talk about editors or about real people
If a word has three non-controversial quotes, use those three

Purplebackpack89 01:00, 3 June 2024 (UTC)

No, it doesn’t. We do things that we don’t have to because it comes out optimized or more illustrative, rather than absolutely necessary. Don’t do things I “need to”, for example I don’t need creatine monohydrate but probably still benefit from it. And you wipe off the issue how controversiality is inferred and portrayed; in isolation, the Daily Stormer quote wasn’t the same as the site in general, but someone pushes a stance about the whole resource. Fay Freak (talk) 01:20, 3 June 2024 (UTC)Reply

Like the lock in {{R:OED Online}} “paid subscription required” (which I wanted elsewhere, for legal databases I quoted from) we could have a symbol and tooltip warning about “low factuality”. As on Ground News but less regularly. Nothing too regular since we generally shan’t consider any sources controversial inasmuch they are used for their language (which in rare cases itself is trolling), we already have a contradiction here and a lot of cognitive capacity is wasted for evaluating sources. “Incel-type quotes”? Am I supposed to waste my energy to say anything about these people?

Yet still Geographyinitiative does not recognize content generated via AI by the Chinese propaganda department snuck in as quotes, about which in some cases I have insider knowledge. Academic databases are littered with automated language, and in the former cases “publishing” takes place via PEMT. I invite everyone to search "gullible Bayes".

At least for random Neo-Nazis we know they are real people putting in the effort, and back then I also reasoned that this human language has durability, since Mr. Anglin has still not been downed from the internet despite all the efforts. AI imitates Mr. Average, and avoids controversial statements, think about it. Fay Freak (talk) 01:20, 3 June 2024 (UTC)Reply

@Fay Freak: The AI as a technology is perfectly capable of generating hate speech and Nazi propaganda [39][40]. It's just that the big players in the AI industry are making efforts to suppress this in their own products. But the technology can't be stopped and it is available to anyone. There may be already a lot of AI generated Neo-Nazi content in the net. So I wouldn't just blindly assume that every Neo-Nazi content is human generated and thus has some kind of linguistic relevance. --Ssvb (talk) 16:04, 3 June 2024 (UTC)Reply

@Ssvb: I don’t blindly assume it, but there are a number of reasons against the existence of formally plausible versions of it, apart from the circumstance that I have not stumbled upon it despite searches of the most heterodox things and following the upcoming trends in politics, which are warily tracked by hostile journalism more than anything if coming from this end. AI-generated article images of white families or the like appear, but we mean the texts. Currently everyone suppresses it, the hard cores of Neo-Nazis are too dumb or ideologically averse for targeted computer-generated content, and manual labour is too cheap and worth it for them: Like Kremlebots are real people sitting at a known address in Saint Petersburg. And it does not work: As the neuronal networks are trained on some old averages, even if it be biased content, and then have so-called model decay, they don’t hit humans where it hurts, they would have to have intricate understanding of current connotations of ideological concepts in order to reframe personal identities of people. You don’t change people’s worldviews with AI, though you can promote specific assumptions.

It’s a general problem in education, too. AI programs very much but teaches programming very little and human teachers will always exist and be preferred by totalitarian systems as well, and our dictionary be human-made because we explain politics, philosophy, psychology etc. Fay Freak (talk) 16:45, 3 June 2024 (UTC)Reply

@Fay Freak: "the hard cores of Neo-Nazis are too dumb" - this is a very questionable claim and I wouldn't count on that. Additionally, dumb people tend to make grammar and spelling mistakes, so this reduces the value of their content for Wiktionary. And some of them are not even native speakers. For example, I wouldn't consider Anders Breivik's manifesto to be a valuable example of written English. --Ssvb (talk) 19:07, 3 June 2024 (UTC)

My approach, as I said in the 2022 discussion linked above, is: "we can (and do) already move un-illustrative, including unnecessarily offensive, quotes to Citations: pages if they're needed for WT:ATTEST. (If they're not, like someone is adding racist screeds as cites of and, just replace them with normal cites and block the user if needed.) This does lack a reader-facing warning [...] but eh, that probably reduces the amount of bad-faith or even good-faith debates over whether a quote is "really offensive" that a content warning would attract." We already see trolling about "they're not white supremacists, they're white racialists / race realists" etc etc: any "this quote is offensive"/"we don't agree with this quote" notice would just be a magnet for endless disputes. And do we apply "this quote doesn't represent our views" to quotes that express e.g. old or modern flat-earth or geocentric views, i.e. views that aren't really offensive but which nonetheless aren't Wiktionary's views? It's a morass we needn't create. Indeed, I'm not sure there's actually a problem here in the first place? AFAICT what I outline is what is broadly already done; is anyone actually going around and adding citations of Mein Kampf to und and der (and not immediately being reverted), is there an actual issue happening...? - -sche (discuss) 01:26, 3 June 2024 (UTC)Reply

Maybe the age of a quotation also plays a big role? Being old gives it at least a historical value. So that the ancient "flat-earth" theories are okay, but modern "flat-earth" theories - not so much. The former are likely to be honest mistakes, while the latter are likely to be the work of nutcases. Also if the readers see that a quotation is older than maybe 1950, then they can figure out themselves that it's unlikely to present a relevant up to date scientific information even without any extra disclaimers. For example, I added this quotation recently, which is stating something that is possibly not true nowadays (and possibly even debatable back in 1916). But does anyone really care? --Ssvb (talk) 18:49, 3 June 2024 (UTC)Reply

I pretty much agree with -sche here. I am not sure if this problem really merits a whole policy to tackle it, it's really a problem of common sense.

If an offensive quote does not add lexicographical value compared to a non-offensive quote, don't use it, or feel free to replace it with a more neutral quote (even if only because it is a waste of everyone's energy building this communal project to be bogged down in disputes over offensive quotes, or what constitutes offensiveness).

If it does add value (such as in illustrating firsthand the usage of offensive words, or of offensive senses of otherwise unoffensive words) or there are no good unoffensive candidate citations available in durably archived sources, feel free to use an offensive quote within the limits of reason. The guidelines that apply at WT:USEX ("Be friendly", particularly) already codify this for usage examples, fwiw. If we really want, we could expand WT:Quotations#Choosing quotations with a few (permissive) lines to the same effect, I wouldn't be opposed to that.

(I would on the other hand be opposed to disclaimers in mainspace indicating that a quote may be considered offensive, and I do not think that quarantining potentially offensive quotes in the Citations namespace is necessary as long as the principle of least offensiveness is followed wherever offensive quotes do not add any lexicographical value.) — Mnemosientje (t · c) 14:43, 5 June 2024 (UTC)Reply

Announcing the first Universal Code of Conduct Coordinating Committee


You can find this message translated into additional languages on Meta-wiki. Please help translate to your language

Hello,

The scrutineers have finished reviewing the vote results. We are following up with the results of the first Universal Code of Conduct Coordinating Committee (U4C) election.

We are pleased to announce the following individuals as regional members of the U4C, who will fulfill a two-year term:

North America (USA and Canada)
- –
Northern and Western Europe
- Ghilt
Latin America and Caribbean
- –
Central and East Europe (CEE)
- —
Sub-Saharan Africa
- –
Middle East and North Africa
- Ibrahim.ID
East, South East Asia and Pacific (ESEAP)
- 0xDeadbeef
South Asia
- –

The following individuals are elected to be community-at-large members of the U4C, fulfilling a one-year term:

Barkeep49
Superpes15
Civvì
Luke081515
–
–
–
–

Thank you again to everyone who participated in this process and much appreciation to the candidates for your leadership and dedication to the Wikimedia movement and community.

Over the next few weeks, the U4C will begin meeting and planning the 2024-25 year in supporting the implementation and review of the UCoC and Enforcement Guidelines. Follow their work on Meta-wiki.

On behalf of the UCoC project team,

RamzyM (WMF) 08:15, 3 June 2024 (UTC)Reply

"ux" template


I now religiously (well, most times) use the "ux" template for usage examples, since it is what I see others have done, but since this is no easier (in fact actually more to type) than not using it, I wonder whether anyone could explain what the actual advantage is, if any? Mihia (talk) 17:41, 3 June 2024 (UTC)Reply

As opposed to plain wikitext? Category. Same with {{co}}. Vininn126 (talk) 17:46, 3 June 2024 (UTC)Reply

By "category", do you mean that it puts the article in the category "English terms with usage examples"? Not that I am really complaining about typing a couple more characters to use "ux", it's not a big deal, but out of curiosity I wonder what use to anyone or anything is such a category? (A category for articles without usage examples I could understand.*) Mihia (talk) 17:57, 3 June 2024 (UTC) -- (* or, actually, a category for definitions without usage examples would be more useful, since an entry could have ten definitions, only one of which had a usage example, yet still, as far as I gather, show up in "terms with usage examples")Reply

A big underappreciated advantage is that the "ux" and "quote-book" templates are machine readable. This allows easily doing various kind of automatic processing. Yes, it's possible to find terms with missing usage examples if you are interested in that. --Ssvb (talk) 18:06, 3 June 2024 (UTC)Reply

I use it quite often to see what entries still need a usex in the languages I edit. I think others do, too. It's similar to the "English terms with quotations". Thadh (talk) 18:22, 3 June 2024 (UTC)Reply

How do you use category "terms with usage examples" to find entries that don't have usage examples? Mihia (talk) 18:33, 3 June 2024 (UTC)Reply

I compare it to the other category. Thadh (talk) 20:55, 3 June 2024 (UTC)Reply
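The comparison described above amounts to a set difference between two category listings. A minimal sketch, assuming "the other category" is a full list of a language's lemmas; the function name and the sample page titles are hypothetical, and fetching the real category members is left out:

```python
def missing_usage_examples(all_lemmas, with_usex):
    """Return entries present in the full lemma list but absent from
    the "terms with usage examples" list, in sorted order."""
    return sorted(set(all_lemmas) - set(with_usex))


# Hypothetical category contents, for illustration only.
lemmas = ["apple", "banana", "cherry", "damson"]
has_usex = ["banana", "damson"]

print(missing_usage_examples(lemmas, has_usex))
```

As noted earlier in the thread, this works per entry, not per definition: an entry with ten senses and one usage example still counts as having one.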

For non-English languages, {{ux}} is required for the text to be tagged in the correct language for e.g. screen readers or other automated software. — SURJECTION ^{/ T / C / L /} 19:15, 3 June 2024 (UTC)Reply

Not to mention script and font. Thadh (talk) 20:56, 3 June 2024 (UTC)Reply

I mentioned these points at https://en.wiktionary.org/wiki/Template:ux/documentation. Anyone wants to amend or add to this, please go ahead. Mihia (talk) 21:02, 3 June 2024 (UTC)Reply

`{{etymon}}`


I wasn't quite aware of the intended scope of this. Apparently it's to be an all-in-one etymology template, subsuming the functions of {{affix}}, {{inherited}}, {{etymid}}, etc.

Its current syntax strikes me as more than a bit unintuitive, and I'd like to propose a somewhat more user-friendly way of going about it:

cleverly: {{ety|en:ID|clever:en:ID|-ly:en:ID}} "From clever + -ly".
- Note that the first ID is for cleverly, the second for clever, and the third for -ly.

charity: {{ety|en:ID|charitee:enm:ID}} "From Middle English charitee."

furlough: {{ety|en:ID|verlof:nl:ID}} "From Dutch verlof".

монтировать (montirovatʹ): {{ety|ru:ID|montieren:de:ID|-овать:ru:ID}} "From German montieren + Russian -овать (-ovatʹ)".

For categorization purposes, the default assumptions would be as follows.

If all the language codes match (i.e. it's a language-internal formation): compounding, suffixation, prefixation, or confixation. That can be automatically determined by hyphens: yass + -ify is suffixation, neuro- + -genic is confixation, etc. Other types of derivation can be specified with an additional parameter like |blend=1 or |deverbal=1.

If the language codes do not all match: "English terms derived from Dutch", etc. For mixed cases like the aforementioned монтировать, nonsensical categories like "Russian terms derived from Russian" would of course be disabled. More specific types of relation can be expressed with an additional parameter like |bor=1, |inh=1, |calque=1, |conflation=1, and so on.
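The default categorization rules above can be sketched roughly as follows. This is only an illustration of the proposal, not the behaviour of any existing module: the function name, the small language-name table, and the returned strings are all assumptions, and edge cases such as hyphen-final roots (e.g. PIE *peh₂-) are not handled.

```python
# Hypothetical code-to-name table, just enough for the examples.
LANG_NAMES = {"en": "English", "nl": "Dutch", "de": "German", "ru": "Russian"}


def classify(target_lang, parts):
    """parts is a list of (lang_code, term) pairs for the etymons."""
    langs = [lang for lang, _ in parts]
    if all(lang == target_lang for lang in langs):
        # Language-internal formation: decide by where the hyphens sit.
        has_prefix = any(term.endswith("-") for _, term in parts)
        has_suffix = any(term.startswith("-") for _, term in parts)
        if has_prefix and has_suffix:
            return ["confixation"]
        if has_suffix:
            return ["suffixation"]
        if has_prefix:
            return ["prefixation"]
        return ["compounding"]
    # Mixed or foreign sources: one "derived from" category per foreign
    # language, skipping nonsense like "Russian terms derived from Russian".
    categories = []
    for lang in langs:
        if lang == target_lang:
            continue
        name = f"{LANG_NAMES[target_lang]} terms derived from {LANG_NAMES[lang]}"
        if name not in categories:
            categories.append(name)
    return categories


print(classify("en", [("en", "yass"), ("en", "-ify")]))
print(classify("en", [("en", "neuro-"), ("en", "-genic")]))
print(classify("ru", [("de", "montieren"), ("ru", "-овать")]))
```

Extra parameters such as |blend=1 or |bor=1 would override these defaults; they are omitted here to keep the sketch short.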

This strikes me as a reasonably straightforward way to handle things.

Thoughts, objections, or alternative suggestions?

Paging @Ioaxxere as the person who made the template and @Vininn126, @Rex Aurorum, @Qwertygiy, @Akaibu, @Biolongvistul, @Protegmatic as people who have used it. Nicodene (talk) 21:48, 3 June 2024 (UTC)Reply

Condensing the language and the ID parameters is very agreeable. As for the reshuffling in the etymon slots, it disrupts the ascending hierarchy of specificity and would not prove any easier to internalise to me.

The semantic austerity of the af keyword is, I dare to assure, a temporary solution. We don’t even have categorisation implemented yet. ―⁠Biolongvistul (talk) 22:19, 3 June 2024 (UTC)Reply

Could you explain what you mean by ‘ascending hierarchy of specificity’? Nicodene (talk) 23:25, 3 June 2024 (UTC)Reply

Broadest first, most specific last, as in taxonomy for species. I believe it's mostly a happy coincidence that it's implied with the current syntax using the "greater than" symbol. Language > term > sense.

The rest of the proposition I don't believe I quite understand. Syntax like "bor|fr>unité>to unite|af|en>-ed>past participle" for "Borrowed from French unité (“to unite”) and suffixed with -ed (“past participle”)" feels intuitive enough to me. Qwertygiy (talk) 23:41, 3 June 2024 (UTC)Reply

@Ioaxxere I have been meaning to respond to another thread about adding manual transliteration into {{etymon}}. The obvious way to do that is through inline modifiers; in that respect, the choice of > as a separator is singularly unfortunate as it prevents use of inline modifiers with the normal <...> syntax. I would recommend changing this to something else; for example, the {{given name}} template uses < to indicate inheritance, but requires that spaces be put around the < sign, which allows concurrent use with inline modifiers. You could also use ^, @, etc. Benwing2 (talk) 00:11, 4 June 2024 (UTC)Reply

BTW if you need help changing this, I can do this easily by bot. Benwing2 (talk) 00:12, 4 June 2024 (UTC)Reply

@Benwing2 I don't think it does prevent the use of < >, as it's not actually ambiguous, but I could see it being confusing (though no more than template syntax). Theknightwho (talk) 00:37, 4 June 2024 (UTC)Reply

I suppose you may be right, I need to think if there are any edge cases that will be problematic, although without spaces it will be very hard to read, e.g. фоо<tr:foo>>бар<tr:bar>>баз<tr:baz> is well-nigh unreadable. Benwing2 (talk) 00:43, 4 June 2024 (UTC)Reply
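To illustrate why the string above is hard on the eyes but still mechanically parseable, here is a minimal sketch of a splitter that treats > as a separator only when it is not closing an inline modifier. The function name is hypothetical and this is not how {{etymon}} is actually implemented; it only assumes that every < opens a modifier and the next > closes it.

```python
def split_chain(text):
    """Split on '>' separators, leaving <...> inline modifiers intact."""
    parts, current, depth = [], [], 0
    for ch in text:
        if ch == "<":
            depth += 1              # entering an inline modifier
            current.append(ch)
        elif ch == ">" and depth > 0:
            depth -= 1              # this '>' closes the modifier
            current.append(ch)
        elif ch == ">":
            parts.append("".join(current))  # top-level '>' is a separator
            current = []
        else:
            current.append(ch)
    parts.append("".join(current))
    return parts


print(split_chain("фоо<tr:foo>>бар<tr:bar>>баз<tr:baz>"))
print(split_chain("enm>stat>condition"))
```

The parser has no trouble, which supports the point that the syntax is unambiguous; the readability concern for human editors remains.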

@Benwing2 It's not great, I agree. My suggestion is foo:bar<id:baz>, which probably maximises consistency with other templates. Theknightwho (talk) 00:46, 4 June 2024 (UTC)

@Theknightwho I agree with this. Benwing2 (talk) 00:49, 4 June 2024 (UTC)Reply

I tried it that way to have the same adjacent order of language and ID, as in {{ety|en:ID|charitee:enm:ID}} "From Middle English charitee". But I don't have any issue with {{ety|en:ID|enm:charitee:ID}}.

As for the use of ">", in addition to the issue that Benwing mentions, I found it unintuitive. The code on state for example currently contains "enm>stat>condition". Reading this according to the standard meaning of ">" in linguistics results in "condition is from stat, which is from enm".

As for united, as it happens I don't agree with the given etymology, since French unité is a noun meaning "unity", not a past participle comparable to united. The latter is just unite + -ed. But if I were to agree with the given etymology, my proposal would result in {{ety|en:ID|fr:unité:ID|en:-ed:ID}} "From French unité + English -ed." Which seems a good deal simpler. Nicodene (talk) 00:50, 4 June 2024 (UTC)

There are a lot of suggestions in here so I'll just dump a few opinions:

Neutral on changing the etymon parameter format. However, I oppose any scheme where > is used both as a separator and for inline modifiers for the reasons pointed out by Benwing. Out of the options discussed here I would take foo:bar<id:baz> (I assume foo is the language code).
Weak oppose having |1 be in the format lang:ID as I find this very unintuitive, although it does admittedly save keystrokes.
Oppose changing anything about the keyword parameters for now until the requirements are more established. I feel like @Nicodene is putting the cart before the horse in discussing categorization when it's not even clear how this should work. In particular, I'd like to eventually deprecate the existing "X terms derived from Y" system in favour of something more fine-grained (although this will be tough to implement in the short term).

Ioaxxere (talk) 04:15, 4 June 2024 (UTC)Reply

In something like foo:bar, foo should definitely be the lang code, otherwise it will be too confusing. In foo:bar:baz:bat, I would assume foo is a lang code and the others are terms. If the lang code is optional, we'll need a different separator for the terms. Benwing2 (talk) 04:28, 4 June 2024 (UTC)Reply

@Benwing2: Currently, with the > separator, the lang code is optional. Hence you can do something like {{etymon|ine-pro|id=father|af|unc|*peh₂->protect|*-tḗr>agent noun}} (the ine-pro> part is implied). Part of the reason I like the current system is that it's optimized for keystrokes, e.g. *peh₂->protect has 14 characters, whereas ine-pro:*peh₂-<id:protect> has 26 characters. But I think that it should be possible in the new system to omit the lang code in the same manner as long as : characters are escaped everywhere else. Ioaxxere (talk) 04:42, 4 June 2024 (UTC)Reply
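The keystroke figures quoted above can be checked with a short snippet. It counts Unicode code points, so the subscript ₂ counts as one character; the variable names are only for illustration.

```python
current = "*peh₂->protect"                # existing > syntax, lang code implied
proposed = "ine-pro:*peh₂-<id:protect>"   # lang:term<id:...> syntax, lang code spelled out

print(len(current), len(proposed))

# Dropping the explicit language code from the proposed form,
# as suggested above, narrows the gap.
proposed_short = "*peh₂-<id:protect>"
print(len(proposed_short))
```

Most of the difference comes from the explicit language code rather than from the <id:...> wrapper itself.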

@Ioaxxere I am not saying you need to use inline modifiers for things like ID's that occur frequently. You will find, for example, in {{it-conj}} that there are various delimiters used, e.g. {{it-conj}} for riempire might look like {{it-conj|a/riémpio,riempìi,riempìto:riempiùto}}; here the a/ at the beginning indicates the auxiliary verb avere; following are three principal parts, comma-separated, and alternatives for principal parts are colon-separated. Some verbs need four principal parts and use ^ to separate the fourth principal part, e.g. venire, whose full spec looks like {{it-conj|e/vèngo^viène:viéne,vénni:vènni,venùto.fut:verrò.presp:veniènte}}. To help unpack this, the format for principal parts is PRES1S,PHIS1S,PP in most verbs (specifying the 1sg pres indic, the 1sg past historic, and the past participle), but PRES1S^PRES3S,PHIS1S,PP in verbs where the 3sg pres indic is also irregular. In addition, . separates distinct specs, where the main principal parts are collectively a single spec, and fut:verrò is another spec indicating the future principal part, and presp:veniènte is yet another spec indicating the present participle. I could have used the format of fut:verrò for all principal parts, which would look like {{it-conj|e/pres:vèngo.pres3s:viène:viéne.phis:vénni:vènni.pp:venùto.fut:verrò.presp:veniènte}} (BTW you can put spaces and newlines next to any delimiter to make it easier to read), but that's a lot more keystrokes. Benwing2 (talk) 05:07, 4 June 2024 (UTC)

~~Is handling language and ID the same way throughout, as in~~

{{ety|en:polity|stat:enm:condition|inh=1}}

less intuitive than handling them in different ways like this?

~~{{etymon|en|id=polity|inh|enm>stat>condition|tree=1}}~~ ed: nevermind; see below

I wasn't aware you're considering getting rid of "X terms derived from Y" categories. Is the problem the name (as it happens I'd been thinking of suggesting "X terms of Y origin") or is it the problem that such categories exist at all? Nicodene (talk) 04:58, 4 June 2024 (UTC)Reply

@Biolongvistul, Qwertygiy, Ioaxxere, Theknightwho, Benwing2:

Adjusting for your comments, we get something like:

{{ety|en<id:X>|en:clever<id:Y>|en:-ly<id:Z>}} "From clever + -ly".

Does that syntax satisfy everyone?

If so perhaps we can get to discussing Ioaxxere's proposed changes to categories. Nicodene (talk) 09:16, 4 June 2024 (UTC)Reply
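For reference, the syntax proposed above could be parsed along these lines. This is a rough sketch, not the template's actual code: the function names and regular expressions are assumptions, language codes are assumed to consist of lowercase letters and hyphens, and colons inside terms (which would need escaping, as mentioned earlier) are not handled.

```python
import re

# First parameter: a language code with an optional <id:...>.
HEAD = re.compile(r"^(?P<lang>[a-z][a-z-]*)(?:<id:(?P<id>[^<>]*)>)?$")
# Etymon parameters: optional "lang:" prefix, the term, optional <id:...>.
ETYMON = re.compile(
    r"^(?:(?P<lang>[a-z][a-z-]*):)?(?P<term>[^<>]+?)(?:<id:(?P<id>[^<>]*)>)?$"
)


def parse_ety(args):
    """Parse the pipe-separated arguments of a hypothetical {{ety|...}} call."""
    first, *rest = args.split("|")
    head_match = HEAD.match(first)
    if head_match is None:
        raise ValueError(f"bad first parameter: {first!r}")
    head = {"lang": head_match.group("lang"), "id": head_match.group("id")}
    etymons = []
    for param in rest:
        m = ETYMON.match(param)
        if m is None:
            raise ValueError(f"bad etymon parameter: {param!r}")
        etymons.append({
            # A missing language code defaults to the entry's own language.
            "lang": m.group("lang") or head["lang"],
            "term": m.group("term"),
            "id": m.group("id"),
        })
    return head, etymons


head, etymons = parse_ety("en<id:X>|en:clever<id:Y>|en:-ly<id:Z>")
print(head)
print(etymons)
```

Defaulting a missing language code to the first one mentioned is an extra assumption here; it is not part of the syntax as written above.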

I like this, to be honest. Vininn126 (talk) 09:18, 4 June 2024 (UTC)Reply

I'd prefer something like

{{ety|en|clever#Y|-ly#Z}}

That way you minimize typing. Benwing2 (talk) 09:21, 4 June 2024 (UTC)Reply

Happy to go for #X instead of <id:X> if people like it.

It looks like you favour setting the default assumption for language codes to “same as the first one mentioned, unless otherwise specified”? So in this case, given the {{ety|en…}}, the following clever and -ly are assumed to be English.

I suppose in that case the syntax for Russian монтировать (montirovatʹ) would read {{ety|ru#X|de:montieren#Y|-овать#Z}} “From German montieren + Russian -овать (-ovatʹ)” or similar. Nicodene (talk) 10:47, 4 June 2024 (UTC)Reply

This raises the issue of adapted borrowings anyway. I suppose for the tree you'd have a fork either way, but the question is whether to print "bor" in the tree or not. I have a slight preference for <id:X>. Vininn126 (talk) 10:51, 4 June 2024 (UTC)Reply

Can we move forward with one of these syntaxes? Vininn126 (talk) 09:47, 6 June 2024 (UTC)Reply

@Vininn126 languagecode:lemma<id:X> appears to be the most accepted. Perhaps space-saving features can be added down the line, like the aforementioned #ID or having language codes default to the first one mentioned.

@Ioaxxere wants to make major changes to the category system. From what I gather we’ve a long ways to go before reaching that: we’ve yet to hash out any details, and then there’s community consensus to reckon with.

On the other hand we have, if I’m not mistaken, agreed on a new syntax for {{etymon}}. So I also think we might as well implement it now, unless someone has further modifications to suggest. It shouldn’t make adapting to future category changes any easier or more difficult than it would be currently.

I’ve volunteered to manually clean up the existing transclusions of {{etymon}} and update the documentation. Nicodene (talk) 10:22, 6 June 2024 (UTC)

Yes, I think adding categories would be great; I also don't think it's necessary for updating the syntax? I could be wrong. If not, then I think we can move forward. Vininn126 (talk) 10:25, 6 June 2024 (UTC)Reply

I have no strong feelings about the exact markup, I can adjust. Vininn126 (talk) 08:47, 4 June 2024 (UTC)Reply

It was suggested that I bring up the fact that I've been setting the trees below the etymology as opposed to the "current practice" of putting them above, as to me, the trees are not the focus of the etymology section, or at least they shouldn't be considered as such, as your average Joe will probably not care that creepypasta's lineage contains the doublets pasta and paste; they'll just be interested that it came from the /x/ board. Akaibu (talk) 06:31, 5 June 2024 (UTC)

Personally I'd prefer them above. Vininn126 (talk) 06:33, 5 June 2024 (UTC)Reply

I prefer above, it just looks much better. Plus it's collapsed by default, so I definitely think people will notice the etymology first. — SAMEER (؂・؄・؏) 07:31, 5 June 2024 (UTC)Reply

@Babr re diff: not currently. But you may be interested in this discussion. Ioaxxere (talk) 20:21, 15 June 2024 (UTC)Reply

Rethinking confidence parameters

Currently, to indicate uncertainty, you might do something like {{etymon|ine-pro|id=father|af|unc|*peh₂->protect|*-tḗr>agent noun}}. As pointed out by @Fenakhay, this is a bit unintuitive due to the fact that there are two "layers" of keywords present (both etymons are associated with both af and unc). As an alternative, I support being able to write {{etymon|ine-pro|id=father|af|*peh₂->protect?|*-tḗr>agent noun?}}. This is intuitive and also saves two characters. We would just have to make sure that there are no IDs ending in a question mark.

Also, I'm personally not a fan of using # to show IDs, since it could be confused with the actual fragment. In Benwing's example, {{ety|en|clever#Y|-ly#Z}} would link to clever#English:_Y. Ioaxxere (talk) 19:45, 4 June 2024 (UTC)Reply
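The trailing-question-mark idea could be handled roughly as below, using the current > separator. This is only a sketch: the function name is hypothetical, a two-field value is assumed to mean term>ID with the language implied (as in the *peh₂->protect example), and, as noted above, it relies on no ID ending in a question mark.

```python
def parse_etymon_param(param, default_lang):
    """Parse 'lang>term>id', 'term>id' or 'term', with an optional trailing '?'."""
    uncertain = param.endswith("?")
    if uncertain:
        param = param[:-1]          # strip the uncertainty marker
    fields = param.split(">")
    if len(fields) == 3:
        lang, term, etym_id = fields
    elif len(fields) == 2:
        lang = default_lang         # language code implied
        term, etym_id = fields
    elif len(fields) == 1:
        lang, term, etym_id = default_lang, fields[0], None
    else:
        raise ValueError(f"too many '>' separators: {param!r}")
    return {"lang": lang, "term": term, "id": etym_id, "uncertain": uncertain}


for p in ["*peh₂->protect?", "*-tḗr>agent noun?", "enm>stat>condition"]:
    print(parse_etymon_param(p, "ine-pro"))
```

Because the flag is attached to each etymon, one etymon can be marked uncertain without affecting the other, which addresses the "two layers of keywords" concern.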

If you like <id:X>, perhaps another inline modifier like <unc:1>? Nicodene (talk) 21:24, 4 June 2024 (UTC)Reply

I think using ? to indicate uncertainty is fine. I'm not sure about what > and -> mean here; I need to read the docs, but they maybe could be replaced with something more intuitive. Benwing2 (talk) 04:36, 5 June 2024 (UTC)Reply

> precedes an ID, and the hyphen is just part of the PIE lemma *peh₂-. Nicodene (talk) 04:41, 5 June 2024 (UTC)Reply

I see. In that case maybe use @ or ^ to separate the ID from the lemma. Benwing2 (talk) 04:51, 5 June 2024 (UTC)Reply

Classical Attic audio files


Umm ... I have come across several of these. Do we really want them? E.g. λέγω, where on top of everything else, the pronunciation is completely wrong; the speaker says /leːɡuː/ when the reconstructed pronunciation should be /lɛɡɔː/. Some others (which I have not checked yet): καί, ὁ, ψυχή, φύσις, αὐτός, εἰμί, χείρ, οὗτος, χθών, τίς, φθόγγος. Benwing2 (talk) 00:48, 4 June 2024 (UTC)Reply

I believe there was consensus to remove the audio files for Classical Latin, so this should be no different. Andrew Sheedy (talk) 01:10, 4 June 2024 (UTC)Reply

I don't want them either personally. At the very least they should be labelled with a disclaimer like ‘modern attempt to approximate Attic’ to convey some idea of the uncertainties involved in attempting a phonetic rendition of a pronunciation predating Christ. Nicodene (talk) 01:23, 4 June 2024 (UTC)Reply

The ones you have not checked, I am surprised how well they match. Such small details could make readers fond, in their grim and despondent struggles to master Greek. Can’t withsay them in the interest of the art and science. Fay Freak (talk) 01:42, 4 June 2024 (UTC)Reply

Use of etymology trees made with Template:etymon in the entries for multi-word terms


Hello, following the passage of Wiktionary:Votes/2024-04/Allowing etymology trees on entries last week, etymology trees generated by {{etymon}} have been added to a number of entries. Earlier today, there was some discussion on the Discord server about the inclusion of etymology trees in the "Etymology" sections of multi-word entries like United States of America (added here) and Abkhaz Autonomous Soviet Socialist Republic (not added as of writing). Some supported etymology trees on such entries while others opposed their inclusion. The discussion started getting detailed enough as well as got enough attention that I've decided to try and move it here, on-site so that it is more "official" and can have more organization and visibility. Pinging those who expressed views on Discord: @Qwertygiy, Vininn126, Lattermint, Ioaxxere, Akaibu, Soap, Saph668, AG202, Theknightwho. —The Editor's Apprentice (talk) 02:08, 4 June 2024 (UTC)Reply

Replying to say that I don't think it's best to have etymology trees on multiword terms like United States of America. It starts to get unwieldy, and while it looks "cool", we should be aiming for information presented in a concise and helpful way, not the pseudo-gamification that I've started to see. AG202 (talk) 02:15, 4 June 2024 (UTC)Reply

Completely agreed. In general the etymology of a multiword term should indicate the way the term was constructed in the same language, and that's it, unless the term was calqued from some other language. Benwing2 (talk) 03:20, 4 June 2024 (UTC)Reply

In the same vein, the discussion around adding a tree to Llanfairpwllgwyngyllgogerychwyrndrobwllllantysiliogogogoch on Discord shows me the gamification that I'm talking about. Even after it was pointed out to them that they shouldn't work with languages that they don't know, the tree was still added, I assume because it's a long word and because they explicitly stated that they couldn't edit pneumonoultramicroscopicsilicovolcanoconiosis (locked to auto-patrollers and up). I'd also like to remind editors of the statement from the vote:

This vote does not:

Allow or encourage editors to mass-add etymology trees across the site. As stated above, each language community will decide if or when they are appropriate.

AG202 (talk) 00:37, 5 June 2024 (UTC)Reply

Weak support etymology trees on multi-word terms. I don't see the harm considering they're collapsed and don't take a lot of effort to create. However, I admit that the tree on United States of America is virtually unusable simply due to how wide it is. I think the best course of action is to have trees of a certain width display in a horizontal format as seen in Wiktionary:Beer parlour/2024/May#Descendant tree design. Ioaxxere (talk) 04:22, 4 June 2024 (UTC)

@AG202: I would like some clarity on what you're actually aiming for. Are you saying that no etymology tree should be added to terms with a space? What about a term like chow mein, which was directly borrowed from a single word? Ioaxxere (talk) 04:28, 7 June 2024 (UTC)Reply

@Ioaxxere: No, I said words like United States of America, where it’d be a clear SOP term if not for the fact that it’s a proper noun. When we start debating whether or not to add the tree for of in a multiword term, it’s getting out of hand. AG202 (talk) 04:41, 7 June 2024 (UTC)Reply

Oppose per AG202. —Caoimhin ceallach (talk) 00:36, 7 June 2024 (UTC)Reply

Oppose per AG202. DCDuring (talk) 15:02, 8 June 2024 (UTC)Reply

As much as I support the template in general,

Oppose the generation of trees on multiword entries. Of course having it for an ID and such is still useful. Vininn126 (talk) 15:09, 8 June 2024 (UTC)Reply

Oppose per AG202. — Fenakhay ^{(حيطي · مساهماتي)} 17:01, 8 June 2024 (UTC)Reply

User:Purplebackpack89


This user is making an awful lot of noise for very little signal, and judging by their mainspace-to-talkpages edit ratio, they don't seem particularly interested in actually building a dictionary.

Purplebackpack will probably argue that they're not making as many mainspace edits as they'd like because other people are constantly putting spokes in his/her wheel. They apparently don't like their work being reviewed and quality-controlled, or their edit history being looked at, and will readily dismiss criticism as "harassment", an accusation they've levelled at no less than four different people in the course of a single week (diff, diff, diff, diff).

While we should look to see if there isn't some truth there (I think we could have done without WF's trolling, at least), and make sure that there isn't a systemic problem of people feeling pressured (a topic which has recently been brought up), I would argue that rapid-fire accusations from a single editor make it harder to think clearly on such an issue.

And the fact that the same person has levelled similar accusations at an entirely different set of editors many years ago (diff, diff, diff, diff) certainly doesn't help in taking their claims seriously now.

They seem to take particular exception to people challenging them on their votes (see this discussion); notice the similarity between this and the accusation of harassment thrown at Benwing2 after his comment (on Purplebackpack regularly failing to provide a rationale for his/her votes).

I'd also like to mention that, while complaining of other people's behaviour towards them, they seem unbothered (diff, diff) by the idea that their own attitude might have played a role in the abrupt decision of a fellow editor to leave; note the striking temporal proximity between the aforementioned discussion and that editor's departure.

If Purplebackpack perceives any kind of scrutiny as harassment, I would say Wiktionary simply isn't the right place for them. Everyone on this project must be ready to face criticism - sometimes repeatedly.

I personally am loath to imagine not being able to go through a user contributions and express earnest concern about the quality of their interventions (in the main space or elsewhere) without being labelled as a "harasser".

Therefore, for the good of the project, I would like to propose that this user be prevented from further editing. This is not meant as a punitive measure (I'm not "out to get him/her"), but as a way of putting an end to highly toxic and massively detrimental behaviour, thereby preserving an atmosphere more conducive to serene dialogue and productive work. P U C – 23:07, 4 June 2024 (UTC)Reply

PBP's false harassment accusations have gotten to the point of trolling. I view such unwarranted accusations, esp. a pattern of them, as a blockable offense, and I think if PBP makes any more such accusations that aren't clearly warranted, they should be blocked, maybe on the schedule of one week, then one month, then permanently if they keep it up. PBP reminds me of Dan Polansky; a ton of heat, little light, and a strong increase in the toxicity of the atmosphere as a result of them. In Dan Polansky's case, I finally permablocked him for outright racism on top of everything else. I suspect PBP is smart enough not to engage in outright racism, but IMO that should not prevent a warranted block. Benwing2 (talk) 04:33, 5 June 2024 (UTC)Reply

^ I agree with Ben's suggestion of issuing increasing blocks. PBP's recent behavior has been really inappropriate and rude, but I'm not sure if a permaban is the best immediate action. But I definitely think we should not tolerate disruptions to the project. — SAMEER (؂・؄・؏) 07:24, 5 June 2024 (UTC)Reply

I agree. Theknightwho (talk) 12:56, 5 June 2024 (UTC)Reply

I have not had a single productive encounter with this user. Vininn126 (talk) 05:53, 5 June 2024 (UTC)Reply

Same - just a lot of vitriol and repeated sniping. Theknightwho (talk) 09:15, 5 June 2024 (UTC)Reply

Many can and have characterized your interpersonal relations the same way, @Theknightwho. Purplebackpack89 12:31, 5 June 2024 (UTC)

"no u". thread's not about them, bro, it's about you. Vininn126 (talk) 14:23, 5 June 2024 (UTC)Reply

Have you heard what Wordy and I have been saying? There are greater systemic concerns here and it's wrong to single out one editor. Purplebackpack89 12:16, 6 June 2024 (UTC)Reply

@Purplebackpack89 There may be larger systemic concerns here, *AND* this does not absolve you from behaving in a civil fashion at all times. Imperfections in the system don't give you a free pass to run rampant and blame your bad behavior on "the system". Everyone (even Wordy) has tried to make that point in one way or another, but IMO you don't want to listen. Benwing2 (talk) 07:24, 7 June 2024 (UTC)Reply

At some point I may prepare a longer response, but I gotta interject this right now: I'm CLEARLY HERE to build a Wiktionary, as I've created 636 entries. Purplebackpack89 05:21, 5 June 2024 (UTC)Reply

I was going to say:
On a balance, I'm inclined (perhaps naively) to think PBP is not trolling but sincere, that he really regards people as harassing him, and is really freaked out about being blocked... in part because I think a troll would know that being so over-the-top — accusing so many different users of harassment (some on very flimsy grounds); and when blocked, sending lots of pings on his talk page, sending me an e-mail and contacting me on Wikipedia asking to be unblocked; and holding up creating ~636 entries since 2009 as an accomplishment — is counter-persuasive. Sincerity doesn't ameliorate the extent to which many of the accusations are unwarranted; indeed, sincerely perceiving most disagreement as harassment is a problem. PBP, when you're complaining to multiple different users about (for example) the fact that they RFDed an entry you made, but then the community discusses the entries at RFD and determines they indeed aren't the sort of thing we want to include, it would be prudent to reflect that the RFDer was not harassing you but correctly perceiving that the entry didn't meet commonly-accepted criteria for inclusion.
However, before I could post that, I see his lack of any indication of awareness of irony in telling other users to walk away while himself continuing to poke at them🙄 which... well, whether it's trolling or sincere, it's ill-advised either way. - -sche (discuss) 06:25, 5 June 2024 (UTC)Reply

@-sche I think the idea that I "perceive most disagreement as harassment" is exaggerated. Below I am going to explain how I came to the conclusion that I am being harassed. Purplebackpack89 12:36, 5 June 2024 (UTC)Reply

I agree with -sche's views here. Rarely have I seen so histrionic a user, who demands so much attention from his fellow editors and politicks so energetically on the discussion pages, while contributing so little and showing so few signs of introspection. — Mnemosientje (t · c) 15:42, 5 June 2024 (UTC)Reply

Not gonna weigh in on the question of whether PB89's contributions have been constructive on balance. I do find there seems to be a lot of selectivity in which editors are deemed intolerably disruptive. WordyAndNerdy (talk) 07:52, 5 June 2024 (UTC)Reply

Oh, totally agree. There are the guardians here and there are the peons. The behavior of the guardians is no better than that of the peons, but no peon can ever tell a GUARDIAN that he's wrong

And some of the people who are commenting on this are people who, in undoing or modifying my edits, have made questionable edits themselves. For example, Theknightwho stumbled into the hot-dog-is-a-sandwich debate being too hasty about reverting me. Benwing nominated dont tread on me for deletion...and quickly five votes that he was wrong showed up. Instead of owning up to their screw-ups, they're here. Purplebackpack89 12:29, 5 June 2024 (UTC)Reply

To be clear, the "intolerably disruptive" remark was intended to reference generally trollish editors that Wiktionary has collectively chosen to tolerate/ignore for some reason. Problem admins ("guardians") are definitely an issue as well – and my experience is also that Wiktionary typically circles the wagons around them – but that's separate from a wiki keeping pet trolls. I'd also urge you to consider the possibility that Benwing RfD'ing dont tread on me was independent of TKW leaping into the fray at hot dog. You didn't include an etymology explaining that this is the precise text on the Gadsden flag. It's possible Benwing saw the entry without being familiar with that history and concluded it was simply an unlikely misspelling. WordyAndNerdy (talk) 15:47, 5 June 2024 (UTC)

I am willing to concede that they are possibly unrelated...but they still happened in the same window of a few days, which again brings us to the problem of a whole lot happening to me at once and that (understandably!) making me frustrated. If we're talking in hypotheticals, it's also possible Benwing could've acknowledged there was information he didn't know and admitted he erred. HE DIDN'T (He's a guardian...why would he?). If we're talking hypotheticals, it's also possible that Knight or Ben could've noticed "hey, Purplebackpack89 feels stressed out and put upon! Maybe I should leave him alone for awhile, and if there's problems that need fixing, I'll get to them at a later date!". THEY DIDN'T. Purplebackpack89 16:22, 5 June 2024 (UTC)

I think this is the stage at which we need to shift from narrowly focusing on individual incidents to discussing remedies to overarching systemic issues. WordyAndNerdy (talk) 16:47, 5 June 2024 (UTC)

@User:WordyAndNerdy You could take the lead on that. My proposals haven't gained any traction:

1. to forbid any mention of any username (pings and signatures naturally excepted, probably also sayonaras and welcome-backs) in principal namespaces, Wiktionary space and their talk spaces, excepting the page required for the following proposals and enforcement thereof.
    1. this would be enforced by increasing blocks and/or removal of admin powers. Formal public apologies on offended user's talk pages or in BP might mitigate the blocks or removals.
2. to have a request for mediation process, page, and template. Requests for interaction bans could be handled there as well.

Only the request for mediation addresses 'hounding' or 'abuse of administrative powers', including unjustified blocking, 'passive aggressive behavior', etc, or, possibly, the consequences of the 'gender-related, structural' composition of our veteran contributors, admins, and discussion participants. DCDuring (talk) 21:48, 6 June 2024 (UTC)Reply

I think there is abundant evidence, even on this subpage, that mentioning individual users on core community discussion pages too often rapidly leads to defensiveness and a total loss of focus on substantive, principled discussion, even discussion of how to limit (interpersonal) conflict. We should not want to have our conflict-suppression mechanisms be targeted against individuals, as has been suggested here. DCDuring (talk) 22:15, 6 June 2024 (UTC)Reply

Point one as I'm reading it strikes me as unworkable. Some discussions will inevitably centre on a specific user or group of users. Sometimes these discussions will be of a positive or neutral nature. Sometimes they'll involve navigating more difficult territory. But implementing formal mediation as a frontline remedy to interpersonal concerns doesn't seem like a viable plan. Some people won't look at the process as mediation. They'll see it as arbitration – being put on wiki-trial. Starting out on what some will find to be an adversarial footing doesn't seem like it would be conducive toward conflict suppression to me. It seems more likely to put people in a siege mentality and escalate matters that might otherwise be resolved without much fuss. I do think limiting the number of active BP discussions concerning a specific user to one at a time might be a step in the right direction. We do need a formal mediation process. I just think less-formal discussion might be ideal as a frontline approach. Why require a mediation process by default when it won't be necessary to resolve every disagreement that arises? WordyAndNerdy (talk) 06:52, 7 June 2024 (UTC)Reply

I'd be happy to hear about other proposals that have a better chance of success.

As a starting point, it is basic practical psychology (followed in business, law, government, and sometimes, even politics) to frame issues as about substance and not persons, even personal actions, let alone invisible attributes, like motivations, values, attitudes, beliefs, intelligence or energy levels, etc. To the extent our users aren't doing that, they would benefit from learning to do so. The first locus after edit wars are talk pages for entries, next are user talk pages. Right now people chime in (or pile on) on talk pages they are watching. At some point the discussion may fail to resolve the issue. This is where things go wrong if the issues are framed as personal and not substantive.

As soon as issues of personal behavior come up, especially in a public forum, we see: defensiveness, score-settling, etc. This can be worse than a real trial, it can be mobocracy. Were interpersonal conflicts diverted to a mediation, as there are necessarily two parties in an interpersonal conflict, neither party need be on trial. I would suggest that we may need the mediation page to be basically private, invisible to the community at large, except possibly in the event of failure of the process, after a waiting period. The role of a mediator is probably first to sort out substantive issues (for the appropriate forums) to the extent the users have failed to do so. Then behavioral issues can be sorted. Keeping attributions out of the discussion at all stages is critical. DCDuring (talk) 23:53, 7 June 2024 (UTC)Reply

@WordyAndNerdy I agree with you. In particular I think, as you do, that the mediation process should start only when an informal BP discussion fails to resolve the issue. I also think there's no way that it's workable to forbid mentioning specific users in Wiktionary-space, talk spaces, etc. Most of these mentions as they currently occur are not intended to single out a user for opprobrium or anything but for any of a number of other reasons, e.g. to agree with someone, to mention their theory or proposal on something, etc. I think your suggestion of limiting BP discussions concerning a particular user to one at a time should be enforceable; if there are multiple simultaneous concerns about a particular user they're likely to be related and should be merged. If in some weird circumstance we really need to have two unrelated simultaneous discussions about a given user and one can't wait for the other to finish, that should require prior explicitly discussed consensus. Benwing2 (talk) 07:18, 7 June 2024 (UTC)Reply

The occasional efforts of some of our wiser experienced users to mediate discussions in public forums often seem to simply lead to the interpersonal conflict threatening to involve them.

Direct person-to-person contact on user talk page is the first-line location for discussions. If an issue arises from a substantive matter, then the substantive matter should be discussed in the appropriate forum: BP, TR, GP. It should not be hard to refer to edits by diffs without mentioning the editor by name. I don't think that we have a very good record of resolving interpersonal conflicts in group forums, unless we count driving contributors of all kinds away or into virtual hiding (changing username, narrow range of edits) as success. It is very easy to exclude personal mentions: policies, warnings, escalating blocks. DCDuring (talk) 23:53, 7 June 2024 (UTC)Reply

Purplebackpack on feelings of harassment

Were people actually harassing me? Maybe, maybe not. May I explain why I felt harassed?

A large portion of my edits have been scrutinized in a very short amount of time. People have taken literally years of work, some of which hadn't bothered anybody for years, and tried to change or delete a lot of it in just two weeks. Had the scrutiny occurred more slowly, I would not have felt as put upon.
Editors have given the appearance of assuming bad faith and focusing on the editor, not the content. There have been several nominations or comments on the lines of "oh, well, this is a Purplebackpack89 edit". That's not supposed to matter.
Editors made no good-faith effort to de-escalate and continued making the edits even though it was clearly bothering me. No deadline...could just wait until I was less stressed out.
Some of the attempts to modify my edits ended up being questionable themselves. For example, Theknightwho stumbled into the hot-dog-is-a-sandwich debate being too hasty about reverting me. Benwing nominated dont tread on me for deletion...and quickly five votes that he was wrong showed up. Denazz piled on by trolling left and right.

Given those four things happening, basic psychology would suggest that I would be frustrated. And naturally, a questionable 31-hour block and this thread would also put me on edge! It would probably put anybody on edge! Was I over the top? Maybe, but I feel that where my feelings of harassment came from are understandable. The solution to say that this is entirely my fault, I'm never entitled to feel frustrated, and nobody else did anything questionable is...just wrong. Fundamentally, this thread COULD end up having a chilling effect on speaking up if you feel put upon and...we don't want that either. Purplebackpack89 12:49, 5 June 2024 (UTC)Reply

Regarding your comments about "focusing on the editor": I think it's to be expected that if a user has a history of questionable edits/entries, their activity will get more scrutiny. For better or worse, there's an (unwritten) reputation system here, and it does matter who created an entry. Pretending otherwise is a fantasy. Jberkel 13:17, 5 June 2024 (UTC)

Of late, people have been exaggerating the questionability of my edits though, @Jberkel. Above, people are essentially claiming that I never did anything productive at all and that is inaccurate. Purplebackpack89 13:31, 5 June 2024 (UTC)

I'd say you're particularly sensitive to corrections, from what I've seen. I'd love to have people scrutinize my work. Vininn126 (talk) 14:24, 5 June 2024 (UTC)

I think you'd love it to a point and not be comfortable with it beyond that point, @Vininn126 (And I believe that is true for most editors). If people scrutinized you in the manner I outlined, I think you (or anyone) would be somewhat bothered. Purplebackpack89 15:19, 5 June 2024 (UTC)

I was heavily scrutinized when I first started editing. Even berated. I don't see similar berating towards you. I see corrections that I personally would welcome. Vininn126 (talk) 15:28, 5 June 2024 (UTC)

I'd argue there's a significant difference to receiving heightened scrutiny as an actual wiki-newbie and receiving it as a veteran editor. At a certain point it's only natural for a veteran to start feeling that they're being subjected to disproportionate scrutiny and opposition. Especially when this community creates a special policy carveout for a habitually trollish editor. It's almost as if provocation is treated as excusable while being especially provokable is not. WordyAndNerdy (talk) 16:12, 5 June 2024 (UTC)

I really feel like you didn't read my messages. I'd say I'm a veteran editor at this point and that I'd love more scrutiny. Vininn126 (talk) 16:13, 5 June 2024 (UTC)

Also I really don't see why editing for a long time gives you this freedom. Let's say someone took a long break, or has just always been problematic. Vininn126 (talk) 16:21, 5 June 2024 (UTC)

That's you. Other editors will respond differently. Yes, this is a wiki. Every edit comes with the caveat it might be objected to or undone. But it's not unexpected for someone to start feeling like a pariah or whipping kid if they routinely encounter intense opposition. That feeling doesn't come from nowhere. This wiki definitely plays favourites at times. WordyAndNerdy (talk) 16:26, 5 June 2024 (UTC)

Your assumption that it can come from nowhere, in my opinion, greatly misrepresents PB's reaction. I'm not trying to invalidate anyone's emotions, but that also doesn't mean someone's reaction can't be over-the-top or unproductive. If we never address that behavior, things get bad very quickly. Vininn126 (talk) 16:28, 5 June 2024 (UTC)

It doesn't matter, @Vininn126. You still have to assume good faith about their edits (and the "reputation system" mentioned above flies in the face of that btw). And if an editor feels put upon, it seems like a good idea to lay off him for a bit unless there's something serious like vandalism that has-to, has-to, has-to be dealt with right away.

I don't think Wordy is saying my reaction came from NOWHERE, I think he's saying that there is a SOMEWHERE, AND that that needs to be addressed rather than singling me out alone. Purplebackpack89 16:32, 5 June 2024 (UTC)

At no point did I assume bad faith on your part. Having good faith doesn't absolve you from any bad behavior. Vininn126 (talk) 16:33, 5 June 2024 (UTC)

My point is that it's a double standard to treat having a dramatic reaction to provocation as requiring community action while giving a pass toward actual provocation (see the second link in my above comment). WordyAndNerdy (talk) 16:40, 5 June 2024 (UTC)

I saw, and see my comment that this thread is about this user in question. If it's about being "hounded", I don't think that those claims are founded. If it's about other actions, I'd prefer they stay in that thread. Please don't muddy the waters on the conversation to make a point that's tangentially related. Vininn126 (talk) 16:43, 5 June 2024 (UTC)

It isn't "muddy[ing] the waters." It's providing relevant context. Nothing happens in a vacuum. My thoughts on this haven't changed in ten years. WordyAndNerdy (talk) 16:53, 5 June 2024 (UTC)

You are muddying the waters - it's just a boatload of whataboutism. Theknightwho (talk) 00:03, 6 June 2024 (UTC)

There was no heightened scrutiny, and even if there had been it would have been justified given the number of mistakes I (and others) have found in PB89's edits. Saying that PB89 "routinely encounter[ed] intense opposition" is simply a complete fiction. Theknightwho (talk) 00:16, 6 June 2024 (UTC)

Knight, you bear some responsibility for this situation. There was nothing you were doing vis-a-vis me that had to be handled immediately. You could have noticed that I was frustrated by the way you were handling things and proceeded more slowly and cautiously. You didn't, in fact, you literally did the exact opposite.

And on top of this, you yourself made mistakes while hastily trying to undo my mistakes. And you never owned up.

There are several threads that have expressed concern about your confrontationalism and this one should echo those concerns. Purplebackpack89 12:14, 6 June 2024 (UTC)

I wasn’t confrontational with you at all, and making some minor changes to entries you’d edited and then posted about on high-traffic pages didn’t have anything to do with you specifically. I tagged you in one edit as a form of guidance, and your disproportionate feelings of negativity are not a reasonable response, as numerous people have said by now. It is not my responsibility to manage your emotions; you are an adult, and you do not get a free pass on mistreating other editors simply because you feel upset. Theknightwho (talk) 16:25, 6 June 2024 (UTC)Reply

PBP, you need to understand that this is a collaborative project and the other users here are not your enemies. You brought up Assume good faith but you yourself have never assumed good faith in anyone this whole time. In the many disputes you've had you always assumed the other person had a motive against you, which is extremely rude and disrespectful. Which begs the question: Why are you the only one who deserves the assumption of good faith? Why do you never afford others the same assumption you mention?

You should not assume people who look through your contributions are doing so out of spite, but rather because everyone makes mistakes and it's honestly for the best that everyone's edits get reviewed at least occasionally. Otherwise, mistakes could go unnoticed for decades! If you've ever done entry maintenance, you would know that yourself. Hell, I actually used to get into disputes with Fenakhay when I first joined the project for the same reason. But he basically taught me how to format entries and now he's the main person I ask when I have a question about entry formatting.

Additionally, your repeated provocations of editors recently are completely inappropriate. What good reason is there to tag TKW 5 days after things calmed down to tell him to walk away? Especially since he DID walk away, 5 days ago! It was you who didn't! What reason is there to send aggressive messages to Ben telling him that he "better rescind" his RfD? After Ben told you to calm down because you were being aggressive, what was the point in continuing to double-down rather than walk away and discuss the issue at RfD?? And after being blocked for "intimidation and bullying", what is the reason to try to pick an argument with the blocker —who has not participated in any conversations with/about you— rather than just walking away (as you yourself suggested)? You preach values you don't even follow and regularly throw stones in a glass house. You started a whole post about how TKW checking your edits was harassment, yet you're somehow incapable of seeing how your actions towards Ben could be perceived as bullying by a third party. — SAMEER (؂・؄・؏) 18:55, 5 June 2024 (UTC)

Benwing made a bad RfD. It was so bad that five people almost instantly voted keep. But...the problem is me telling him it's a bad RfD, not him creating one?

Benwing and Knight and Denazz bear some responsibility for this situation. They made the situation worse with trolling in Denazz's case and questionable edits in the other two. Why do they get free passes and I don't? Is it because they're GUARDIANS and I'm a peon? Purplebackpack89 20:06, 5 June 2024 (UTC)

You're right, that RfD may have been a "mistake" (as in, it seems people disagree with him), but you had no right to be rude about it. When we see RfD's we disagree with, we discuss at the RfD why we think it's a bad idea and let other people compare the reasoning provided. We do not hound the people who made the RfD demanding they withdraw it (that's not even the process for resolving RfD's), and mock them for incorrectly putting up a term for RfD. That is bullying.

What do you mean Ben got a free pass just because he's a "Guardian"?? Do you mean that everyone just listened to Ben cuz he's an admin? Cuz if so, that literally didn't happen. In fact, you said yourself that most users in the RfD read your reasoning and agreed. Nobody voted against you just because you're a "peon" (whatever that means), so I have no idea what you are talking about.

Also, looking solely at interactions —that you specifically— have had with Ben and TKW, it seems like they were just doing entry maintenance. And looking at your recent interactions with Ben, it genuinely appears to me as though you are being a bully. — SAMEER (؂・؄・؏) 20:38, 5 June 2024 (UTC)Reply

Making an RfD that doesn't end up passing is, in fact, not a problem. That's perfectly ordinary. Making an RfD into a pissing contest, on the other hand... Nicodene (talk) 21:36, 5 June 2024 (UTC)

Yeah agreed. It’s ok if you’ve ‘mistakenly’ rfd-ed an entry convinced that the entry will fail.

Purple is bereft of maturity and is sorely inexperienced. He isn’t necessarily acting in bad faith, he takes it for granted that he is always right in any untoward issues involving him but… evil dictators also know they are doing the correct deed, oh well. Thus it’s just a matter of interpretation whether Purple’s bearing is going to result in other editors getting psychologically harassed and being coerced into quitting Wiktionary—just as the populace of a brutal dictatorship are forced to flee their country or face persecution in their homeland—or Purple is actually an innocent victim of harsh law enforcement here. Inqilābī 22:09, 5 June 2024 (UTC)Reply

Yeah, this is basically my reading of it: PB89 isn't acting in bad faith. They just lack social awareness and think they're axiomatically correct about everything, so they conclude the only possible reason anyone disagrees with them must be because they're out to get them. Frankly, I don't care whether it's down to incompetence or maliciousness, but either way it's having a very negative effect. Theknightwho (talk) 00:10, 6 June 2024 (UTC)Reply

I tell you that something bothers me, Knight. Your response is to do that thing that much more, and to do it so hastily you make mistakes while doing so. It's not surprising that anybody would feel attacked under that circumstance. Your critique comes off as hypocritical because embedded in it is that your edits and conduct towards me are "axiomatically correct". Also, can everybody cut out playing amateur psychologist? You ain't Joyce Brothers. Purplebackpack89 15:17, 7 June 2024 (UTC)Reply

It is unreasonable to demand that an admin stop doing their job simply because of your personal feelings. Nicodene (talk) 07:02, 9 June 2024 (UTC)

You keep saying I make lots of mistakes, but what you actually mean is that after I reverted your change to the definition of hot dog from “sandwich” to “entree”, Equinox then changed it to “snack”. The fact that you keep focusing on this doesn’t make any sense to me, as it clearly misrepresents what happened, and I don’t see how it’s supposed to be hypocritical anyway. Theknightwho (talk) 10:41, 9 June 2024 (UTC)Reply

I would suggest that, even if there is consensus to block Purplebackpack89, it would make more sense to just block him from discussion pages— while still letting him contribute to the dictionary proper, cause his lexicographical additions, with fixes and corrections by other editors, are still substantial. Inqilābī 19:50, 6 June 2024 (UTC)Reply

Constructed languages in the mainspace


(Notifying -sche, The Editor's Apprentice, Mahagaja): I recently created this vote (start date TBD): Wiktionary:Votes/2024-06/CFI for mainspace constructed languages, in hopes of coming to a consensus on which conlangs should be included in the mainspace and why. Since its creation, nonetheless, I've come to realize that we currently include possibly two conlangs in the mainspace outside of the ones listed at WT:CFI, and I'm not sure what to do about them. These include:

Eskayan
N'Ko, though it's debated as to whether or not this is a conlang (per Glottolog), a mixed language (per Ethnologue), or simply a literary register (per Wikipedia).

If we consider N'Ko a conlang, should it be included in our permitted mainspace list? I would think so, but I also don't feel like it's an actual conlang. I don't know anything about Eskayan to comment on it.

On the same note, I'd like to bring up the case of palawa kani, created by the Tasmanian Aboriginal Centre as it seems closer to the revival of an indigenous language instead of a language like Volapük, at least based on my surface-level research of it. It looks to be taught to children, is used in place names, is used in official dubbing, has a growing oral tradition and more. I cannot yet verify if it has native speakers, but I wouldn't be surprised if it does, if not for a lack of direct access to the language (the merits of which I won't comment on). If so, I'd like to see what the consensus is about adding it to Proposal 1 of the above vote. AG202 (talk) 23:57, 4 June 2024 (UTC)Reply

Pinging @Mar vin kaiser since you seem to be the most active editor of Eskayan & @Thadh since you mentioned it on Discord. AG202 (talk) 00:22, 5 June 2024 (UTC)

This looks well thought out, nicely done. Thanks. Vininn126 (talk) 07:15, 5 June 2024 (UTC)

In a previous discussion (before the Interslavic discussion), someone said it felt like the divide we make between mainspace conlangs and appendix-space ones was that the handful of long-used conlangs are in mainspace, and new ones are in appendix space... and they said that thinking it was a bad thing (arbitrary), but I think it's been a reasonable approach. Having a fair number of native speakers and/or works in the language could be another decent rule of thumb. As regards Eskayan, I note how many aspects of our attitudes to / treatment of artificial languages seem to have been developed with Western conlangs in mind (often created recently and for certain reasons, attempting and failing to be world languages or new nations, or for fiction), to the extent that the existence of old non-Western artificial languages like Eskayan (created for different reasons and used in rather different ways, in Eskayan's case as a language of the Eskaya people, taught in several schools) seems to have slipped the minds of the people devising the original conlang policies, and flown under the radar. All things considered, that (fact that Eskayan is currently included) seems OK to me. I'm not wedded to it being in mainspace if people want it moved to appendix-space, but it does seem to be in a different boat from various Western conlangs that have been suggested for inclusion. - -sche (discuss) 00:53, 5 June 2024 (UTC)Reply

I have no strong opinions either way on Eskayan; it feels different in some way from run-of-the-mill conlangs but I don't know if that's just a bias based on its non-Western origin. Benwing2 (talk) 04:06, 5 June 2024 (UTC)

BTW as for N'Ko, from reading the Wikipedia entry it sounds more like Standard Basque, Standard Moroccan Amazigh, Rumantsch Grischun or Unified Kichwa, which I do not consider conlangs so much as intentionally created koines. These are on the same spectrum as Modern Hebrew, standard German and standard Italian, all of which are partly planned languages but none of which are reasonably considered conlangs IMO. Benwing2 (talk) 04:17, 5 June 2024 (UTC)Reply

Thanks! Yeah I won't worry about it then. AG202 (talk) 05:06, 5 June 2024 (UTC)

I've since done a skim of relevant chapters of The Last Language on Earth: Linguistic Utopianism in the Philippines by Dr. Piers Kelly (Dec 2021), which focuses on Eskayan, and based on what I've read, it seems like it has a strong rationale for inclusion. It's taught in schools to children, used in praying, singing, speechmaking, excluding overhearers, and common phrases, and there's an extensive literary history. "In effect, Eskayan appears to have supplanted the special authoritative role of English." They estimate that there are between 500-550 speakers of Eskayan, with several speakers with a high degree of linguistic competence in speaking, reading, and writing the language.

The only issue I'm seeing is that it's technically not a mother tongue: "Unlike Boholano-Visayan, which is acquired as a mother tongue, knowledge of Eskayan is learned through voluntary attendance at traditional Eskaya schools, and mastery of the language is considered a prerequisite for becoming truly Eskaya." Thus, there technically aren't any L1 speakers from birth, but seeing as though there are children taught it from a fairly young age, would that not qualify as a pseudo-native language? It's definitely different from the typical conlang, and has fully-fledged educational aspects, including arithmetic & equations being taught and performed in schools. The author starts out the final section, stating:

The immediate future for Eskayan as a viable language is reasonably assured. Competent speakers have status within the communities; in Biabas and Taytay the language is being actively learned by children, and plans are well under way to construct an Eskaya school in Cadapdapan. Recent government recognition, through the Indigenous Peoples Rights Act, provides additional legitimacy to an already valued language.

This makes it clear to me that it holds legitimacy and should be included in the namespace. AG202 (talk) 05:06, 5 June 2024 (UTC)

@AG202 Sounds good to me; I would amend your proposals to include it along with Esperanto. Benwing2 (talk) 05:17, 5 June 2024 (UTC)

@AG202, Benwing2: I seem to be late in entering this discussion but yeah, I agree that Eskayan is quite different from other conlangs. I would describe Eskayan as already part of the indigenous culture of that region of Bohol. So it should be part of the mainspace. --Mar vin kaiser (talk) 07:10, 5 June 2024 (UTC)

Actually the more I'm thinking about it, the more I feel it might be possible to include it as a "jargon" of Cebuano. I guess it doesn't really give a complete picture, but it's basically Cebuano with an almost complete substitution of words, which is very similar to what we usually consider a jargon, rather than an independent language. Thadh (talk) 07:11, 5 June 2024 (UTC)Reply

On a similar note, Eskayan is currently considered an LDL; I assume that that should stay the same? I hate to continue having separate threads on this topic, but I want to make sure that everything is addressed before setting the start time & date for the vote. AG202 (talk) 22:39, 6 June 2024 (UTC)

It's been brought to my attention that Ido is said to have 26 native speakers in Finland per the Ido language Wikipedia page. However, I'm not sure if it's been independently verified and would like more input before I make any changes in either direction about it. Surjection previously told me on Discord that the Finnish website does not provide any additional information. CC: @Benwing2, @Thadh, @-sche, @Vininn126 AG202 (talk) 19:55, 6 June 2024 (UTC)Reply

I think that if we can't verify it, we shouldn't consider it. Vininn126 (talk) 20:08, 6 June 2024 (UTC)

I mean, we can verify it, namely by looking at the reference. Tilastokeskus isn't an organisation to just invent 26 native speakers, is it? Thadh (talk) 20:19, 6 June 2024 (UTC)

I suspect that it can't be ruled out that these were just some pranksters fooling around with their own self reported information submitted to the population census database (if such choice had been presented in the questionnaire form). I doubt that the Finnish statisticians actually made any effort to verify the actual Ido language proficiency of these people. And if native speakers actually exist, then it should be possible to confirm this information from the other sources. --Ssvb (talk) 21:10, 6 June 2024 (UTC)Reply

I was very astonished when I saw it for the first time, and frankly, I find it extremely hard to believe. I mean, there are reports about native speakers of Volapük, but that was when Volapük was still a huge movement. Ido has never been a huge movement. In fact, I think these 26 native speakers appearing out of the blue are possible only if the entire community of Ido users decided to move to Finland within a short period of time, started multiplying themselves and teaching it to their newborns. But on the other hand, the source doesn't seem to be unreliable, although in this case I wonder if it couldn't simply be a mistake. IJzeren Jan (talk) 21:28, 6 June 2024 (UTC) N.B. I wouldn't put my money on pranksters either, as that would require some pretty good organization; and why would they pick Ido, of all possibilities? Putting myself in their shoes, I'd rather have chosen Klingon, Na'vi, Huttese or something similar. IJzeren Jan (talk) 21:39, 6 June 2024 (UTC)Reply

@Ssvb: From what I know, the Finnish statistical database, just like the Dutch one, is based on population data obtained at birth/subsequent corrections during the person's lifetime. Unlike the census, this is personal information the government has on you, so the chance people would play with that is a lot lower. But it's always possible that I misunderstood this? Thadh (talk) 21:41, 6 June 2024 (UTC)Reply

But on the other hand, does Finland collect data on one's native language, too? Because I'm sure as hell that the Netherlands don't! IJzeren Jan (talk) 21:59, 6 June 2024 (UTC)Reply

That's a good point. I don't know, but I'm sure that that information can be found somewhere. Thadh (talk) 22:05, 6 June 2024 (UTC)Reply

@Thadh: Yes, it's personal information, but it seems that Finnish residents can just log in using their online banking credentials here and update various details, including their "native language". I doubt that an oddly selected native language can affect anything in everyday life. And I think that having a few dozen conlanger weirdos in the whole of Finland isn't statistically improbable. For example, there were some nutcases in Taiwan who even changed their names just to get a discount. If there's a loophole in the system AND a real incentive to abuse it, then it will be abused. --Ssvb (talk) 01:01, 7 June 2024 (UTC)

@Ssvb: More realistically tax advisors now suggest to change genders according to the new self-determination act in the FRG, because you get different capitalisation factors for the assessment of the value of a land encumbrance, remaining at the owner after donating a property and hence reducing gift tax, depending on legal gender. I can imagine legal advantages to slip in for someone determining his native language, as it is also an idpol kind of thing, or even purposefully teach a child an artificial language as a second native language just for benefits introduced somewhere. Wiktionary alone though is just not important enough to be gamed this way. Fay Freak (talk) 14:13, 7 June 2024 (UTC)Reply

The borderline transphobia aside (was that really a meaningful addition to the discussion?), I don't see how Ido, a constructed language, would give anyone benefits, so I am highly sceptical anyone would change their native languages for that reason, even more so than for the reason of trying to be funny. Thadh (talk) 16:37, 7 June 2024 (UTC)Reply

Strange individuals exist in every society. You can't expect everyone to be sane and reasonable. --Ssvb (talk) 18:06, 7 June 2024 (UTC)Reply

Maaaybe self-promotion, as you could brag about how your hip new language has "26 recorded native speakers in Finland", even if it isn't technically true. CitationsFreak (talk) 08:32, 8 June 2024 (UTC)

Personally I think we should ignore this data point about Ido, because it seems a priori unlikely, as others have pointed out. Claims about native and total speakers are habitually inflated, e.g. someone insists on putting back into Wikipedia the claim that there are over 200 million total speakers of Swahili, based on a single questionable reference and in contradiction to all other references; I have deleted this info several times but it keeps getting put back, and I don't have the energy to fight this. Benwing2 (talk) 22:33, 6 June 2024 (UTC)Reply

True that. Same goes for Esperanto, by the way: the ridiculously high number of 2 million speakers (sometimes even 10 million) keeps popping up regularly, even though it was refuted already a long time ago. Today we know that even a number of 100,000 is probably way too optimistic. Same goes for those one or two thousand so-called native speakers. Usually, such figures come from sources with an interest in inflating them. However, that cannot be said of those figures from Finland. Instead of drawing conclusions based on suppositions, shouldn't we at least ask them where those 26 native speakers come from? IJzeren Jan (talk) 23:25, 6 June 2024 (UTC)Reply

synthesized audio files

Do we have a policy on this? I have encountered some, e.g. at inconsequential the audio is explicitly labeled "CA synth", which I take to mean synthesized Canadian. Although it's now possible to synthesize realistic sounding text-to-speech audio, this particular audio sounds very artificial to me, and I think it doesn't belong. Even for realistic-sounding audio, I'm skeptical. Here are some other words with audio labeled "CA synth": extraterrestrial, catamaran, angst, centralization, depolarization, disorganization, amnesia, counterfactual, homily, atherosclerosis, icicle, ecclesiastical, enclose, intruder, gasp, entitle, grievance, goose flesh, biodiversity, lethargic, hyperventilation, coliseum, macrobiotics, impracticality, autobiographical, disputant. An additional file labeled as ca-synth in the filename but not the caption occurs in isolationism. Some of these have additional non-synthesized audio files, some don't. Benwing2 (talk) 04:04, 5 June 2024 (UTC)Reply

Our general rule (not sure how much of a policy it is) is that audio pronunciations ought to be recorded by native speakers of the languages—a machine is admittedly not a native speaker of any language. Some time ago I came across a bunch of synthesized audio files on English entries (around 20–30 IIRC) that were all created by a single Commons user years ago and then a few years later were automatically added by a bot (User:DerbethBot) that was adding missing Commons audio recordings not on the entries. The quality of the audios was really poor and many weren't even correct, so I went ahead and removed them. I admit perhaps I should’ve brought up the matter here, but it seemed pretty clear-cut to me that they had to go as, with an audio recording, one would expect the voice of an actual native human speaker. Even with better quality recordings and as voice synthesis technology gets better and better (particularly with the AI stuff), I think we should still try to supply authentic human recordings as any voice synthesis services will be available to the readers elsewhere. lattermint (talk) 05:05, 5 June 2024 (UTC)Reply

I support removing these and sticking a note somewhere that people shouldn't add synthesized audios as pronunciations. (OTOH, if someone could add a synthesized audio to e.g. voice synthesis as a T:examples type thing, that could actually be appropriate use of synthesized audio, ha.) I suspect these will become an increasingly common issue unfortunately; Commons is similarly dealing with low-quality AI art being added. - -sche (discuss) 05:33, 5 June 2024 (UTC)Reply

Agree with lattermint and -sche. Vininn126 (talk) 05:46, 5 June 2024 (UTC)Reply

I’ve encountered these before. Not a fan. Among other things a formal policy of ‘recordings should be of native speakers’ would help by automatically disqualifying this sort of thing. Nicodene (talk) 12:16, 5 June 2024 (UTC)Reply

We can formulate it this way: “recordings within pronunciation sections must be of native speakers”. But what about allowed conlangs without native speakers? For dead languages I would formulate “recordings for extinct languages are discouraged”, to leave room for interpretation, mine being that a recording is tolerated for practical purposes if it would pass as native to someone unaware that the language is extinct. We will easily find agreement on a statement about recordings of natural languages that have native speakers. Fay Freak (talk) 13:19, 5 June 2024 (UTC)

Perhaps it could be phrased as 'If a language has a large body of native speakers, any recording should be of a native speaker. If a language is extinct or constructed but has a large body of non-native speakers, any recording should be of a proficient speaker using one of the conventional pronunciations’. Nicodene (talk) 20:09, 5 June 2024 (UTC)Reply

it reminds me of the radio recordings of traffic conditions i used to hear on the radio, where both the tone and the speed were out of step. outside of acute distress, we just dont speak that way. basically it sounds like someone who was abducted and is being forced to read a letter saying "no im fine please dont look for me im definitely okay i promise" —Soap— 12:50, 5 June 2024 (UTC)Reply

Phonology is also out of step; from native speakers announcing the stops in the tram I hear local toponyms being pronounced dodgily, since these speakers rarely get IPA transcriptions; even the pronunciations of municipality-level names from TV presenters are unreliable. It is impossible to guess and almost impossible to look up that Baumheide, Altenhagen, Hiddenhausen, Oerlinghausen but not Bad Oeynhausen, Ubbedissen and Asemissen are stressed at the second stem, and for Hövelhof I still don’t know whether ⟨v⟩ is /v/ or /f/ – an IP corrected it to /v/, but from a local public broadcaster’s newsreader I remembered it as /f/. There is a lot of confirmation bias and no source, let alone a reliable one, for de.Wikipedia’s claim that Hiddenhausen and Oerlinghausen are stressed at the onset. Fay Freak (talk) 13:19, 5 June 2024 (UTC)

Can this policy be generalized to deal with any low-quality audio? E.g. anything with disruptive background noise too. Also, rather than deleting, maybe it's more productive to aggressively categorize and label such files as needing replacement? Deleted low-quality audio samples won't just disappear from the net and may be re-added by less attentive editors or bots. It would probably help if https://lingualibre.org/wiki/User:Olafbot could prioritize replacing such known low-quality recordings over adding totally new audio samples. Pinging @Olaf in case he is interested in this discussion.

As for the possible replacement of the current artificially synthesized isolationism audio, the https://commons.wikimedia.org/wiki/Category:Lingua_Libre_pronunciation-eng?from=isolationism link lists one human-recorded sample. But it's a sample recorded by a native Mandarin Chinese speaker and, despite that, it's used by fr.wiktionary.org and pl.wiktionary.org. This is also not ideal in my opinion. --Ssvb (talk) 15:02, 5 June 2024 (UTC)

And to give an example, just listen to the click in the iridium audio sample. I encountered a lot of samples with similar or even worse defects, but can't easily find them offhand right now.

What is a Wiktionary editor supposed to do upon spotting such audio? Just simply let it be because it had been recorded by a native speaker? Remove it? Label it somehow? --Ssvb (talk) 18:03, 5 June 2024 (UTC)Reply

If the audio is particularly bad, please do bring it up for discussion (compare e.g. Wiktionary:Tea_room/2013/February#Dutch_enig.2C_Audio_file_file:Nl-enig.ogg; note that the current audio file is different than the one that was there when that discussion happened); if it's bad, people will probably agree on removing it; if a lot of files by a particular speaker have problems, we may want to remove them all systematically and try to 'blacklist' the user's files (Metaknowledge was spearheading a project to do this, but has been inactive). I like the idea of listing known bad recordings somewhere after removing them from entries, so they can hopefully be replaced (either outright overwritten with a better file, or someone just records and uploads a separate file). - -sche (discuss) 18:19, 5 June 2024 (UTC)Reply

@-sche: I understand and fully agree with starting a public discussion when bad faith is suspected, such as covert vandalism or a person with an obvious accent having the audacity to pretend to be a native speaker.

However this is simply not workable in all other cases due to excessive bureaucracy involved. You already set the bar at "particularly bad", possibly to limit the scope and the amount of paperwork. But even "moderately bad" or "slightly bad" audio samples shouldn't be normally desirable. Using Lingua Libre, it's possible to easily record more than 100 audio samples in less than one hour if one is up to it. Another factor is that the beginners are likely to systematically record and upload a certain percentage of low quality audio samples simply due to lack of experience. Jumping the gun to harass or blacklist the users for this is also counterproductive, because this is a sure way to lose a potentially valuable contributor.

My suggestion is to simply add a new parameter to Template:audio for flagging low quality audio samples. The parameter value can be a short text description: "wrong accent", "noise", "clipped", "muffled", "synthetic", etc. So that when I encounter a bad quality audio sample, I can just spend a few seconds on a quick edit to flag it. When this process is established, the problematic words can be automatically added to a Lingua Libre list, similar to this one: https://lingualibre.org/wiki/List:Eng/Lemmas-without-audio-sorted-by-number-of-wiktionaries (so that the Lingua Libre contributors know what to prioritize when recording their audio samples).

Please look at the same iridium audio sample again. Is it so bad that it needs an urgent removal or a public discussion? Maybe not. Would it be a good idea to eventually replace it? Yes, of course. --Ssvb (talk) 17:14, 6 June 2024 (UTC)Reply

@-sche: And here's another example: the Ukrainian хлор (xlor) sounds almost indistinguishable from хор (xor) because the sound "л" is missing.

It's interesting that another audio sample also recorded by @Tohaomg drops a different sound ("р" instead of "л") in хлорметан. And I can actually hear a click in place of the missing sound. After searching a bit, I found a known problem https://lingualibre.org/wiki/LinguaLibre:Technical_board/Audio_click_bug#HIGH_PRIORITY:_Audio_recordings_have_dust_and_clicks, which was likely fixed only in 2023.

Anyway, even though the audio samples likely got corrupted because of a bug in the recorder application, I see this primarily as a QA issue. Corrupted audios shouldn't normally be uploaded to commons.wikimedia.org by the person who recorded them. --Ssvb (talk) 02:47, 9 June 2024 (UTC)

@Benwing2: What's your opinion? Should we label problematic audio samples as |a=synthetic or |a=defective? Or a better solution is needed? --Ssvb (talk) 03:01, 9 June 2024 (UTC)Reply

@Ssvb I added support for a |bad= parameter for labeling bad audio recordings with arbitrary text. You can see it in action in User:Benwing2/test-audio (specifically, the last example under the "Production" section). The "bad recording" note should appear boldfaced in red, but it may take 5-10 minutes for it to appear this way as I just added the appropriate specs to MediaWiki:Common.css for this and it takes a few minutes after doing so for the changes to propagate. Let me know if this is helpful or if you want some other param. Benwing2 (talk) 03:16, 9 June 2024 (UTC)Reply

Also, uses of this param are currently tracked using the WT:Tracking mechanism, by visiting Special:WhatLinksHere/Wiktionary:Tracking/audio/bad-audio and Special:WhatLinksHere/Wiktionary:Tracking/audio/bad-audio/LANG for a specific lang code. Maybe this should be made into a category. Benwing2 (talk) 03:19, 9 June 2024 (UTC)Reply
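To make the "to-be-replaced list" idea concrete, here is a minimal Python sketch of how flagged recordings could be pulled out of entry wikitext and grouped by language. It only assumes that calls look like `{{audio|LANG|FILE|...|bad=REASON}}`; the function name `flagged_audio` and the sample file names are made up for illustration, and this is not how the tracking mechanism above is actually implemented.

```python
import re
from collections import defaultdict

# Assumed call shape: {{audio|LANG|FILE|...|bad=REASON}}.
# Nested templates or links containing "|" are not handled by this sketch.
AUDIO_CALL = re.compile(r"\{\{audio\|([^{}]*)\}\}")


def flagged_audio(wikitext):
    """Return {lang: [(file, reason), ...]} for audio calls that carry bad=."""
    found = defaultdict(list)
    for match in AUDIO_CALL.finditer(wikitext):
        positional, named = [], {}
        for part in match.group(1).split("|"):
            key, sep, value = part.partition("=")
            if sep:
                named[key.strip()] = value.strip()
            else:
                positional.append(part.strip())
        # Only keep calls that have both a language code and a file name.
        if "bad" in named and len(positional) >= 2:
            found[positional[0]].append((positional[1], named["bad"]))
    return dict(found)


# Hypothetical sample wikitext; the file names are placeholders.
sample = (
    "{{audio|uk|Example-хлор.wav|bad=missing sound}}\n"
    "{{audio|en|Example-iridium.ogg|a=US}}\n"
    "{{audio|en|Example-isolationism.ogg|bad=synthetic}}\n"
)
print(flagged_audio(sample))
```

A per-language grouping like this is roughly what a Lingua Libre priority list would need as input, though a real bot would work from a proper wikitext parser rather than a regular expression.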

I wouldn't be a priori opposed to artificially produced audios if native speakers can vouch for their sounding natural. --Lambiam 14:28, 6 June 2024 (UTC)Reply

@Lambiam: This is a slippery slope and it's not always easy to tell the difference between natural and non-natural. Audios may be just slightly unnatural and people would hesitate to discard them. That said, I don't mind having synthetic audios as a temporary placeholder, but only if they are always clearly labelled as such. And only if they are added to a publicly visible to-be-replaced list. --Ssvb (talk) 17:25, 6 June 2024 (UTC)Reply

I have no strong feelings about this, but note that occasionally an audio file presumably produced by flesh-and-blood native speaker may sound off as well (and sometimes even plainly wrong). In the end, whatever the means of production, ~~the capitalist will appropriate the surplus value~~ the quality of the result needs to be assessed and ensured by native speakers. --Lambiam 17:34, 6 June 2024 (UTC)Reply

Sure, not everyone is a professional voice actor. But a synthetic audio is like a synthetic flower. Some aspects of it are as good or even better than the real thing. Yet the other aspects are different, possibly in a subtle way. --Ssvb (talk) 19:10, 6 June 2024 (UTC)Reply

Hard against. Vininn126 (talk) 17:30, 6 June 2024 (UTC)Reply

Support removing synthesized audio. — SAMEER (؂・؄・؏) 04:21, 9 June 2024 (UTC)Reply

Anti-intensifiers and the epidemic of British meiosis


At the moment our entry for maybe lists the following sense:

(UK, meiosis) Certainly

Similarly, we find the following under a bit:

(UK, meiosis) Very.
(UK, meiosis) A lot.

and the following under somewhat:

(UK, meiosis) Very

The problem is that, as far as I am aware, every single word or phrase that carries a sense of moderation is fair game for ‘meiosis’. In no particular order I cite slight, modest, mild, decent, small, minor, light (adj.); relatively, perhaps, to some extent, fairly, a little, possibly, not exactly; might, could, seems; scuffle, tiff, misunderstanding.

The flipside of this is that people can and (especially in the UK) do assign sarcastic senses to any word that denotes a positive quality: genius, fantastic, brave, brilliant, revolutionary, creative, and so on.

Both meiosis and sarcasm are, I think, cultural/metalinguistic and as such beyond the purview of a dictionary. Nicodene (talk) 00:24, 6 June 2024 (UTC)Reply

Are there examples where words actually acquired new meanings through meiosis? What exactly distinguishes it from understatement? —Caoimhin ceallach (talk) 00:46, 7 June 2024 (UTC)

Not that I'm aware of, and I don't believe there is a difference other than meiosis coming off as ‘a bit’ affected. Nicodene (talk) 00:53, 7 June 2024 (UTC)Reply

I do think there is a place for meiosis in wiktionary as it can contribute to etymology. As mentioned in the wikipedia article on meiosis, the Australian 'outback' is one example where the word did acquire a new meaning through meiosis. It was originally used as a meiotic comparison to the back yard of a house, but is now commonly used without that comparison in mind. That said, I would agree that meiosis should not be included as an additional sense in each of the entries you referenced. If meiotic or sarcastic senses are to be included at all, I suggest it be in usage notes, as in nice. Pangur Bán & I (talk) 06:21, 9 June 2024 (UTC)Reply

On balance I think you are right, this is not worth a separate sense line, at least in those entries. (As Pangur says, there are cases where meiosis seems to become lexical, like pond.) I suppose the (small-c) conservative thing to do would be to conserve the information in a usex like the one showing sarcastic use at Sherlock, or move the quotes under the 'regular' sense. Indeed, it is nonobvious to me how one discerns that the "somewhat weatherbeaten" quote at somewhat is meiosis, anyway; do I need to have knowledge of the real condition of the train in question to know the writer is understating its weatherbeatenness, and out of meiotic intent rather than misassessment? - -sche (discuss) 16:31, 18 June 2024 (UTC)

Kyakhta Russian–Chinese Pidgin


I suggest adding Kyakhta Russian–Chinese Pidgin alongside the existing pidgins based on Russian: Mednyj Aleut (mud), Russenorsk (crp-rsn), Solombala English (crp-slb), Taimyr Pidgin Russian (crp-tpr). AshFox (talk) 08:18, 6 June 2024 (UTC)

Support Protegmatic (talk) 18:13, 8 June 2024 (UTC)Reply

Do you have any plan for how to make the entries for Kyakhtian? The only clear feature it has is the suffix -la, and you can't even always use it. There are no clear records of its grammar, spelling or pronunciation. Also, I remember rumors that there are some Chinese records of this pidgin, and there were some problems with them as well. I have thought about this pidgin for a long time and just gave up, because I am not sure how to make it structured enough for Wiktionary. Anyway, good luck with this work if you decide to do it. The pidgin is a mess, but it has many cool words worth mentioning on Wiktionary. Tollef Salemann (talk) 22:02, 8 June 2024 (UTC)

But as for adding its own language code, I fully support it. Tollef Salemann (talk) 22:04, 8 June 2024 (UTC)

Oh, yeah, there is also -shek-/-nek- instead of Russian -shk-/-nk- but I guess that it is just the Chinese pronunciation, and not really a pidgin grammar feature. Tollef Salemann (talk) 22:07, 8 June 2024 (UTC)Reply

Full stops after templates like {{synonym of}}


Should automatic full stops be added after templates used in definitions like {{clipping of}}, {{short for}}, and {{synonym of}} (with the option to turn it off)? @Sgconlaw suggested to make this discussion after I manually added one to sacrifice. J3133 (talk) 13:20, 6 June 2024 (UTC)Reply

Support (for English; separate discussion for other languages desirable). In fact, {{clipping of}} and {{short for}} already automatically add a full stop at the end. I think it makes sense to have a full stop automatically added for other templates like {{synonym of}} (with the option to turn it off in appropriate cases) for consistency with the earlier-mentioned templates, and because we treat our definitions for English entries like sentences, starting them with a capital letter and ending them with a full stop. — Sgconlaw (talk) 14:25, 6 June 2024 (UTC)Reply

Support Def templates usually benefit from having full stops.

Support for English, Oppose for other languages in light of Ben's argument. Vininn126 (talk) 09:30, 7 June 2024 (UTC)

Support (edited to add: for English only, in agreement with Benwing below); IMO, the templates should use the langcode to format themselves the way definitions are formatted, capital + period for English, lowercase + no period for other languages. That might need separate/more discussion, because some people disagree and want capital + period for all languages/definitions, or want lowercase + no period for all langs/definitions, but in any event having some [but only some] templates do "capital but no period" is not consistent with anything. Let's check for cases these are followed by (a) a manual period a bot should remove if we have the template start supplying them, or (b) something else, such as another template or gloss, which we / a bot might solve by adding nodot= — I sometimes see (or do!) things like "{{altcase|en|fooh}}: {{altform|en|foo}}". - -sche (discuss) 17:54, 6 June 2024 (UTC)Reply

Oppose. It's trivial to add punctuation: it's one keystroke. It's comparatively cumbersome to use parameters to disable unwanted punctuation: |nodot=1. Not automatically outputting punctuation is a more flexible design, more user-friendly, and less obtuse.

There have been cases in the past where someone has gone in and added auto-punctuation to long-standing templates, requiring lots of manual editing to fix existing wikicode where the templates were used mid-sentence.

The time gains from auto-punctuation are trivial. The time losses are substantially larger. ‑‑ Eiríkr Útlendi │^{Tala við mig} 20:33, 6 June 2024 (UTC)Reply

Hmm, this is a testable argument: T:alternative form of (for example) is used on 174,660 pages, so the cost of adding the missing period to them all would in fact not be 1 keystroke, but 174,660 keystrokes. [If we only add dots for English, the number will be lower but we also won't have to worry about adding nodot= to non-English entries... in any event, by ratio,] it seems like somewhere in the vicinity of 1/8th of affected entries would have to be putting other text (besides a period) after the template in order for the keystroke argument to support defaulting to no dot, rather than supporting defaulting to dots. Can anyone check what's the case? I suspect the number which are putting other text after the template is in fact far, far smaller than 1/8th, but I could be proven wrong! (In most cases, I think whatever display would be correct in the majority of cases should be the default display.) If we do decide the default display should be no dot, I hope someone will write a bot to add dots where they're missing, since at present this is not done and entries just sit around with their normal definitions and these templates looking inconsistent. - -sche (discuss) 21:29, 6 June 2024 (UTC)Reply
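The break-even point in the comment above can be worked out explicitly. This is a rough Python sketch that assumes, as the comment does, that the only costs are one keystroke for a manual period and the eight characters of `|nodot=1`, and that ignores entries which already have a manual period. The function names are made up for illustration.

```python
from fractions import Fraction

DOT_KEYSTROKES = len(".")            # typing a period by hand
NODOT_KEYSTROKES = len("|nodot=1")   # suppressing an automatic period


def break_even_share():
    """Share of uses needing *no* dot at which both defaults cost the same."""
    # dot default costs:    share * NODOT_KEYSTROKES per use
    # no-dot default costs: (1 - share) * DOT_KEYSTROKES per use
    return Fraction(DOT_KEYSTROKES, DOT_KEYSTROKES + NODOT_KEYSTROKES)


def keystroke_costs(pages, share_needing_no_dot):
    """Return (cost if dots are the default, cost if no dots are the default)."""
    cost_dot_default = pages * share_needing_no_dot * NODOT_KEYSTROKES
    cost_no_dot_default = pages * (1 - share_needing_no_dot) * DOT_KEYSTROKES
    return cost_dot_default, cost_no_dot_default


print(break_even_share())  # 1/9
# Illustrative only: the 1-in-20 share is a made-up figure, not a measurement.
print(keystroke_costs(174660, Fraction(1, 20)))
```

Under these assumptions the threshold comes out at 1/9, which is in the vicinity of the 1/8 quoted above: a default dot is the cheaper choice whenever fewer than one use in nine needs text after the template.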

On Translingual entries, I have been forced to add "nodot=1" for synonyms templates used within {{taxon}} (which has a default period) and for instances of "See {{specieslite}} and Wikipedia for other species". I'd be happy to forego the default period in {{taxon}} for the benefit of consistency in the need to consider the punctuation needs of the entry. DCDuring (talk) 15:12, 7 June 2024 (UTC)

Oppose per Eirikr. If this is an issue, I can start adding full stops to the entries I create, or someone can make a bot do that; an automatically printed full stop is always a whole headache to remove. Thadh (talk) 21:12, 6 June 2024 (UTC)

~~Strong oppose~~ Support for English, Strong oppose for other languages. I believe as a general rule that all form-of templates should auto-generate capital letters and final periods (full stops) only for English (if that), and should default to lowercase and no periods for all other languages. I have wanted to implement that for all form-of templates instead of the morass of randomness we currently have, but need to get consensus for it. Benwing2 (talk) 22:26, 6 June 2024 (UTC)

OK, I see User:-sche agrees with me, but has phrased it using "support". I think we should have a separate poll to implement this option. Benwing2 (talk) 22:28, 6 June 2024 (UTC)Reply

@Benwing2: mmm, isn't your view in support of J3133's proposal, at least where English is concerned? I also have no objection if it is felt that for non-English languages there should not be an initial capital letter or terminal full stop (though I'm unclear why). — Sgconlaw (talk) 22:29, 6 June 2024 (UTC)Reply

@Sgconlaw User:J3133 did not qualify their proposal with a restriction to English; I'm strongly opposed to making this a blanket addition to all languages, which is what the proposal suggests on its face value. Benwing2 (talk) 22:37, 6 June 2024 (UTC)Reply

@Benwing2: We (Sgconlaw and I) were discussing English entries, but I forgot to mention it. J3133 (talk) 06:05, 7 June 2024 (UTC)Reply

Abstain The current state of capitalisation and full stops in definitions is pretty chaotic. I think it doesn't make much sense to change some templates one way or the other before reaching consensus on punctuation for each type of definition (English and non-English, lemma and non-lemma, gloss and non-gloss). After deciding on that, the inclusion of automatic full stops in form-of templates may be worth another discussion. Personally, I think most (or all) non-gloss definitions, including those which use form-of templates, should have capital letters and full stops in both English and non-English entries. Einstein2 (talk) 23:07, 6 June 2024 (UTC)

Support for English, Oppose for other languages, in strong agreement with Benwing above. — Vorziblix (talk · contribs) 00:30, 7 June 2024 (UTC)

Support Per User:Benwing2 and User:-sche. Ioaxxere (talk) 04:28, 7 June 2024 (UTC)Reply

Support for English, Oppose for other languages. Same thing with capitalization (capitals for English, lowercase for other languages). However, there should be a "nodot" parameter on all of these. Sometimes it's useful to add information (that should be part of the same sentence) after the template. Andrew Sheedy (talk) 05:02, 7 June 2024 (UTC)

@Andrew Sheedy Agreed. Whenever a template auto-capitalizes or auto-adds a final period, there should be (and usually are) |nocap=1 and |nodot=1 params to disable the capitalization and auto-period. Benwing2 (talk) 06:23, 7 June 2024 (UTC)Reply

Support for English, Oppose for other languages (as this proposal’s initiator). J3133 (talk) 06:28, 7 June 2024 (UTC)

Abstain for English, given current practice which I would not have begun but now we have it, Oppose for other languages and strongly support the opposite. Fay Freak (talk) 09:53, 7 June 2024 (UTC)

Oppose for English and Translingual. Abstain for other languages. The nodot=1 option required to maintain flexibility in the use of the template is an annoyance. Prohibiting use of such templates except in prescribed cases and in prescribed manner needs some kind of justification that I haven't seen here. I hope we aren't going in the direction of "Everything that is not mandatory is forbidden." DCDuring (talk) 14:38, 7 June 2024 (UTC)

Somewhat oppose for all languages, strong oppose having English as a special case apart from other languages. It not only might confuse editors, it will definitely confuse editors. — SURJECTION ^{/ T / C / L /} 21:56, 7 June 2024 (UTC)

Oppose -- Sokkjō 05:51, 9 June 2024 (UTC)Reply

Comment - it appears we have consensus to not include final full stops/periods (and probably not initial capitalization either) in non-English form-of templates, but no obvious consensus for English form-of templates. It appears there are two options, either include them by default with English or don't include them. The latter makes English consistent with non-English, but the former is closer to existing practice (in many cases, at least). Some thoughts:

Would it make a difference in your voting if there were a one-character way of turning off initial-caps and/or final period/full-stop? For example, a symbol like ^ or > (just brainstorming here, maybe there are better symbols, and it doesn't have to be the same symbol at the beginning and the end) could be placed at the beginning to suppress the initial caps and at the end to suppress the final period.
How strongly do you feel about the inclusion or non-inclusion of initial caps and final periods for English? (E.g. for me, I could go either way with English; what I feel strongly about is that initial caps and final periods should *NOT* be present for non-English.)

Benwing2 (talk) 09:18, 9 June 2024 (UTC)Reply
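For illustration only, the behaviour under discussion could be sketched in Python as below. It assumes the "include by default for English" option (which has no settled consensus above) together with the existing `nocap`/`nodot` switches, and it treats `^` as the one-character suppressor purely because it was one of the brainstormed symbols. The function names `format_form_of` and `split_markers` are hypothetical and do not correspond to any actual module.

```python
def format_form_of(lang, text, nocap=False, nodot=False):
    """Format a form-of definition: capital + period for English only."""
    english = lang == "en"
    if english and not nocap and text:
        text = text[0].upper() + text[1:]
    if english and not nodot:
        text += "."
    return text


def split_markers(raw, marker="^"):
    """Strip a leading/trailing marker and report it as (text, nocap, nodot).

    The marker character is a placeholder; nothing above settles on one.
    """
    nocap = raw.startswith(marker)
    if nocap:
        raw = raw[len(marker):]
    nodot = raw.endswith(marker)
    if nodot:
        raw = raw[:-len(marker)]
    return raw, nocap, nodot


print(format_form_of("en", "synonym of foo"))              # Synonym of foo.
print(format_form_of("it", "synonym of foo"))              # synonym of foo
print(format_form_of("en", "synonym of foo", nodot=True))  # Synonym of foo

text, nocap, nodot = split_markers("^synonym of foo^")
print(format_form_of("en", text, nocap=nocap, nodot=nodot))  # synonym of foo
```

The point of the marker variant is only that suppressing a default would cost one keystroke instead of eight; whether that changes anyone's vote is the question asked above.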

Assuming we are still talking about "templates like {{synonym of}}", I am considering abandoning their use embedded in {{taxon}} because {{syn of}} doesn't accept nocap=1. If the final period is mandated, then there would also be an extra period. DCDuring (talk) 19:25, 9 June 2024 (UTC)Reply

@DCDuring Are you sure that {{syn of}} doesn't accept |nocap=1? It's documented to accept it and internally it sets |withcap=1, which simultaneously turns on initial capitalization and adds a |nocap= option to turn it off. Also as mentioned above, I am thinking of adding a feature to make it easier (fewer keystrokes) to turn off the initial caps/final period. Benwing2 (talk) 19:39, 9 June 2024 (UTC)Reply

I'll try again. See Geomalia for the look at present. DCDuring (talk) 20:42, 9 June 2024 (UTC)Reply

@DCDuring Looks good to me. Benwing2 (talk) 20:51, 9 June 2024 (UTC)Reply

That's after I corrected my error. Previously: [41]. I still wish that the italics could be removed (optionally: noi=1), so italicized taxa could appear italicized in Translingual/taxonomic definitions that use {{syn of}} within {{taxon}}. DCDuring (talk) 21:34, 9 June 2024 (UTC)Reply

@DCDuring I could implement that, although at that point I wonder if it wouldn't be better just to manually write out "synonym of"; the template doesn't categorize so there seems little point in using it if you have to add a bunch of flags to get non-default behavior. Benwing2 (talk) 21:54, 9 June 2024 (UTC)Reply

It might be better for me to fork out {{taxonsyn}} for the increasing number of cases where I embed a synonymous taxon in a definition within {{taxon}}. Those 'lesser' taxon definitions merit less detail than 'real' taxon definitions, so excluding them from searches for 'incomplete'/improvable entries is desirable. The formatting peculiarities of taxa derived from italicization (and perhaps the meaning of synonym) may justify otherwise undesirable forking. DCDuring (talk) 22:12, 9 June 2024 (UTC)

I am not familiar with the ins and outs of taxon formatting, but in general if you are doing something repeatedly, it makes sense to have a dedicated template or template parameter for it. Benwing2 (talk) 22:35, 9 June 2024 (UTC)

Support for English,

Oppose for other languages. @Benwing2 I note many (not all) of the full oppose votes are from editors who don’t edit English, who may have not realised the growing consensus for separate options. Theknightwho (talk) 11:18, 9 June 2024 (UTC)Reply

Oppose for English,

Support for other languages. P U C – 13:20, 9 June 2024 (UTC)Reply

@PUC: But we (including you) do not use full stops for non-English definitions. J3133 (talk) 13:33, 9 June 2024 (UTC)Reply

He's goofin. Vininn126 (talk) 13:38, 9 June 2024 (UTC)

I feel strongly about having all English definitions start in a capital letter and end in a period, because my initial reason for becoming a Wiktionary editor was to rectify inconsistencies like that. But I would also like it to be based on consensus and not be forced on people who feel strongly the other way. Andrew Sheedy (talk) 18:16, 9 June 2024 (UTC)Reply

Support. Imetsia (talk (more)) 22:22, 9 June 2024 (UTC)

@Imetsia Can you clarify? What do you support exactly, and for English or non-English? Benwing2 (talk) 22:35, 9 June 2024 (UTC)Reply

I support automatically adding full stops in templates like {{synonym of}}, for both English and non-English. I've wanted this for Italian entries for quite a while by now. Imetsia (talk (more)) 22:49, 9 June 2024 (UTC)Reply

Support, with the automation that nodot be enabled for non-English by default. While at it, I also support nocap be enabled for non-English by default but I have not seen editors follow that convention as rigorously (I do, though, since I was told it is prescribed to do so). Svartava (talk) 07:11, 12 June 2024 (UTC)
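
For illustration, the per-language defaults discussed in this thread (initial capital and final period on for English, off elsewhere, with nocap and nodot as overrides) could be sketched roughly as follows. This is only a Python sketch under stated assumptions: the function name `format_form_of` is hypothetical, and the real templates are backed by Lua modules whose logic may well differ.

```python
def format_form_of(text, lang, nocap=None, nodot=None):
    """Format a form-of definition such as 'synonym of foo'.

    Hypothetical helper, not the actual template logic.
    nocap/nodot left as None fall back to a per-language default:
    English gets the initial capital and final period, other languages do not.
    """
    is_english = lang == "en"
    if nocap is None:
        nocap = not is_english
    if nodot is None:
        nodot = not is_english

    if not nocap and text:
        # Capitalize only the first character; leave the rest untouched.
        text = text[0].upper() + text[1:]
    if not nodot and not text.endswith("."):
        text += "."
    return text
```

Under this sketch an editor can still override either default explicitly, for example passing nocap=True on an English entry or nodot=False on an Italian one.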

Oppose. Full stops are a nuisance where they are included in the template, especially when adding text after it, and a comma is needed. DonnanZ (talk) 23:47, 12 June 2024 (UTC)Reply

Old Franconian

Old Franconian is a language variety derived from Frankish, and has many languages within West Central German like Luxembourgish, Rhine Franconian, East Franconian, and Central Franconian. See this. That Northern Irish Historian (talk) 03:49, 7 June 2024 (UTC)Reply

Collapsible lists within definitions

I propose that for cases in which definitions include lists (especially long ones), it be adopted as best practice to make said lists collapsible with a template such as those existing for quotations and semantic relationships (or one based on a code I cobbled together to attempt this for the list of place names in Eden). I believe this would be worthwhile to help streamline some unwieldy pages, prioritizing definitions and relationships. @Soap @Ioaxxere – Pangur Bán & I (talk) 21:41, 7 June 2024 (UTC)Reply

I support this. Hopefully if we approve this we can base it on existing code like that of {{collapse}} or {{collapse-top}} (neither of which will work inside a list as of yet) so that it can be guaranteed to work on all browsers. —Soap— 21:52, 7 June 2024 (UTC)Reply

@Soap, you said before that you would be willing to assist me in drafting this proposal. What are your thoughts on this in light of DCDuring's opposition? Pangur Bán & I (talk) 22:19, 17 June 2024 (UTC)Reply

I only meant I could help start the post since you're a new user and I felt you might be too shy to come here outright. But you have a good understanding of the issue and how to express yourself, so right now I don't have anything else to add. —Soap— 17:44, 18 June 2024 (UTC)

I gotcha. Thanks anyway! Pangur Bán & I (talk) 17:50, 18 June 2024 (UTC)

Support although there aren't that many pages where collapsed definitions are worth using. Maybe Mandarin màn (consider someone searching for the Vietnamese entry)? Ioaxxere (talk) 22:48, 7 June 2024 (UTC)Reply

Thank you for your support. To your point, as noted by DCDuring, this does seem to be primarily an issue with toponyms. See entries like Chester, Richmond, Franklin, and Weston for a few examples of bloated pages where I think collapsible lists of subsenses would be worth using. Pangur Bán & I (talk) 15:54, 17 June 2024 (UTC)

Support but I don't want it to make a box around the sub definitions when you expand it, cuz I think that's kinda ugly. That is, if it's even possible to do that. — SAMEER (؂・؄・؏) 04:25, 8 June 2024 (UTC)Reply

That is absolutely possible. I gave my makeshift collapsible list a border just to make it visually distinct, but in hindsight I think it would make more sense for something like this to more closely follow the style of the semantic relations and quote templates, just in a bulleted or numbered list format unlike those. Pangur Bán & I (talk) 16:20, 17 June 2024 (UTC)Reply

~~Abstain~~

Oppose Not a complete proposal. It's just based on the Eden anecdote. By my lights it would have to be restricted to definitions formatted as subsenses. As nobody seems to have analyzed the cases, perhaps we should wait to see how it would be applied to toponyms for now. DCDuring (talk) 15:12, 8 June 2024 (UTC)Reply

@User:Geographyinitiative Any thoughts? DCDuring (talk) 20:58, 17 June 2024 (UTC)

I really have no opinion on the proposal. I will be fine with it if you do it. Please compare Washington County on Wiktionary with Washington County on Wikipedia and Category:Washington County on Wikimedia Commons. The solution sounds like an innovation beyond Wikipedia and Commons. I would want to find out if this has been discussed in Wikipedia, etc. Geographyinitiative (talk) 23:04, 17 June 2024 (UTC)

But what's the benefit in the case of Washington County? There is only one screenful of total content in the entry. DCDuring (talk) 16:50, 18 June 2024 (UTC)

Apologies for my perhaps poor phrasing. I would be absolutely fine with amending this proposition to be restricted to definitions formatted as subsenses and I would even support having a toponym-specific template. Though, I would still be in favor of having one for subsenses more generally as I think that would allow some editor freedom without any cost that I can see. Any thoughts? Pangur Bán & I (talk) 15:51, 17 June 2024 (UTC)Reply

How many non-toponym entries would benefit from this? What criteria are to be applied, eg, number of subsenses, total number of definitions in PoS section, nature of supersense definition (Some are purely hypothetical for purpose of grouping. @User:-sche)? Others may have more questions and issues. I feel this might need a formal vote, not just a straw poll on this page. DCDuring (talk) 16:47, 17 June 2024 (UTC)Reply

Your concerns about a general subsenses template are absolutely worth discussing, but before we move on to that, would you definitely support a toponym-specific collapsed-list template in the vein of the formatting of in-line collapsed quotations, and hypernyms, meronyms, etc. (but formatted as a bulleted or numbered list)?

Once the details are more hammered-out, a formal vote sounds like a great idea. My main trouble is that I don't have the coding knowhow to do a good job writing the template I'm envisioning. I don't know how I would go about producing a comprehensive count of how many entries would benefit, but block, cross, finger, head, stand, slash, and band are just a few non-toponyms I've found that I think could potentially use collapsible subsenses. As for requisite criteria for use, if you have any specific suggestions I'd genuinely love the help in fleshing out this proposal. The existence of two or more items seems to be the only hard criterion for quotations formatting and semantic relations templates, which seem fine models for something like this, but I'm happy to consider alternatives. Based on this poll, it would certainly seem that there is some interest in this functionality, and if it does reach the point of a formal vote, different options for potential criteria could easily be offered. Pangur Bán & I (talk) 18:51, 17 June 2024 (UTC)Reply

If we don't begin to address the issues now, then it will not be possible to draft a meaningful proposal. At head we have two levels of subsenses. The first definition is "The part of the body of an animal or human which contains the brain, mouth and main sense organs.". Under this definition, the first subsense layer consists of two non-definitions: "(people) To do with heads." and "(animal) To do with heads." Would that first layer be visible or not under a yet-to-be specified proposal? DCDuring (talk) 20:58, 17 June 2024 (UTC)

I'm not set on anything and am entirely willing to continue workshopping this proposal. In pages like head, perhaps subsenses hosting a second layer of subsenses should not be collapsible under this prospective template. I see no problem with that if that's what you're suggesting. Pangur Bán & I (talk) 22:19, 17 June 2024 (UTC)Reply

Neither of the two members of the first subsense layer at the first definition of head is a real definition. The first definition itself does not necessarily suggest the range of definitions at the second layer of subsenses. To me this is a specific sign that hiding subsenses can make it harder for less experienced users to find less common definitions. DCDuring (talk) 16:47, 18 June 2024 (UTC)

Support. Imetsia (talk (more)) 22:23, 9 June 2024 (UTC)

I am ambivalent about the idea of doing this to placenames, or long lists of Chinese "romanization of"s as also suggested above; I would not support collapsing 'real' definitions e.g. at take, even if there are very many. It seems like the number of placename entries which really have so many senses as to merit collapsing is small, and it seems like the sort of person who'd go to màn#Mandarin is someone interested in learning what it's a romanization of: why else wouldn't they go to or click through in the TOC to màn#Vietnamese? so collapsing just adds an extra step for them. In general it does not seem like that much of a hassle to scroll past placenames one is uninterested in. Whereas, collapsed content is easily missed, even by veteran editors who know to look for it (I myself often missed the existence of various inline -nyms under definitions back when they were autocollapsed, and have seen other veteran users miss collapsed etymology content), let alone new users. So I am ambivalent, leaning against it. - -sche (discuss) 16:15, 18 June 2024 (UTC)Reply

Thank you for your consideration and your well articulated concerns. I have no opinion on the "romanization of" example as that's not something with which I have any experience myself, and in hindsight I do think my original proposal here is likely too broad. I don't really want every list of subsenses to be collapsed, but rather for this to be available as a tool in situations where it may be truly helpful, its usage being determined via consensus for edge cases.

For 'real' definitions, I would agree that genuinely distinct subsenses such as those in take probably shouldn't be collapsed, at least not by default (I think making them open by default but with the ability to collapse them could still be useful). I really had in mind entries wherein the "subsenses" are really just examples, which can be seen in some of the examples I cited (block, head#Noun sense 2, etc.).

My issue is less that it is inconvenient to scroll past them per se, but rather that lists of examples subordinate to the most common senses are effectively privileged over secondary senses that can be more prevalent/noteworthy than items in those lists. This, I think, is not conducive to efficiently absorbing the information, and rather counter to the purpose of ordering senses in the first place. Pangur Bán & I (talk) 17:30, 18 June 2024 (UTC)Reply

We only have opinions, not facts, about the relative frequency of use of different definitions, the relative frequency of requests for different definitions, even of the time-period of use of definitions. I have trouble justifying the privileging of some contributor(s) opinions about what is to be listed first and what de-privileged by being rendered into subsenses. I also have trouble understanding why we discuss this in terms of the rights and privileges of definitions. Our concern is merely with users and their ability to navigate an essentially linear presentation of data, in which some data necessarily precedes other data. I'm afraid that tradeoffs are inevitable and that we have little reasoned basis to make them in general. DCDuring (talk) 20:47, 18 June 2024 (UTC)Reply

I suppose 'privilege' was ill-chosen here. I meant 'prioritize', in the sense of placing one thing before another in sequence. As for privileging contributors' opinions about what is listed first, every Wiktionary entry that includes multiple senses already does that, in accordance with WT:SG#Definition sequence, with the frequency-based order determined via consensus, exactly as I'm proposing the usage of this template be. My concern is also with users and their ability to navigate the information, which is precisely why I am proposing this. Pangur Bán & I (talk) 21:41, 18 June 2024 (UTC)Reply

User:Ioaxxere/MTE glossary

I invite you to check out this new glossary format. Using JavaScript (User:Ioaxxere/auto-glossary.js), it automatically scrapes every entry in a certain category and finds definitions containing a certain label. To see the output, you will have to add the line importScript("User:Ioaxxere/auto-glossary.js"); into your common.js page. Here's what the output looks like: https://imgur.com/a/kKQLGSG.

I propose that we create more of these automatically-generated glossaries in Appendix space, as I think that they are very useful for keeping track of a certain category. Ioaxxere (talk) 05:45, 9 June 2024 (UTC)Reply
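
As a rough picture of what such a glossary generator does, the filtering step (keeping only definition lines that carry a given label) might look something like the sketch below. It is written in Python rather than JavaScript purely for illustration and is not the code of User:Ioaxxere/auto-glossary.js. The function name, the in-memory `entries` mapping and the assumption that labels appear as `{{lb|en|...}}` on lines starting with "# " are all simplifications, and fetching the category members is left out entirely.

```python
import re

# Matches a label template such as {{lb|en|slang|dated}} (assumed wikitext shape).
LABEL_RE = re.compile(r"\{\{(?:lb|lbl|label)\|([^|}]+)\|([^}]*)\}\}")


def definitions_with_label(entries, label, lang="en"):
    """Return {title: [definition, ...]} for definitions tagged with `label`.

    `entries` maps a page title to its wikitext lines (hypothetical input;
    a real tool would fetch these for every page in a category).
    """
    glossary = {}
    for title, lines in entries.items():
        hits = []
        for line in lines:
            # Top-level definition lines start with "# "; skip usexes, quotes, etc.
            if not line.startswith("# "):
                continue
            for match in LABEL_RE.finditer(line):
                labels = [part.strip() for part in match.group(2).split("|")]
                if match.group(1) == lang and label in labels:
                    hits.append(line[2:])
                    break
        if hits:
            glossary[title] = hits
    return glossary
```

Called with a small hand-made mapping of titles to wikitext lines, this returns only the pages and definitions that carry the requested label.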

Yeah, they should be efficient search engine spam, people land on when searching slang words. Fay Freak (talk) 08:20, 9 June 2024 (UTC)Reply

Neat. Vininn126 (talk) 18:38, 9 June 2024 (UTC)

standardizing the form of phrase lemmas

This is based on a discussion in WT:RFM originally concerning tail wagging the dog, which someone proposed moving to the tail wags the dog. User:Theknightwho asked about general conventions, and I suggested the following:

1. try to avoid "one" or "someone" in a lemma unless it's unavoidable, e.g. it's in the possessive; so kiss goodbye not kiss one goodbye or kiss someone goodbye;
2. if "one" or "someone" needs to be expressed, use "one" if it is the same as the subject, "someone" otherwise; hence kiss one's ass goodbye is correct, not kiss someone's ass goodbye; take someone's word for it is correct, not take one's word for it (which is correctly a redirect); but someone's ass off should be one's ass off (the latter is incorrectly a redirect to the former);
3. use the infinitive for verbs occurring at the beginning of an expression (in a verb-object phrase), but the simple present for verbs occurring with a subject (hence the tail wags the dog not the tail wagging the dog; time stands still not time stood still, time standing still, time stand still, etc.);
4. there should be something about whether to include the word "the", e.g. in tail wags the dog or the tail wags the dog.

User:DCDuring asked:

Those seem like good rules to me. There is an interaction with what I think is our preference not to have headwords with leading the. Also, to clarify, when you say infinitive you mean the 'bare infinitive', not the 'to infinitive'. When should something be used instead of someone? (Does it depend on the relative frequency of use of the expression with non-gendered things? Threshold?) Are there circumstances in which we would go with a different lemma headword? Should we have alt form entries for some of the inflected and other variant forms or just hard redirects? I don't know how complete we should try to be. Too much detail might delay implementation and course correction. DCDuring (talk) 01:53, 7 June 2024 (UTC)

To which I replied:

These are good questions. You are right that I mean "bare infinitive" rather than "to-infinitive". As for something vs. someone, I think if it can reasonably occur with both, one should be a soft redirect to the other. Generally I prefer soft redirects over hard redirects, although I understand that hard redirects are easier to enter. Another issue is, what's the inanimate equivalent of one's? Is it its? I will bring these rules to the BP and see what people say. Benwing2 (talk) 03:01, 7 June 2024 (UTC)Reply

The suggestion is to put these in the WT:Style guide rather than WT:Entry layout (which requires a vote to make any substantive changes). Does anyone have any thoughts or additional suggestions for standardization rules? Benwing2 (talk) 09:06, 9 June 2024 (UTC)Reply

Definitely agree on point 2, which is WT:CFI#Pronouns already. Re point 4, Wiktionary:Tea room/2023/December#Proverb_entries_starting_with_"the" suggested more people want to include the in proverbs than don't (obviously only for phrases that can include the; nobody is moving →*the Rome wasn't built in a day), hopefully a wider discussion finds a wider consensus. (Maybe we can even determine whether to standardize the situation with short the X phrases / nouns: we have the bomb, but (after some TR discussion) the talk is a redirect to talk; the Netherlands redirects to Netherlands, but the Rock is an entry, and I don't think anyone would dream of moving The Hague.) I advocate redirects from whichever form we don't lemmatize to whichever we do. Point 3 seems reasonable; there too I advocate redirects from other common forms (e.g. the tail is wagging the dog). If we remove the object from the entry title (point 1), I hope we strongly encourage people to add usexes or citations showing where in the phrase the object goes, because sometimes it's [verb] [other word] someone and sometimes it's [verb] someone [other word] and sometimes it's other possibilities. - -sche (discuss) 17:29, 9 June 2024 (UTC)Reply

@-sche Thanks, and I completely agree with your idea of strongly encouraging the inclusion of usexes showing where the object goes. Sometimes even a single expression can go both ways; my canonical example for this is see through. For this example, we do include usexes for each sense, along with a usage note indicating that some senses take the object before through, some after. Maybe there is a way to standardize this? Benwing2 (talk) 18:40, 9 June 2024 (UTC)Reply

I don't have a strong opinion on how we lemmatise (though I see the merits of the cut down) but I agree that pronouns (and other arguments) are extremely important (and in the case of phrasal/particle verbs, also their relative positions), especially for learners, and would support a policy which requires mandatory marking of the arguments which a word/phrase takes, at least in the entry (via usex or similar), if not also in the headword. By way of illustration, compare the variants of the lemma turn on:

turn something on (“activate, start”; also possible in the order “turn on something”) (as far as I can tell, this is the only construction from these examples which can display ergativity, and thus can occur in the bare form, apparently without an object: "the coffee machine turned on [by itself] in the middle of the night"),
turn someone on (“excite, esp. sexually”),
turn on something (“revolve around, centre on”; also “activate, start”),
turn on someone (“unexpectedly attack or betray”; but IMO this order is ~~not possible~~ rather awkward in the meaning “excite sexually”).

In practice, this information may not be as obvious/readily accessible to editors as we might hope, since although I've probably used all the above examples before, the third and fourth examples only occurred to me after consulting a dictionary.

Edit: it occurs to me that the preferred order may also vary depending on whether the object is a pronoun or a noun.

Helrasincke (talk) 07:31, 19 June 2024 (UTC)Reply

I disagree slightly about the "avoid someone/one" rule - where an entry would be ambiguous, I think having the pronoun is better. For instance, I think leave someone holding the bag seems better than leave holding the bag, which could be misinterpreted as "to leave [a room] while holding the bag". Occasionally this is all that separates entries like get there vs get someone there. Smurrayinchester (talk) 19:05, 12 June 2024 (UTC)Reply

The right to bear ewes

A usual way of qualifying the restricted applicability of a verb sense is to have a label saying, of a .... For example, for the verb proceed:

6. (intransitive, of a rule) To be applicable or effective; to be valid.

Since the verb is intransitive, this can only refer to the subject of the verb. For transitive verbs, there is an ambiguity: does the restriction apply to the subject or the object of the verb?

Here is an example. At bear, Etymology 2, we see both

1.2. (transitive, of garments, pieces of jewellery, etc.) To wear.

and

1.3. (transitive, rarely intransitive, of a woman or female animal) To carry (offspring in the womb), to be pregnant (with).

Common sense tells us that the first sense does not mean to refer to diamonds wearing a smile and the second sense not to being pregnant with a ewe. But common sense may not be good enough in cases where both interpretations make sense.

Is there a way to disambiguate this that does not depend on common sense? --Lambiam 16:31, 9 June 2024 (UTC)Reply

Granting that this doesn't help someone who is unfamiliar with our subtle norms (and doesn't help if the norms aren't followed): in theory I think the nature of the restriction is supposed to be clarified by the form and placement of the restriction: "of..." labels precede the definition and restrict the subject, whereas restrictions on the object are supposed(?) to occur within the definition itself, not as a label, and not normally with "of" (although clearly this is not always followed, and maybe my sense of this is wrong!). Hence "To carry (offspring)" uses "(offspring)" to indicate that the thing in the womb is normally restricted to being offspring, and that if a surgeon left a surgical implement in a woman's womb after surgery she wouldn't normally be described as bearing it in this sense. (However, there was a discussion recently where a set of "of..." labels were moved—because they had been using {{a}} or {{q}} or manual formatting—from being in front of the definition, to being qualifiers after the definition, which made things [even] less standardized/predictable in this respect.) In theory we could make this explicit by saying things like "(SUBJECT is a pregnant person)", "+ OBJECT (offspring)" or something modelled on however we express objects being in the accusative-vs-dative (etc) already. - -sche (discuss) 17:45, 9 June 2024 (UTC)Reply

Agree with User:-sche here about using of (before or after the definition) to indicate subject restrictions, and parens after the definition without of to indicate object restrictions. Preposition restrictions should use {{+preo}}. Other sorts of predicate restrictions should use {{+obj}}. (I have a sandbox version of {{+obj}} that reworks it to support prepositions and such much better than {{+preo}} currently does; you can see examples at User:Benwing2/test-obj. At some point I will finish this and deploy it.) Benwing2 (talk) 18:45, 9 June 2024 (UTC)Reply

If I understand this correctly, a more appropriate way to express sense 1.2 of bear above is

1.2. (transitive) To wear (garments, pieces of jewellery, etc.).

It would be nice if this was documented in some form of guidance to creating good definitions.

But note that there is a slight problem in applying this to sense 1.1. We get

1.1 (transitive) To carry (weapons, flags or symbols of rank, office, etc.) upon one's person, especially visibly; to be equipped with (weapons, flags or symbols of rank, office, etc.).

(although I can't immediately think of a use covered by the second part not already covered by the first part). --Lambiam 19:09, 9 June 2024 (UTC)Reply

Batch editing Wiktionary with AWB

As discussed in February, there are cases where for both US and UK Englishes, the voiced alveolar approximant /ɹ/ is transcribed as the trill /r/. Our team at CUNY (myself and @Yaejunmyung) would like to use the AWB tool to do a batch edit, mapping all instances of the trilled /r/ to /ɹ/ for both US and UK Englishes. Please let us know if you see any issues with this batch editing. If it sounds okay to you, could you please add me and @Yaejunmyung to the enabled user list? Thank you! Cpeng2 (talk) 19:31, 9 June 2024 (UTC)

This seems reasonable. Indeed, it's possible that the replacement could be fully automated (for specific accents where it's known that trilled /r/ is not phonemic and thus that it can be replaced systematically). I will wait to see if any bot-maintainer wants to run it as a bot task, or if anyone has objections; if not, I can add you to the AWB list after ~a week, or someone else can feel free to do that sooner. (For other people, let me provide a link to the February discussion; this seems like a more limited and safer proposed change than the changes to parenthetical (ɹ).) - -sche (discuss) 20:00, 9 June 2024 (UTC)Reply

OK, based on Surjection's comment it looks like it would be better for this standardization to be done by someone more familiar with Wiktionary, so we can be sure it's done correctly. (I do think that now that accents have been incorporated into T:IPA, it would be possible for a bot operated by one of en.Wiktionary's competent bot operators to do this if they are reading this and have time; indeed, it might even be possible for the T:IPA template to know that if the input is /r/ + an accent that doesn't have trilled /r/, it should simply correct the displayed output to /ɹ/ and/or add a cleanup category, the last of which is possibly the safest option.) - -sche (discuss) 17:06, 12 June 2024 (UTC)Reply
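
To make the last suggestion concrete, the check described above (phonemic /r/ plus an accent known not to have a trilled /r/ gives a corrected display and a cleanup flag) could be sketched like this. It is a Python illustration only, not the T:IPA module code; the function name `normalize_r`, the accent codes "US" and "UK" and the contents of `NON_TRILLED_ACCENTS` are assumptions made for the example.

```python
# Accents assumed, for this sketch only, to lack a phonemic trilled /r/.
NON_TRILLED_ACCENTS = {"US", "UK"}


def normalize_r(transcription, accents):
    """Return (display_text, needs_cleanup) for one IPA transcription.

    Only phonemic transcriptions (between slashes) are touched, and only when
    every accent given is in NON_TRILLED_ACCENTS. Hypothetical helper.
    """
    is_phonemic = transcription.startswith("/") and transcription.endswith("/")
    applies = bool(accents) and all(a in NON_TRILLED_ACCENTS for a in accents)
    if is_phonemic and applies and "r" in transcription:
        # Plain "r" is a different code point from "ɹ", "ɾ" and "ʀ",
        # so those symbols are left alone by this replacement.
        return transcription.replace("r", "ɹ"), True
    return transcription, False
```

Returning a flag instead of silently rewriting the page text corresponds to the cleanup-category option, which leaves the actual edit to a human or a reviewed bot run.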

Oppose granting AWB. I have had to block Yaejunmyung twice for bot-like edits so careless that they did not even check which language they were editing (exhibit A, exhibit B, exhibit C, exhibit D). Some other edits are also inexplicable. This level of editing is simply not acceptable, and if this is what we can expect, we absolutely should not be making it any easier. — SURJECTION / T / C / L / 09:17, 11 June 2024 (UTC)

The final text of the Wikimedia Movement Charter is now on Meta

You can find this message translated into additional languages on Meta-wiki. Please help translate to your language

Hi everyone,

The final text of the Wikimedia Movement Charter is now up on Meta in more than 20 languages for your reading.

What is the Wikimedia Movement Charter?

The Wikimedia Movement Charter is a proposed document to define roles and responsibilities for all the members and entities of the Wikimedia movement, including the creation of a new body – the Global Council – for movement governance.

Join the Wikimedia Movement Charter “Launch Party”

Join the “Launch Party” on June 20, 2024 at 14.00-15.00 UTC (your local time). During this call, we will celebrate the release of the final Charter and present the content of the Charter. Join and learn about the Charter before casting your vote.

Movement Charter ratification vote

Voting will commence on SecurePoll on June 25, 2024 at 00:01 UTC and will conclude on July 9, 2024 at 23:59 UTC. You can read more about the voting process, eligibility criteria, and other details on Meta.

If you have any questions, please leave a comment on the Meta talk page or email the MCDC at mcdc@wikimedia.org.

On behalf of the MCDC,

RamzyM (WMF) 08:45, 11 June 2024 (UTC)

Proposal for a Turkish conjugation module

(Notifying İtidal, Fytcha, Vox Sciurorum, Lambiam, Whitekiko, Ardahan Karabağ, Orexan, Moonpulsar, Lagrium):

I've noticed that the current conjugation tables for Turkish verbs are incomplete, sometimes wrong (korkmak has korkmış as its inferential past 3rd person singular form, according to the table) and different from one another, albeit for minor things (etmek and gitmek seem to be, together with their derivatives, the only verbs that show the polite imperative forms in their table). These reasons, together with the fact that as of now there are way too many templates (Template:tr-conj, Template:tr-conj-v, Template:tr-demek-yemek, Template:tr-conj-*tmek) that require way too many parameters (tr-conj requires the verb's stem, the last vowel in the verb's stem, the stem with the aorist suffix, the last vowel when the aorist suffix is attached and a t/d to know which consonant to use in the suffix -dI) to conjugate Turkish verbs, have made me decide to work on a module that could summarize every possible Turkish verb's conjugation, adding more forms, requiring parameters only if strictly necessary (i.e. if the verb's aorist suffix is unpredictable or if it ends in a t which turns into d before vowels) and making the default table smaller too by setting some forms as collapsible, and I'd like to propose that we switch to this module (here are some sample verbs to display the table).

— Trimpulot (talk) 12:18, 12 June 2024 (UTC)
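
To show why most of the current parameters are derivable from the stem alone, here is a minimal sketch of the two predictable pieces mentioned above: four-way vowel harmony and the t/d alternation in the suffix -dI. It is a Python illustration only, not the proposed module (which would be a Lua module); the function names are hypothetical, only lowercase stems with plain vowels are handled, and irregular verbs, circumflexed vowels and the aorist are deliberately ignored.

```python
# High vowel selected by the last vowel of the stem (four-way harmony).
HARMONY = {
    "a": "ı", "ı": "ı",
    "e": "i", "i": "i",
    "o": "u", "u": "u",
    "ö": "ü", "ü": "ü",
}
# After a voiceless consonant the suffix -dI surfaces with t.
VOICELESS = set("çfhkpsşt")


def high_vowel(stem):
    """Return the harmonizing high vowel for a lowercase verb stem."""
    for ch in reversed(stem):
        if ch in HARMONY:
            return HARMONY[ch]
    raise ValueError(f"no vowel found in stem {stem!r}")


def inferential_past_3sg(stem):
    """Stem + -mIş, e.g. for the stem of korkmak."""
    return stem + "m" + high_vowel(stem) + "ş"


def definite_past_3sg(stem):
    """Stem + -dI, with d or t chosen from the stem's final sound."""
    consonant = "t" if stem[-1] in VOICELESS else "d"
    return stem + consonant + high_vowel(stem)
```

With the stem kork- this yields korkmuş rather than the korkmış shown in the current table, and the t/d choice no longer has to be supplied by hand.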

The module is very impressive. I would totally support switching to the module version. Lagrium (talk) 12:49, 12 June 2024 (UTC)

It looks like a huge improvement. --Lambiam 15:06, 12 June 2024 (UTC)

In many ways this looks like a huge improvement over what we have now. Before we ship it could you squeeze in some more info? Like the formal imperative forms, maybe? And you added the verbal noun but the -iş form is not there. Maybe these two should be listed on the same row to save space horizontally. Rn -me form is there in its own mansion of a box. Same things with adverbial forms. You listed 2 but many are missing. Like -ince, -ip, -e -e, -dikçe, -eli, -esiye and maybe a few more if I'm forgetting any. Whitekiko (talk) 15:58, 12 June 2024 (UTC)Reply

@Whitekiko: The formal imperative forms (as well as -sene and -senize labeled as informal imperatives since I didn't know how else to name them for the time being), -ince, -ip and -e -e are already on the table but aren't shown as a default, mostly because I thought it would overcrowd the table. As for the other forms you mentioned I did miss some of them but I'm not sure adding -iş is really necessary since as far as I can tell it's more of a derivational suffix more akin to -im or -i, whereas -me has actual grammatical functions.

— Trimpulot (talk) 16:52, 12 June 2024 (UTC)

There is enough space for adding the -ince forms:

temporal adverbs açınca, açarken

However, speaking in general, tables for Turkish forms will never be complete. For example, the verbal nouns are declined like all nouns, including case forms of possessive forms. Under ekmek we give the form ekmeğime, so shouldn’t we also, for the sake of completeness, give the form ememememe (as used in ememememe bakmayın! – “don’t mind my inability to suck!”) under the impotential verbal noun emememe? What about the passive, causative and reciprocal forms? And the causatives of reciprocals, like uyuşturmak, or the causatives of causatives, like öldürtmek? The just-do-it suffix -(y)iver? Maybe, one day, we’ll have a module for analyzing Turkish forms, but attempts to be complete in tables are doomed to fail. --Lambiam 20:03, 12 June 2024 (UTC)Reply

@Lambiam: Of course we can't include the entire noun-like declension of the verbal noun nor do we need to as it is implicit in the fact that it is a verbal noun. As for the adverbial forms though, -ince is actually included, but it only appears after toggling the "Show complex tenses" switch for no reason in particular other than if all the hidden adverb and participle forms were visible by default they would overcrowd the table in my opinion, as they would outnumber the finite TAMs. Also I don't think listing them all on the same line would work because that way they wouldn't get any description of their usage or function at all, however small it may be: if -esiye and -eli were in the same box separated by a mere comma how is one supposed to understand that they are pretty much polar opposites in meaning?

— Trimpulot (talk) 20:20, 12 June 2024 (UTC)Reply

Could you also add -er -mez ("as soon as") as a temporal adverb? I forgot to mention that. As for -iş... Our current template has it and I think that's for a reason. -im comes only after a finite number of verbs to derive nouns and these nouns always appear in dictionaries. On the other hand every verb has an -iş form and it always means "the way someone does x". It'll help users that are beginners in Turkish find the infinitive of the verb. -iş has a weird status. There was a debate around it, idk how it ended. We weren't sure what to call it, if it should be a lemma or a non lemma, if the pages should be created. Whitekiko (talk) 08:26, 14 June 2024 (UTC)Reply

@Whitekiko: I don't think that -iş always having the same meaning and being able to be applied to any verb is enough of a reason to include it in the conjugation table, since that argument could also be made for -ici and similar suffixes in other languages as well, like -tio and -tor in Latin, but those are left out. Of course the line between what counts as conjugation and what doesn't isn't precise but it has to be drawn somewhere and I think that semantically heavier suffixes with little grammatical or syntactical meaning should be left out.

As for -er -mez, I would like to add it but I still don't understand if it works with polarities other than positive, and if so how? If you can help me figure it out I'll see it added.

— Trimpulot (talk) 11:38, 14 June 2024 (UTC)Reply

It's just that the first part takes the aorist and the 2nd part takes negative aorist. I've added the def and an example under -er some time ago, rather than creating -er -mez and such. Not sure which one's the right thing to do. Putting these 2 suffixes together will create 6? combinations because first part can go through vowel changes.

Maybe we're of different opinions but I'd like to see -iş and -ici forms too somewhere on the table. I don't think adverbial forms are considered conjugations either but I loved to see them. I don't know the technicalities behind this but it would be revolutionary if we could add "ghost texts" to the templates. Yalayış and Yalayıcı, for example should pull yalamak as a result. In case users run into it in the wild, and they surely will. Whitekiko (talk) 12:46, 14 June 2024 (UTC)Reply
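The -er -mez pattern described above (positive aorist followed by negative aorist) can be sketched roughly as follows. This is only an illustration in Python, not the module's actual Lua code. The helper names `negative_aorist` and `as_soon_as` are made up for this example, and the positive aorist is passed in by the caller because its vowel is not fully predictable from the stem.

```python
# Sketch only: all names here are hypothetical, not taken from the module.
BACK_VOWELS = set("aıou")
FRONT_VOWELS = set("eiöü")


def last_vowel(word):
    """Return the last vowel of a lowercase Turkish word."""
    for ch in reversed(word):
        if ch in BACK_VOWELS or ch in FRONT_VOWELS:
            return ch
    raise ValueError(f"no vowel found in {word!r}")


def negative_aorist(stem):
    """Third-person singular negative aorist: stem + -maz / -mez."""
    return stem + ("maz" if last_vowel(stem) in BACK_VOWELS else "mez")


def as_soon_as(stem, aorist):
    """Build the '-er -mez' form from a stem and its positive aorist.

    The positive aorist (e.g. 'açar', 'gelir') is supplied by the caller,
    since the choice between -ar/-er and -ır/-ir/-ur/-ür is lexical
    for some verbs.
    """
    return f"{aorist} {negative_aorist(stem)}"


print(as_soon_as("aç", "açar"))    # açar açmaz
print(as_soon_as("gel", "gelir"))  # gelir gelmez
print(as_soon_as("gör", "görür"))  # görür görmez
```

With six possible aorist endings and the second part fixed by two-way harmony, this is consistent with the estimate of six surface combinations.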

Proposed module looks great. I had noticed the irregular behaviour with certain above mentioned verbs, but unfortunately I'm module illiterate. And I've always thought the current template gives terrifyingly too much info to an absolute beginner checking one of the simplest conjugations, so the drop-down menus are smart. The details can be discussed and smoothed out, but I definitely support this improvement.

By the way, there are a few more active native editors of Turkish, who might have something of their own to say about this; @Hswehli, Blueskies006, Kakaeater, Science boy 30. Orexan (talk) 20:39, 12 June 2024 (UTC)Reply

@Trimpulot: That looks really good! I have a few suggestions:

Make the colours a bit more muted (see {{es-conj}} {{la-conj}} for good examples).
Make sure each link has #Turkish.
The "show complex tenses" button should be inside the table itself to make it more clear that it expands the table rather than showing a new table.
I feel like the infinitive, being the lemma form, should be at the top.
You may want to use — to indicate an "impossible" conjugation (although leaving the cell empty works as well).
You may want to have a slightly different colour scheme for each table.
You could add a disclaimer explaining which forms aren't included.

Ioaxxere (talk) 02:35, 13 June 2024 (UTC)Reply

@İtidal, Fytcha, Vox Sciurorum, Lambiam, Whitekiko, Ardahan Karabağ, Orexan, Moonpulsar, Lagrium, Hswehli, Blueskies006, Kakaeater, Science boy 30.

I have updated the module with some minor changes (fixed the links, moved the "Show complex tenses" button inside the table and added some missing adverbial forms); however, I would like to ask if you think it makes sense to have those "complex tenses" hidden at all. At first it was meant to make the table more readable by hiding all of the forms that employ more than one suffix but I noticed that even without hiding them it is still relatively small and readable. Also let me know if there is a better way to label the forms in -eli, -esiye and -dikçe since I really don't like using a translation as a label but I also don't know how else to call them.

— Trimpulot (talk) 13:19, 13 June 2024 (UTC)Reply

My personal opinion of the complex tenses drop down menu is not only should it stay but it should also have a high-vis warning that says something like "Attention! May cause shock, anxiety, dizziness, despair in beginner learners. Abandon all hope, ye who click here!" Maybe that's a little much, but it should definitely stay.

I would like to float the idea of adding "-cesine" meaning something like "as if ...", forming adverbials. It is productive with a myriad of suffix combinations, see here ("-ercesine", "-mişçesine" "-yorcasına" "-ecekçesine" etc.) as well as "noun + -cesine" though unrelated. It's a common enough usage to encounter. I guess you would only display the "Simple" aorist form like you did with "-ken" (which is also productive as "-erken", "-yorken", "-mişken", "-ecekken" etc.).

Also, as a native and someone who's got above average grasp on English but isn't a grammar expert, the labels don't mean anything to me, even some of the conjugated forms don't mean anything unless I see it in an example sentence. I literally had to google "-esiye" to see what the hell it was used for. I assume it would be similar with other natives or learners, so I think coming up with labels is kind of an exercise in futility. I get that each form will point to a page of their own, where a text like temporal adverb "until" inflection of açmak would look strange. I'm not saying the module should include example sentences for each and every usage, but translations actually make it easier to have some idea about how or when something is used. At the end of the day, nothing short of turning the table into a full scale grammar book will go very far in the way of helping someone understand the contexts in which these conjugations are used, at least beyond the most basic ones like "açarım, açıyorum, açtım" etc. — Orexan (talk) 14:54, 13 June 2024 (UTC)Reply

@Orexan: That makes sense. As for the various forms suffixed with -ken and -cesine, I've been thinking about putting them all on the same line like so:

temporal adverb simple açarken, açıyorken, açmışken, açacakken

modal adverb "as if" açarcasına, açıyorcasına, açmışçasına, açacakçasına

Or alternatively we could just display the aorist form as you said and add a note of some kind to explain that those suffixes are actually way more productive.

Also for the drop down menu, do you think it should stay even for those participle and adverb forms, and the formal and "informal" imperative forms, even though they don't employ more than one suffix?

— Trimpulot (talk) 15:32, 13 June 2024 (UTC)Reply
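For illustration, the two rows proposed above could be generated along these lines. This is a hedged Python sketch, not the module's code: the function names are invented for the example, and it assumes the tense bases end in a consonant, as the four bases of açmak listed above do.

```python
# Sketch only: helper names are hypothetical.
BACK_VOWELS = set("aıou")
VOWELS = set("aeıioöuü")
VOICELESS = set("çfhkpsşt")  # consonants that devoice a following c to ç


def last_vowel(word):
    """Return the last vowel of a lowercase Turkish word."""
    for ch in reversed(word):
        if ch in VOWELS:
            return ch
    raise ValueError(f"no vowel found in {word!r}")


def with_ken(base):
    """-ken does not undergo vowel harmony."""
    return base + "ken"


def with_cesine(base):
    """Attach -cesine / -casına / -çesine / -çasına to a tense base."""
    consonant = "ç" if base[-1] in VOICELESS else "c"
    tail = "asına" if last_vowel(base) in BACK_VOWELS else "esine"
    return base + consonant + tail


def adverb_row(bases, attach):
    """Join the derived forms into one comma-separated table cell."""
    return ", ".join(attach(base) for base in bases)


bases = ["açar", "açıyor", "açmış", "açacak"]
print(adverb_row(bases, with_ken))     # açarken, açıyorken, açmışken, açacakken
print(adverb_row(bases, with_cesine))  # açarcasına, açıyorcasına, açmışçasına, açacakçasına
```

Showing only the aorist form would amount to passing a one-element list of bases.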

I'm not sure if the line is long enough to contain all combinations of some suffixes, especially with verbs with longer roots than two letters like "aç-". For reference, the paper I linked lists "açarcasına, açıyorcasına, açmışçasına, açacakçasına, açarmışçasına, açmazcasına, açmazmışcasına, açıyormuşçasına, açacakmışçasına, açmacasına, açmamacasına" but I highly doubt any mortal could possibly identify all possible combinations. Displaying the aorist form only, with a note indicating their productivity, and maybe a link to that suffix's lemma page, which hopefully one day comes to be and shows at least a good portion of these combinations and the meanings they convey and, if one can be so bold to ask, one or two example sentences while one's at it, would maintain the table's structural integrity and still be helpful, even if the suffix pages that don't exist yet aren't made in the near future. A suffix page of this comprehensivity is a ton of work, though. I tried to put something together for "-sa" a while back, which is in dire need of an update and some cleanup. That was painful.

The participle and adverbials could be outside the drop down, yeah. But the alternative forms of the imperative are good within, in my opinion. — Orexan (talk) 16:20, 13 June 2024 (UTC)Reply

@Orexan: I see and I agree with you. I have updated the module as well.

— Trimpulot (talk) 18:54, 13 June 2024 (UTC)Reply

First thoughts, good. It's very big, though, like Swahili. There are some parts that are grammatically correct that might be omitted to save space. I suggest putting some boolean constants near the top of the module to control behaviors we might change our minds about.

the -abilmek forms are not necessary (and the -ivermek forms should not be added)
omit passive imperative (probably requires a template argument), potential imperative, and maybe even formal and informal imperative
what about the -iş verbal noun form?
I suggest packing the impersonal participle and gerund/adverb forms into as few lines as possible even if that means omitting the less common forms or losing some of the labels ("impersonal participles | açan, açmış, açacak")

A minor coding style issue, the initializers for local variables lv and hv should have line breaks in the same places. Vox Sciurorum (talk) 16:49, 14 June 2024 (UTC)Reply
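As a rough illustration of the boolean-constants suggestion above, here is a Python sketch (the real module is Lua, and every constant and function name below is hypothetical). It shows how a single switch near the top of the file could decide whether the impersonal participles are packed onto one line.

```python
# Sketch only: these switches and names are invented for illustration.
SHOW_POTENTIAL_FORMS = True          # the -abilmek forms
SHOW_PASSIVE_IMPERATIVE = False
SHOW_FORMAL_IMPERATIVE = True
PACK_PARTICIPLES_ON_ONE_LINE = True


def participle_rows(forms, pack=PACK_PARTICIPLES_ON_ONE_LINE):
    """Return table rows as (label, cell) pairs.

    `forms` is a list of (suffix label, form) pairs. When `pack` is true,
    all forms share one row and the individual labels are dropped.
    """
    if pack:
        return [("impersonal participles", ", ".join(form for _, form in forms))]
    return [(label, form) for label, form in forms]


forms = [("-an", "açan"), ("-mış", "açmış"), ("-acak", "açacak")]
print(participle_rows(forms))
# [('impersonal participles', 'açan, açmış, açacak')]
print(participle_rows(forms, pack=False))
# [('-an', 'açan'), ('-mış', 'açmış'), ('-acak', 'açacak')]
```

Changing a decision later would then mean flipping one constant rather than editing the table-building code.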

I agree that -ivermek shouldn't be added, but omitting -ebilmek while -ememek is included is just asymmetrical
I see why you say to omit the potential imperative (as well as the impotential, I assume), but why the others as well, especially the informal and even more so the formal imperatives?
as I said before, I think -iş is past the boundary of what counts as conjugation and what doesn't
cramming all of the participle or adverb forms on the same line without any hint as to what distinguishes them from one another wouldn't really be helpful in my opinion

As for the line breaks, I just put all the vowel inputs that return the same vowel on the same line, that's the only reason for it being the way it is.

— Trimpulot (talk) 17:34, 14 June 2024 (UTC)Reply
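To make the layout point concrete, here is a Python sketch of what such vowel tables might look like. It assumes, without having checked the module, that lv and hv stand for the low-vowel (two-way) and high-vowel (four-way) harmony tables; the names `LV`, `HV` and `pick_vowel` are chosen for this example only. Inputs that return the same vowel share a line, which is why the two initializers break in different places.

```python
# Sketch only: LV/HV are assumed to mirror the module's lv and hv tables.
LV = {
    "a": "a", "ı": "a", "o": "a", "u": "a",
    "e": "e", "i": "e", "ö": "e", "ü": "e",
}
HV = {
    "a": "ı", "ı": "ı",
    "e": "i", "i": "i",
    "o": "u", "u": "u",
    "ö": "ü", "ü": "ü",
}


def pick_vowel(stem, table):
    """Return the suffix vowel selected by the last vowel of the stem."""
    for ch in reversed(stem):
        if ch in table:
            return table[ch]
    raise ValueError(f"no vowel found in {stem!r}")


print("aç" + "m" + pick_vowel("aç", LV) + "k")    # açmak
print("gel" + "m" + pick_vowel("gel", LV) + "k")  # gelmek
print(pick_vowel("gör", HV))                      # ü
```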

@Vox Sciurorum: I have an idea on the last point: how about placing all impersonal participles or adverbs on the same line by default but separating them when the table is expanded? You can see the table like that here as the conjugation for açmak

— Trimpulot (talk) 09:09, 15 June 2024 (UTC)Reply

I guess this makes sense, as per an earlier comment of mine, the labels and even the conjugated forms don't mean much in a vacuum like this, so this setup makes sense at least from a design point of view. Orexan (talk) 15:10, 15 June 2024 (UTC)Reply

Names of people


[Thread moved from Tea Room]

Van Gogh

Vincent van Gogh, Dutch draughtsman and painter.

Monet

Claude Monet, French painter.

Picasso

Pablo Picasso (1881–1973), Spanish painter, best known as a founder of the Cubist movement.

Can anyone clarify upon what basis we have these entries and others similar? Mihia (talk) 19:18, 11 June 2024 (UTC)Reply

There are others, like Einstein. This issue has come up before. Personally, I do not think they comply with WT:CFI, and thus should not be in the dictionary. I think it is worth having a formal vote to clarify the wording in the CFI. — Sgconlaw (talk) 20:00, 11 June 2024 (UTC)Reply

I agree. CFI says about names "No individual person should be listed as a sense in any entry whose page title includes both a given name or diminutive and a family name or patronymic. For instance, Walter Elias Disney, the film producer and voice of Mickey Mouse, is not allowed a definition line at Walt Disney."

However, it says nothing about entries for individual persons under their family name only (or given name only, for that matter). This seems to be an omission, perhaps because there is no agreement.

(By the way, I also think that the "Walter Elias Disney" example introduces an unnecessary complication/distraction, being different from "Walt Disney". I think it would be clearer to use an example such as, let's say, Pete Tong, which does not have this complication.) Mihia (talk) 20:15, 11 June 2024 (UTC)Reply

@Mihia: I don't have strong feelings about the "Walt Disney" example, but have no objection if the example is changed as you suggest. — Sgconlaw (talk) 22:41, 11 June 2024 (UTC)Reply

I think it may also be worth taking this opportunity to clarify the following, which have also come up before:

Whether terms which are a combination of an honorific or title and a name are permitted, e.g., King Charles (meaning Charles III) and Queen Mum (meaning Queen Elizabeth The Queen Mother). I'm generally of the view that we shouldn't allow such terms, because we may then get entries like King Louis (Louis I, Louis II, Louis III, etc.) and Pope Leo (meaning Leo I, Leo II, Leo III, etc.). The relationship between such terms and nicknames (e.g., Brangelina), which I believe are generally thought acceptable, needs to be considered. Perhaps the rule should be that a term which is a combination of an honorific title and a name is generally not permitted unless it is a widely used nickname.
Whether senses which mean "a work by a person with the surname X" are allowed, e.g., Picasso (meaning "an artwork by Picasso") and Roy (meaning "a book by Arundhati Roy"). Again, I am not in favour of such senses because any surname can be used in this way.

— Sgconlaw (talk) 22:33, 11 June 2024 (UTC)Reply

Yes, based on the consensus in the spate of RFDs now at Talk:Michelangelo, uses of NAME to mean "a work by NAME" should not be included; if we can formalize this somewhere, all the better. Regarding the Walt Disney example, I would only add Pete Tong, but not remove Walt Disney: having Walt Disney as an example is useful for showing that you can't defend having someone's name just because you didn't enter their full name. - -sche (discuss) 00:49, 12 June 2024 (UTC)Reply

Right, I see what you mean. Mihia (talk) 09:25, 12 June 2024 (UTC)Reply

@Mihia, -sche: actually I realized that Pete Tong might be better as an example since we don't have Walt Disney as an entry at all. — Sgconlaw (talk) 12:50, 13 June 2024 (UTC)Reply

I do see -sche's point, though, that the Disney example does illustrate how the policy applies even if the article title is not the exact full name. Mihia (talk) 14:36, 13 June 2024 (UTC)Reply

Draft proposal

For discussion purposes, I've taken the liberty of drafting a proposed amendment to be inserted under "Wiktionary:Criteria for inclusion#Names of specific entities". — Sgconlaw (talk) 12:34, 13 June 2024 (UTC)Reply

Original text

However, policies exist for names of certain kinds of entities. In particular:

No individual person should be listed as a sense in any entry whose page title includes both a given name or diminutive and a family name or patronymic. For instance, Walter Elias Disney, the film producer and voice of Mickey Mouse, is not allowed a definition line at Walt Disney.

Proposed amended text

However, policies exist for names of certain kinds of entities. In particular:

Names of people are subject to the "People's names" section of this page.

People's names

In an entry consisting of both a given name or diminutive and a family name or patronymic, including a pseudonym, no individual person should be listed as a sense. For instance, at the entry Pete Tong, the following sense is not allowed: "Peter Michael Tong (born 1960), the English disc jockey." The entry Mark Twain is not allowed if its only sense is "The pen name of Samuel Langhorne Clemens (1835–1910), the American author". However, any figurative sense is allowed.
In a forename or surname entry:
- No individual person should be listed as someone having that forename or surname. For instance, at the entry Mariah the sense "Mariah Carey (born 1969), American singer" is not allowed, and at the entry Van Gogh the sense "Vincent van Gogh (1853–1890), Dutch draughtsman and painter" is not allowed.
- As a corollary, a sense meaning "a work by a person with the surname" is not allowed. For instance, at the entry Picasso, the following sense is not allowed: "An artwork by the Spanish artist Pablo Picasso (1881–1973)."
A nickname for a person, or two or more persons collectively, which is not their legal name, is allowed. For example, the entry Brangelina (defined as "The couple consisting of celebrities Brad Pitt and Angelina Jolie, together from 2005 to 2016") is allowed. Ye defined as "Kanye West, American rapper, songwriter, record producer, and fashion designer" is allowed, because it was a nickname before West legally adopted it as his name in 2021.
An entry consisting of an honorific or title and a name is not allowed unless it qualifies as a nickname as described above or has a figurative sense. For instance, Lord Byron (defined as "George Gordon Byron, 6th Baron Byron (1788–1824), the English poet") and Prince William (defined as "William, Prince of Wales (born 1982)") are not allowed. Prince Albert, meaning (among other things) a Prince Albert coat, is allowed.

@Sgconlaw: Would Jack the Ripper be deleted as a pseudonym? J3133 (talk) 13:02, 13 June 2024 (UTC)Reply

@J3133: my initial impression is no, because it does not consist of "both a given name or diminutive and a family name or patronymic". — Sgconlaw (talk) 13:06, 13 June 2024 (UTC)Reply

@Sgconlaw: I assume you do not mean we could have anyone’s pseudonym as long as there is no family name included. J3133 (talk) 13:10, 13 June 2024 (UTC)Reply

@J3133: Yes in general, but I haven't given full thought to this point. I think we would want to extend the general forename + surname rule to pseudonyms (perhaps including names like Cardi B and Malcolm X which are in the same format), but if a pseudonym is only a single word it comes close to becoming (or may be indistinguishable from) a nickname, in which case there may be consensus for including such names. — Sgconlaw (talk) 13:34, 13 June 2024 (UTC)Reply

I suggest the nickname portion of the proposal be amended to explicitly disallow stage names and assumed names. As it stands, the proposal as written would technically allow entries for Malcolm X, The Rock, Grimes, Pink, etc. None of those monikers are legal names. But they go beyond being simply nicknames. They're how those individuals identify and are identified publicly. The nickname policy was designed to allow for informal/colloquial nicknames for people. E.g. King of Pop for Michael Jackson, RPattz for Robert Pattinson, Elongated Muskrat for Elon Musk, or Maggie for Margaret Thatcher. Those entries have lexical value that entries based on stage names don't. Someone seeing a celeb news headline like "RPattz to play Dark Knight" might not think to punch "RPattz" into Wikipedia. Whether readers can easily connect a nickname to its bearer via WP depends on whether there's a redirect or disambiguation page. Alternatively, someone is unlikely to encounter a headline like "Elon Musk and Claire Boucher split." Everyone knows her as "Grimes." It's what her Wikipedia entry is titled. There'd be no benefit in having a definition for her at Grimes. WordyAndNerdy (talk) 21:08, 13 June 2024 (UTC)Reply

Hmm... to me, someone seeing "RPattz to play Dark Knight" (or seeing "RPattz" anywhere else) and thinking "I should look that up in a dictionary" seems even less plausible, vs. them thinking to google it or thinking Wikipedia might have a redirect from that to the article on whoever it is. No? I mean, if I'm not going to find out what "Slipknot to appear in new John Wick" or "Grimes and Pink to appear in Barbie sequel" means from a dictionary, and I'm not going to find out what/who Margot Robbie ("Margot Robbie to reprise Barbie role", etc.) is from a dictionary, why would I expect to find out about RPattz from a dictionary? What is the rationale for having RPattz in a dictionary, and not having Grimes, Pink, Slipknot and Margot Robbie? - -sche (discuss) 21:37, 13 June 2024 (UTC)Reply

@-sche You're right, but people do tend to click on things that pop up on Google search results, which is why Urban Dictionary is so successful. Theknightwho (talk) 22:34, 13 June 2024 (UTC)Reply

RPattz is a proper noun that's used exclusively as informal slang. Margot Robbie, Slipknot, and Grimes are the "official" names (legal and self-styled) of various entities. Slang is something a descriptive dictionary should aim to document. Proper nouns like Margot Robbie, Slipknot, etc. are best left to the encyclopedia side, where they can be covered with the depth and detail afforded by biographical articles. The purpose of the RPattz entry is to tell readers this term means "Robert Pattinson, British actor," while the goal of RPattz's Wikipedia entry is to tell you where he was born, how many siblings he has, his first acting job, etc. WordyAndNerdy (talk) 23:03, 13 June 2024 (UTC)Reply

I have never heard of either "RPattz" or "Grimes". I would have no reason to imagine that I could look up the former in Wiktionary but not the latter. Mihia (talk) 23:31, 13 June 2024 (UTC)Reply

This is one of those areas where someone's individual knowledge base seems likely to inform their perspective in nuanced and hard-to-pin-down ways. Regional variations in English, differences between native speakers vs. proficient secondary speakers, generational differences, differences in interests and subcultures (follows celeb news vs. doesn't). I don't think there's a "right" or "wrong" answer to some of the questions being raised in this thread. I just think some approaches are generally more workable than others. More conducive toward hitting the sweet spot of a dictionary that's more inclusive and up-to-date than Oxford but vastly more serious and reliable than UD. WordyAndNerdy (talk) 23:57, 13 June 2024 (UTC)Reply

We certainly do not want to emulate the sea of crap that is UD. However, although it somewhat goes against my personal instincts, I do think it is at least worth considering allowing ALL proper names that meet some reasonable requirement of widespread mention sufficient to prevent a tidal wave of trivia. In this way we would avoid the need to make fine policy distinctions that might make sense to us at the time but are probably lost on ordinary users, such as listing "RPattz" but not "Grimes", or listing "Mona Lisa" because we can find references to "a Mona Lisa smile" but not "Barbra Streisand" because we can find references to "a Barbra Streisand nose", or whatever it might be -- and also avoid the need to be perpetually debating these distinctions. If you asked me, or had asked me, I would say that every single tiny place name definitely was not dictionary material, and yet that policy was agreed. If we can have every tiny place name, then why not also "Grimes", "Monet" and the rest of them? What is the difference, essentially? They are no more or less encyclopedic than the place names, in my opinion. Mihia (talk) 11:50, 14 June 2024 (UTC)Reply

@Sgconlaw: I have a couple of comments on your proposed text.

I wonder whether allowing nicknames, with no further restriction, could open the door to some potentially unwanted entries. Strictly speaking, as the text stands, there seems nothing to prevent me from adding an entry for my mate nicknamed "Bagger". I wonder whether we want unrestricted coverage even for well-known people. I was going to give the example "Giggsy", which is a fairly trivial nickname for a footballer called Ryan Giggs, as something that we wouldn't want to include, but now I see that we actually already DO have this entry! I guess someone thought it was suitable for inclusion.

To be doubly clear, I wonder whether we could explicitly mention that stage names are excluded as pseudonyms.

You mention the exclusion of "a work by a person with the surname"; I wonder if at the same time we should consider making some exclusions as to what does not count as "figurative" use. In my opinion, the following are all candidates for exclusion (in fact, these apply to other proper nouns as well as to people). It seems to be possible to find examples of these for almost anyone/anything that one has heard of, or certainly anyone famous.

"like X", referring to some characteristic of X.
"the X of Y", e.g. "The Ronald Reagan of liberalism".
"do a X", "pull a X", referring to some behaviour associated with X, e.g. "do a Ronald Reagan".
"an X moment", e.g. "a Ronald Reagan moment".

By the way, would it be appropriate to move this discussion to the Beer Parlour, as it concerns general policy? Mihia (talk) 17:59, 13 June 2024 (UTC)Reply

@Mihia: yes, by all means relocate the discussion to the Beer Parlour, and we can continue it there. — Sgconlaw (talk) 18:05, 13 June 2024 (UTC)Reply

Sorry, just one other point that occurred to me. Would it be simpler/shorter to specify under what circumstances definitions that consist ONLY of a real person actually ARE allowed, rather than listing the exclusions, which seem to cover most cases? Mihia (talk) 18:37, 13 June 2024 (UTC)Reply

@Mihia:

I seem to recall from previous discussions that there seems to be a consensus that nicknames are generally allowed, though it seems that this isn't reflected in the CFI. I don't think entries for nicknames of people's random friends will be an issue—it's almost certain that such entries won't pass the verifiability standard.
I'm not clear what you mean by "whether we could explicitly mention that stage names are excluded as pseudonyms". Are you suggesting that stage names should or should not be allowed as entries? (I assume the latter?)
Yes, I think it is a good idea to clarify what counts as a figurative use. Feel free to work that into the draft.
Personally, I think it is clearer to specify in the policy both what is allowed and what isn't, otherwise later on we may be in a difficult position of trying to discern what the applicable rule is from the silence of the text. But maybe it would be clearer to specify what is allowed first, followed by what is therefore not allowed.

— Sgconlaw (talk) 18:45, 13 June 2024 (UTC)Reply

Yeah, for better or worse, even obvious abbreviations of first+last names have tended to be included (Talk:RPattz, Talk:JBiebs), although if enough people comment here we might get a sense of whether there's appetite to reconsider that. I think our CFI are bizarre when it comes to what names we do vs don't include. Why is it considered that I need to know which specific person Giggsy is, but not which specific person Dua is? Last I checked, I was only able to find 2-3 people with the name Dua (and our current presentation of it as an Albanian female given name fails to reflect that two of the 3 bearers got it from Arabic, and one is nonbinary) but perhaps more works have been digitized and the name is better attestable now. Do we include band names, e.g. Slipknot, Rammstein, Einstürzende Neubauten? It seems we do not, and that seems reasonable to me... but then why is Slipknot referring to a set of individuals not included, but Brangelina referring to a set of individuals is? (This is only the tip of the iceberg, consider e.g. fictional places' names.) For Prince William et al., cf. Talk:George VI.
I will opine that if a nickname is used for multiple individuals, and especially if it's productively applicable to e.g. everyone with the surname Giggs, it is probably better defined as "a nickname for people with the surname Giggs" [etc] rather than as "the nickname of [specific person], [specific other person], [specific third person], [specific fourth person], [specific fifth person], ...", similar to how we treat Ed. - -sche (discuss) 19:21, 13 June 2024 (UTC)Reply

Things would be a lot simpler if we decided either "No definitions at all are allowed that simply describe a proper noun -- go and look at Wikipedia for that" OR "Every proper noun (attestable to some minimum level) is allowed"! Mihia (talk) 19:32, 13 June 2024 (UTC)Reply

@Mihia That seems unhelpful at best: we document terms, whereas Wikipedia documents the referents for those terms. Excluding terms that describe a certain class of referent because people might be looking for information about that referent would lead to us excluding English tree because the Wikipedia article Tree exists. Obviously that's a silly example, but it underlines the point that it's not sound logic to be basing policy on. If you don't care about proper nouns that's fine, but quite clearly many users and editors do. Theknightwho (talk) 22:26, 13 June 2024 (UTC)Reply

On the contrary, I believe that it is an EXCELLENT idea to choose one or the other. Judging by your last sentence, you seem to have missed my second option. Mihia (talk) 22:53, 13 June 2024 (UTC)Reply

@Mihia It's only an excellent idea if you prioritise swift policy decisions over anything else, but I don't think it would even achieve that: any set of rules always raises questions about what does and does not qualify, unless it is infinitely permissive or restrictive, but neither of those stances would improve the dictionary, in my view. Theknightwho (talk) 16:10, 14 June 2024 (UTC)Reply

The problem with this is that not every language/variety has the privilege of having a Wikipedia. Even if they do have a Wikipedia, that Wikipedia might be prescriptive. For example, the official Persian word for Malaysia is مالزی (malezi) in Iran and مالیزیا (malīziyā) in Afghanistan (those are the respective terms used by news agencies in both countries). However, Persian Wikipedia is extremely prescriptive and considers standard Iranian Persian "correct" and standard Dari "wrong". Mentioning that the country is called مالیزیا (malīziyā) in Afghanistan is actually not even allowed and would be reverted. So it's not as though we can implement a hard rule that says "go look to Wikipedia for proper nouns" because in some cases, the only place it can be documented is on Wiktionary!! — BABR (talk) 02:44, 14 June 2024 (UTC)Reply

Brangelina is an informal nickname for a celebrity couple used in the media and colloquial speech. Celebrity couples generally don't present themselves to the public by such monikers in the same way bands collectively identify as Radiohead or Slipknot. Official band names can be treated like stage names. They're names that individuals have chosen for themselves and thus seemingly fall outside our scope. Whereas informal/colloquial nicknames fall under the umbrella of documenting language as it exists. We have Fab Four (informal nickname), but Beatles should probably only exist as a plural of Beatle. This is a complex and somewhat subjective line to draw. Which is why I think CFI should ideally leave room for case-by-case considerations. Nailing down hard and detailed rules about what is and isn't inclusion-worthy in this area might create more headaches than it resolves. WordyAndNerdy (talk) 21:59, 13 June 2024 (UTC)Reply

Nevertheless, the present situation is a mess, whereby there are perpetual case-by-case arguments. Mihia (talk) 22:20, 13 June 2024 (UTC)Reply

I generally agree that having clear and consistent policy is favourable to vague (and often unwritten) rules. But in this particular case I'm not sure that exhaustively itemizing what's includable would be an improvement. Would the clarity make for swifter resolution to discussions, or would it create new opportunities for bickering? I just don't see heated disagreements erupting over whether "a Monet" used in reference to an individual work of art is sufficiently figurative to warrant inclusion (it is, IMO) outside a call to explicitly disallow such terms. People often remain indifferent to policy considerations until their hard work is on the chopping block. Which is the main reason I've tried to take an inclusionist approach. People gauge the relevance of language to Wiktionary's mission differently. I've never seen the relevance of taxonomical names. But clearly a number of Wiktionarians do and have put in good work in that area. WordyAndNerdy (talk) 23:32, 13 June 2024 (UTC)

1. Yes, good point about verification.

2. I think it would be helpful to mention that "pseudonyms" includes stage names, if that is indeed the intention (or not, if that is the case, I guess). I mention this because "pseudonyms" can sound more "literary".

4. It seems to me that the silence of the text is more likely to be an issue IF we try to explain both what is allowed and what is not, since "almost inevitably" some case will later arise that is not mentioned at all. If we were to say "these are the only cases when people are allowed as definitions, and everything else is excluded" there can't be any room for doubt. Of course, anything can be challenged later if it transpires that something important has been overlooked. Mihia (talk) 19:27, 13 June 2024 (UTC)Reply

What about foreign renderings of names? Such as 忽必烈 (Hūbìliè) and Hốt Tất Liệt for Kublai Khan. MuDavid 栘𩿠 (talk) 01:36, 14 June 2024 (UTC)

@Sgconlaw Re: the draft proposal above and adding to what MuDavid brought up here - the proposal as it stands seems to fall short in the case of borrowed names of specific individuals in corpus languages. For example, we have zero evidence that 𐌰𐌻𐌰𐌹𐌺𐍃𐌰𐌽𐌳𐍂𐌿𐍃 (alaiksandrus) was a given name in Gothic; it is "encyclopedic" content in that sense, it seems to just refer to an individual. Yet it is valuable to include, because names such as this constitute valuable linguistic and onomastic evidence in otherwise poorly attested languages. Another similar case is Old High German Ōtacher, which is very valuable evidence from a philological standpoint, but which refers again to a specific individual only without attested use as a given name afaik. — Mnemosientje (t · c) 12:23, 14 June 2024 (UTC)

I don't think it is the function of principal namespace to be a repository of unattested terms whose only justification is their possible value to linguists. DCDuring (talk) 12:54, 14 June 2024 (UTC)

These are not unattested. As neither of these is "an entry consisting of both a given name or diminutive and a family name or patronymic", I don't see how they would fall under the exclusions outlined in the proposal. Including such terms in the case of extinct languages with a relatively closed corpus seems clearly preferable to me.--Urszag (talk) 13:03, 14 June 2024 (UTC)Reply

You are, of course, as right about attestation as I was wrong. I really don't think that principal namespace should have entries for terms whose main justification is the convenience of linguistic researchers, that doesn't meet our standards for inclusion for all languages. I believe that there is nothing that prevents the use of names in etymologies. Whether we would want to have Appendices of such items is a separate question. DCDuring (talk) 14:33, 14 June 2024 (UTC)Reply

Gothic and Old High German are extinct and are Limited Documentation Languages, so unless otherwise excluded, terms meet criteria for inclusion if they are attested by one use in a contemporaneous source or one mention in a source accepted by the community of editors for that language. I don't see a reason to have a stricter policy for proper names of the type Mnemosientje mentioned than for other terms.--Urszag (talk) 14:43, 14 June 2024 (UTC)Reply

@DCDuring whose main justification is the convenience of linguistic researchers Who else do you think is interested in entries on Gothic or Old High German at all? Theknightwho (talk) 22:07, 14 June 2024 (UTC)Reply

Jatki and Western Punjabi

First off, I believe Jatki (i.e. the Lahnda dialects of Jhangli, Shahpuri and Dhanni) need to be given their own language code under Lahnda. Currently Jatki entries have to be put as dialectal Punjabi, which doesn't make sense as all the other Lahnda dialects (Saraiki, Pahari-Potwari, Northern and Southern Hindko) get their own language codes.

Secondly, there is an issue where Punjabi (the Wiktionary sense) is not exactly Punjabi anymore. Because although half or more of Lahnda speakers (of Jatki and Pothwari particularly, up to 50 million people) call their language Punjabi, Punjabi of the Wiktionary sense only includes the Eastern dialects (Majhi, Doabi, Malwai, Puadhi).

So I have a wild suggestion; rename Punjabi as it is now to Eastern Punjabi. (I know this would have a tonnn of complications, just a suggestion :D)

Assuming this did happen, it kind of brings up another problem, because "Eastern Punjabi" does not correlate with "Eastern Punjab" (the Punjab state of India), which could cause confusion. Majhi (the taken standard and central dialect of Punjabi) is an eastern dialect and shares its grammar with other eastern dialects. However, the majority of its speakers are from Western Punjab (Pakistan).

Thoughts? OblivionKhorasan (talk) 14:21, 14 June 2024 (UTC)

English pronunciation module

I am soliciting comments for a possible English pronunciation module. I originally thought of doing this using English-style respelling but it occurs to me it may be too complicated to do it this way. For comparison, I wrote a German pronunciation module that uses respelling based on standard German spelling conventions and is mostly finished; it runs to 2400+ lines and supports only a single dialect (the prescribed one with /ɛ:/ for long ä). You can see testcases (lots and lots of them) here, here and here. So I'm thinking of reusing something similar to enPR notation, i.e. something that can map fairly directly onto phonemes but abstracts out the dialectal differences as much as possible. It would be pan-dialectal as much as possible, at least across conservative GenAm (i.e. without the cot-caught and merry-marry-Mary mergers) and RP, so that if a distinction is made in either dialect it needs its own symbol. But it would also support giving separate per-dialect respellings to handle one-off differences like in controversy and advertisement. Does this make sense to people? What do people think of "augmented enPR" as a notation?

BTW by "augmented enPR" I mean enPR with some additional symbols. For example, enPR calls for writing short o in cot as ŏ and au in caught as ô, but cases where short o is pronounced like au in GenAm (the lot-cloth split, as in dog, long, moth, coffee, chocolate, etc.) would need an additional symbol, maybe ŏ*. Similarly for the RP trap-bath split, where affected words (class but not crass, path but not math etc.) would need an additional symbol, maybe ă*. And probably similarly for the weak vowel merger, because (I think) some unstressed /ɪ/ vowels do not turn into a schwa in GenAm (although I can't say which ones other than bring up the canonical minimal pair Rosa's ~ roses). Ideally in this augmented enPR notation people would write hw in words like which and whale, and ōr in words like hoarse and borne that are distinct from horse and born in accents without the horse-hoarse merger; although in practice the latter might be hard to get right as I'm not sure which dictionaries still notate the distinction. (Update: The Longman Pronunciation Dictionary does indicate this distinction for GenAm, as a secondary pronunciation in the cases where the hoarse vowel can exist. For example, force writes the RP pronunciation as only /fɔːs/ but the GenAm pronunciation as primary /fɔːrs/, secondary /foʊrs/.)

Probably we'd have to manually put spaces or hyphens at all syllable boundaries as this is hard to do automatically, although possibly there could be defaults.

There would have to be parameters for the supported dialects so you can specify different pronuns for each (or some subset) as needed, but it might also make sense to have a way of adding pronunciations with arbitrary accent labels.

I might ditch the standard primary and secondary stress symbols that go after the syllable in question (rather than before as in IPA), or at least let you also use IPA-style symbols that go before, as well as probably acute and grave accents that go on the stressed vowel. (The latter would result in lots of double-accented vowels but most modern fonts support them reasonably well, and at least on my Mac using the ABC-Extended keyboard layout, it's easy to type acute and grave accents but harder to enter IPA or enPR stress marks.)

Thoughts? Benwing2 (talk) 03:46, 16 June 2024 (UTC)

An excellent starting point is the diaphonemes listed here. One will need a distinct way to represent each of them. Nicodene (talk) 07:16, 16 June 2024 (UTC)

@Nicodene That is quite a table. I won't be starting off with anywhere near the coverage of dialects given here; probably just traditional GenAm (w/o cot-caught and Mary-marry-merry), "new" GenAm (w/cot-caught and Mary-marry-merry), and RP, maybe also GenAus. Benwing2 (talk) 08:16, 16 June 2024 (UTC)Reply

Still usable for reference, whatever dialects one chooses to include. Nicodene (talk) 07:10, 17 June 2024 (UTC)

Support. My only request is that you consider generating reconstructed Early Modern English pronunciations (see w:Shakespeare in Original Pronunciation). For example, the Oxford Dictionary of Original Shakespearean Pronunciation glosses knight as /(k)nǝɪt/, although I would prefer /(k)nǝɪ(x)t/ to reflect the fact that it existed in other EME dialects (and in some cases unexpectedly shifted to /f/, e.g. thruff) [42]. Simon Roper's videos are also an invaluable resource. By the way, I thought @Theknightwho was working on this too? Ioaxxere (talk) 16:30, 16 June 2024 (UTC)

@Ioaxxere I definitely don't want to step on User:Theknightwho's toes but they said they wouldn't be getting to this for awhile. The approach in their prototype was quite different, using English-based respelling and a whole bunch of rules taken I think from a Git package for text-to-speech (which were maybe RP-specific?) to convert to IPA. Benwing2 (talk) 19:02, 16 June 2024 (UTC)Reply

I'm of two minds about how to handle widespread mergers (especially the horse-hoarse merger): on one hand I support notating what the pre-horse-hoarse merger pronunciation was (and very much support notating what the non-wine-whine and non-Mary-etc merger pronunciations are), and indeed I like the idea of making it easier to notate the full phonological history, mentioning what the Early Modern pronunciation was, what the pre-pane-pain merger sound was, etc. On the other hand, if no or few people make a particular distinction anymore... and we expect the single required 'main' input to make that distinction... people won't make the distinction correctly. Realistically, they'll be notating a hoarse word and it'll be a 50/50 chance whether they look at and copy the notation of a hoarse vs a horse word, because they don't realize there's any difference between those (because there isn't any difference, for any of the major modern national standards, nor most of the subnational dialects, AFAICT). So, it might be safer to have the 'main' input be horse-hoarse-merging, and require the horse-hoarse-distinction sound to be input as a separate value? This means the extra horse-hoarse-distinguishing line will be missing most of the time (like at present), but perhaps that's better than it being wrong much of the time(?). But I concede that there's only so far we could go in that direction: if a speaker doesn't make the Mary-marry-merry or wine-whine distinction or the trap-bath split, they'll likewise just use whatever sounds right in their dialect without realizing they were supposed to make a distinction for the sake of some other dialect, and yet I understand the desire to have the 'main' input make the distinction... and for things like cot-caught I fully agree the main input should make the distinction (since it's still the norm AFAICT) even though this does mean people who merge the sounds will indeed sometimes notate the wrong sound (e.g. [43]).
Other than that, I'll just observe that if one input generates multiple outputs, e.g. both US and UK, then an American adding an American pronunciation may not realize if/that the auto-generated British pronunciation is wrong, and vice versa; maybe we could provide a parameter so that at least conscientious users (if not blithe ones) could add "foobar|USonly=1" (or whatever) so only the US pronunciation they could vouch for was generated, and then entry went into some maintenance category so a Briton could check whether "foobar" also generated the correct British pronunciation and then remove the "USonly=1"...? IDK. PS I hope there's a key mapping the notation to IPA; I have to look at a key whenever I need to figure out what some enPR is intended to be.😅😂 - -sche (discuss) 22:14, 16 June 2024 (UTC)Reply

@-sche What you say makes sense and I was thinking of adding parameters to allow arbitrary pronunciations to be input (either using enPR or whatever respelling or direct IPA) with an accent qualifier added to indicate which accent would be involved; so possibly the horse-hoarse distinction could be handled that way. Take a look at {{pt-IPA}}; the way it handles multiple accents is similar to what I was thinking of doing here (except it doesn't provide parameters to input arbitrary accents). Basically, if you put a pronunciation in |1=, it applies everywhere unless you override a particular accent using e.g. |us=, |uk=, |rp=, etc.; but if you just put a pronunciation in e.g. |us=, it applies only to that accent or set of accents (depending on the parameter), and all the others are considered unspecified and don't display. As for enPR, we could have people input some diaphonemic version of IPA like is used in Wikipedia, but I would be concerned that people would have difficulty using it correctly and would tend to input whatever IPA they felt like inputting, leading to an inconsistent mess just like we have now. The advantage of enPR or English respelling is that it is a clear abstraction layer separate from IPA and doesn't allow as much flexibility, reducing the likelihood of inconsistency. And yes I'd definitely provide a key indicating how the enPR symbols map to IPA in different accents.
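
To make the override behaviour described above concrete, here is a minimal sketch in Python (the real module would presumably be written in Lua). The accent list and the function name are assumptions made purely for illustration; only the |1=, |us= and |uk= parameter names come from the comment above.

```python
# Hypothetical accent codes; the actual module might support a different set.
ACCENTS = ("us", "uk")


def resolve_pronunciations(params):
    """Return {accent: respelling} for the accents that should display.

    A value in "1" applies to every accent unless that accent has its own
    parameter. If "1" is absent, only explicitly specified accents appear.
    """
    default = params.get("1")
    resolved = {}
    for accent in ACCENTS:
        if accent in params:
            resolved[accent] = params[accent]   # per-accent override
        elif default is not None:
            resolved[accent] = default          # fall back to |1=
        # otherwise: accent is unspecified and is not displayed
    return resolved
```

With this logic, supplying only |us= yields a single US line, which is roughly the effect of the "USonly=1" idea without needing a separate flag.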

Another possibility of dealing with the horse-hoarse issue is to provide different notations to indicate "the horse sound", "the hoarse sound" and "the merged horse-hoarse sound". For example, hōrs "hoarse" vs. hôrs "horse" vs. hors (merged horse-hoarse). That way someone who doesn't know the difference could at least avoid being wrong, and in that case the module would only generate the merged version and not the unmerged version. (Maybe the same thing could be done with the cot-caught distinction, which is very unpredictable for words spelled with o. I don't know.)
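
As a rough illustration of the three-symbol idea, the following Python sketch maps each notation to the output it would generate. The table name and function are hypothetical; the IPA values follow the Longman GenAm example for force quoted earlier (/ɔːr/ merged, /oʊr/ for the distinct hoarse vowel) and are not a settled design.

```python
# Hypothetical table: which outputs each "or"-type symbol can generate.
OR_SYMBOLS = {
    "ôr": {"merged": "ɔːr", "unmerged": "ɔːr"},  # "horse" class
    "ōr": {"merged": "ɔːr", "unmerged": "oʊr"},  # "hoarse" class
    "or": {"merged": "ɔːr", "unmerged": None},   # class unknown to the editor
}


def or_outputs(symbol):
    """Return the outputs for a symbol, omitting any that cannot be known."""
    return {label: ipa
            for label, ipa in OR_SYMBOLS[symbol].items()
            if ipa is not None}
```

An editor who cannot tell the classes apart writes the plain symbol and gets only the merged output, so nothing wrong is generated.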

I also think we might have to have flapping indicated explicitly, or at least have symbols to override whatever the default rules are for deciding whether a given t is flapped. There's no way, for example, the module could automatically know that capitalistic has a flapped t but militaristic doesn't. (Unless maybe it goes by whether the t is placed in the preceding or following syllable? Hence kằp-ĭt-əl-ĭ́st-ĭk vs. mĭ̀l-ĭ-tər-ĭ́st-ĭk?) Another similar case is with so-called "Canadian raising" of /aɪ/, which IMO should definitely be shown (since it's probably by now the majority pronunciation in the US?) and which has unpredictable exceptions, like spider and tiger (at least for me, where tiger has "Canadian raising" but taiga doesn't). Benwing2 (talk) 22:42, 16 June 2024 (UTC)Reply

@-sche Please take a look at User:Benwing2/enPR-table. This is my attempt so far at coming up with a list of enPR-style symbols for vowels and their mapping in three accents: RP, "traditional" GenAm and "merged" GenAm. It's not complete (but getting there), and there are certainly mistakes in the table as well as places needing further discussion. There's a column for GenAus but it's so far not filled in. Note that in some cases there are two possible symbols, particularly before r that is not followed by a vowel: a more expressed symbol (i.e. with more diacritics) and a less expressed one, corresponding to the fact that in this context there are a reduced set of possibilities. The two symbols would be equivalent. Benwing2 (talk) 05:30, 17 June 2024 (UTC)Reply

Re having three symbols, for "horse", "hoarse", and "merged horse-hoarse", I suppose the usefulness of that depends on whether we think the average person adding a pronunciation is more likely to look up the documentation page where we can spell out "if you have the same sound in horse and hoarse and don't know which of those originally-distinct classes a word is from, just use notation X; if you do know which class the word is from (consult Longman's, Dictionary.com, the old 1930s OED, [etc other references]), use Y for horse or Z for hoarse" — in which case doing so is sensible — or if they are more likely to just mimic what they see in other entries, e.g. if I know court sounds like horse or hoarse to me (just with h->k and s->t), maybe I just go to [flip a coin: one or the other of those entries] and copy what's there, changing h->k and s->t, in which case it's a coin flip as to whether I've used the right notation.
It also occurs to me that another thing people might do if we use enPR-like notation is just copy the enPR-like notation of the AHD, MW, Dictionary.com, old gazetteers, etc (and if they don't know IPA and the pronunciations used in all national dialects we're outputting, never notice if that causes wrong IPA to be output) . . . but there may be no intelligible notation system which would avoid that problem, since using IPA we equally have people who copy IPA from places without understanding whether it makes sense, e.g. blithely putting length marks and /ɒ/ in GenAm, using /r/, etc.
If we deploy the template semi-manually, not just bot-converting IPA to it, I suppose we could aspire to manually check and correctly input the horse-vs-hoarse class of words as we went along (and the wine vs whine class, etc) and then just ... maybe try to track new additions with an edit filter or something to ensure they were right? And in that case, just having the one main input make the horse-hoarse and wine-whine etc distinctions would indeed be less effort than having a separate w=wh / w=w or hh=hors vs hh=hoars (or whatever).
Regarding Canadian raising and /ʌɪ/: is this phonemically contrastive with /aɪ/? (AFAIK the contrast between writer and rider is viewed as being phonemically /t/ vs /d/?) If it's not contrastive, I would suggest leaving it as a [narrow bracket] thing (and might not consider it important to require the 'main' template input to distinguish it, though if displaying the [phonetic] difference can be done automatically and/or with simple added symbols like your +/- idea, great). Likewise, I would consider not requiring flapping to be indicated in the input (if it's not phonemically contrastive and isn't present at all in one of the major dialects our inputters will be coming from), but if it too can be accomplished by add-ons like you suggest, great. I will note that using hyphen-minus, while it has the appealing advantage of being intuitive, has the disadvantage that it'll cause unexpected behavior/interpretations when people retain orthographic hyphens when inputting the pronunciation of e.g. sky-high and hit-and-run, if the template takes ī- / t- to be signalling something about raising or flapping but in fact the inputter just meant it to signal "there's a hyphen here". (OTOH, if what the template displays in response to that hyphen is nonetheless correct — if sky-high-type words indeed don't raise and hit-and-run-type words don't flap — then I suppose it doesn't matter, hah). - -sche (discuss) 20:16, 17 June 2024 (UTC)Reply

@-sche Hmm, your point about hyphens is a possible issue, as I was thinking of using hyphens to separate syllables. Maybe instead I will use dot (.), which is also intuitive. As for whether /ʌɪ/ is phonemically contrastive, aside from cases like writer vs. rider, there are near-minimal pairs at least in my dialect of spider /ʌɪ/ vs. spied-her /aɪ/, tiger /ʌɪ/ vs. taiga /aɪ/, high school "secondary school" /ʌɪ/ vs. high school "a school that is high (e.g. in elevation)" /aɪ/, etc. I don't know if those pairs are universal, but I think at least the spider and high school exceptions are pretty standard. My thought was that the template would have a default rule "use /ʌɪ/ before an unvoiced sound, /aɪ/ otherwise" that would work in the large majority of cases, so the cases needing a specific ī+ or ī- override would be fairly rare. Similarly for flapping, the rule might be something like "syllable ends in vowel + t or rt and the next syllable is unstressed and begins with a vowel", which should work in the majority of the cases provided people put the t in the right place (which of course isn't guaranteed, but as you've shown, it's difficult to make something foolproof).
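
The two default rules just described could be sketched roughly as follows, in Python for illustration. Everything here is an assumption: the phoneme sets are incomplete, syllables are represented as lists of phoneme strings, and it is assumed (the comment above does not say) that "+" forces raising and "-" suppresses it.

```python
# Illustrative, incomplete phoneme sets (assumptions, not a final inventory).
VOICELESS = {"p", "t", "k", "f", "θ", "s", "ʃ", "tʃ", "h"}
VOWELS = {"æ", "ɪ", "ə", "ɛ", "ʌ", "i", "u", "ɑ", "ɔ", "ʊ", "aɪ", "eɪ", "oʊ"}


def price_vowel(next_sound, override=None):
    """Default: raised /ʌɪ/ before a voiceless sound, /aɪ/ otherwise.

    override "+" forces the raised vowel and "-" forces the unraised one
    (which sign means which is an assumption of this sketch).
    """
    if override == "+":
        return "ʌɪ"
    if override == "-":
        return "aɪ"
    return "ʌɪ" if next_sound in VOICELESS else "aɪ"


def is_flapped(syllable, next_syllable, next_is_stressed):
    """True if the syllable ends in vowel + t (or r + t) and the next
    syllable is unstressed and begins with a vowel."""
    if len(syllable) < 2 or syllable[-1] != "t":
        return False
    before_t = syllable[-2]
    after_vowel_or_r = before_t in VOWELS or before_t == "r"
    next_starts_with_vowel = bool(next_syllable) and next_syllable[0] in VOWELS
    return after_vowel_or_r and not next_is_stressed and next_starts_with_vowel
```

Under these rules writer gets the raised vowel and rider does not by default, while spider would need an explicit override; capitalistic (t closing its syllable) flaps and militaristic (t opening the next syllable) does not.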

As for trying to catch people misusing the template, I think that IPA is very easy to misuse (as you've given examples of) and hopefully the use of enPR will be a little less so; at least, I was thinking of having the code check for erroneous usages and throw errors in those cases to make it more likely they get fixed. Examples of erroneous usages would be omitting syllable breaks (there should never be more than one vowel in a syllable), using an unmarked vowel other than in the particular cases where it's allowed, putting two of the same consonant in a row, etc.
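
A minimal sketch of two of these checks, in Python for illustration: the vowel inventory is a hypothetical subset of enPR, the input is assumed to be already split into syllables of individual symbols, and the unmarked-vowel check is left out because it depends on rules not spelled out here.

```python
# Hypothetical subset of enPR vowel symbols, for illustration only.
VOWELS = {"ă", "ā", "ĕ", "ē", "ĭ", "ī", "ŏ", "ō", "ô", "ŭ", "ə"}


def check_respelling(syllables):
    """Return a list of problem descriptions (empty if none were found).

    `syllables` is a list of syllables, each a list of symbols.
    """
    problems = []
    # Check 1: more than one vowel suggests a missing syllable break.
    for index, syllable in enumerate(syllables, start=1):
        vowel_count = sum(1 for symbol in syllable if symbol in VOWELS)
        if vowel_count > 1:
            problems.append(
                f"syllable {index} has {vowel_count} vowels; "
                "missing syllable break?"
            )
    # Check 2: the same consonant twice in a row, also across a break.
    flat = [symbol for syllable in syllables for symbol in syllable]
    for first, second in zip(flat, flat[1:]):
        if first == second and first not in VOWELS:
            problems.append(f"doubled consonant {first!r}")
    return problems
```

A real module would throw an error rather than return a list; returning the messages here simply makes the checks easy to try out.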

As for whether it makes sense to have a symbol for cases of mergers, I'm not sure what the right answer is here. If we do have such symbols, we can have cleanup categories for their use. If we don't, we can use the WT:Tracking mechanism to track cases where e.g. the horse and hoarse symbols are used, but I'm not sure how to "mark off" the ones we've checked other than e.g. to have a page somewhere containing a whitelist of terms that have been checked. Can you elaborate on how you think an edit filter would work? Benwing2 (talk) 20:53, 17 June 2024 (UTC)Reply

Dot for syllable breaks is intuitive. I've been worried people would use hyphen for things like hĭt-bī-pĭtch.ĭs — we were discussing a while ago the various unstandardized ways people indicate various kinds of word breaks in the hyphenation template — so I was thinking if we used a different symbol than - for indicating flapping / (non)raising (e.g. use t^ or something), then if people do use - to mean "there's a hyphen here", the template can easily flag it as something to clean up, whereas if the template expects t- as valid input, I was worrying it'd be harder for it to know whether a given instance is right, but your proposed checks against two vowels etc sound like they'd catch any problems. (I may be wrong to think people will input hĭt-bī-pĭtch.ĭs, anyway.)
Re raising, hopefully more people can weigh in; for my part I would rather be conservative and wait until I see more literature referring to it as phonemic rather than allophonic (AFAICT it is near-uniformly referred to as allophonic), before moving it out of [brackets] and into /phonemic/ status. (The various near-minimal pairs I see mentioned all seem to be said to exist for only some speakers and dialects; besides spider and tiger I also see hire vs higher mentioned as a pair some people distinguish, but not others.) BTW, on the subject of ʌɪ, the OED gives the British pronunciation of all these words, tiger, taiga, rider, writer, etc, as /ʌɪ/ (and the American pronunciation of them all as /aɪ/), although I suppose that's not RP.
Re an edit filter, I meant a filter could tag all new additions of the pronunciation template with a horse/hoarse vowel in the input, so people could manually review those additions to see if they were correct; this would be labor intensive / inefficient. I like your idea of having the third / merged symbol add a cleanup category; that's probably the best approach, though I think it only helps with aware/conscientious users (who know they're supposed to make the distinction, and can use the "merged" symbol if they're unsure), whereas I'm thinking about the users who don't realize they're supposed to make a distinction (but perhaps nothing can be done about them). - -sche (discuss) 04:04, 19 June 2024 (UTC)Reply

What you're saying about phonemic vs. allophonic of Canadian raising and flapping makes sense; I'll have them indicated as allophonic (but still provide symbols for inputting them if needed, since it's hard for the module to always get it right). BTW one possibility for notating the horse/hoarse distinction is to use similar +/- or whatever symbols rather than ör vs. ōr, so that e.g. you'd have merged hors vs. something like ho-rs "horse" and ho+rs "hoarse" (or maybe some other special characters); maybe that will make people more likely to look up the documentation and see that it's OK to write hors if you're not sure. (The idea is that the + and - additions will always indicate finer distinctions that can be left out in cases of doubt.) Dunno though. Also, I've been looking at sample words to come up with how I would structure the arguments to {{en-IPA}} or {{en-pr}}; the first two words I picked were tree and three and both of them have weirdly narrow IPA transcriptions added. IMO these *really* should not be there; e.g. I don't see how [t̠͡ɹ̠̊˔ʷɪi̯] possibly helps anyone. Benwing2 (talk) 04:46, 19 June 2024 (UTC)Reply

Numerals

Aside from the obvious numerical definitions, the pages on numbers like 3 and 4 include defs for one topic: indicating phonological tones in tonal languages. Is that a rule? Why include that and not other non-numerical things representable by numbers? Dewey decimal numbers, Hornbostel-Sachs numbers, Fujita scale, etc. Or are those things includable? I don't see anything relevant in WT:CFI and I have no interest in adding such defs, I was just wondering if this has been discussed. What's special about tonal markers? Mazzlebury (talk)

Probably because they are used in the transcription of words, e.g. in Jyutping. Voltaigne (talk) 13:57, 16 June 2024 (UTC)

Oh ok, that makes sense, it's more like a character, I get that. Mazzlebury (talk)

AWB request (Brainulator9)

I would like access to AutoWikiBrowser, for use in helping with doing things such as diffusing categories like Category:English terms prefixed with un-. I already have been approved for this tool on English Wikipedia and Wikimedia Commons and have used them with little issue. -BRAINULATOR9 (TALK) 23:42, 16 June 2024 (UTC)Reply

Can you be more specific about what you plan to do? Generally categories like Category:English terms prefixed with un- are not supposed to be added manually. Benwing2 (talk) 05:32, 17 June 2024 (UTC)Reply

In this case, I would be adding |idN= parameters to the {{suffix}} templates, putting them in categories like Category:English terms prefixed with un- (negative). I'm not sure if every little task needs to be brought up here first, but that's the specifics for the task I mentioned. -BRAINULATOR9 (TALK) 14:16, 17 June 2024 (UTC)Reply

OK that sounds fine, I just want to make sure you have some idea what you're doing :) ... if no one objects in a couple of days, I'll add you to the list. Benwing2 (talk) 18:54, 17 June 2024 (UTC)Reply

Japanese historical kana transliteration

Hello, is there maybe a problem with how historical kana spellings are currently transliterated? I've had to change 柔和's "にうわ" to "にう.わ", because, for some reason, the former was producing "niwa" instead of "niuwa". I just now went on 飢える and find that "うゑる" is transliterated as "weru"... is there some sort of mistake here? Why would this be the default behavior and require a "." to fix it? Kiril kovachev (talk・contribs) 01:23, 18 June 2024 (UTC)Reply

Updates to WT:AINE

It was suggested that I update WT:AINE to better reflect common convention, so I went ahead and did so. Probably the biggest change is deleting some of the sort rules, which were a bit complicated and therefore mostly ignored. @Mahagaja, Rua, This, that and the other, Nicodene, Benwing2 --{{victar|talk}} 05:41, 18 June 2024 (UTC)Reply

@Victar Seems reasonable to me from looking over the changes. Benwing2 (talk) 05:46, 18 June 2024 (UTC)

AWB request (Babr)

Was originally gonna wait a few weeks after the first AWB request cuz I didn't want to ask too soon after someone else, but then someone else asked again so I guess I can't control how close my request is to someone else's.

Anyway, I will be using it to clean up Tajik entries, for examples of what I am changing compare what this entry looked like before I cleaned it up to what it looks like after I cleaned it up. I've already cleaned up about ~400 entries so I will just continue what I'm already doing at a faster pace.

BTW I am User:Sameerhameedy, I just changed my username a few days ago. — BABR (talk) 08:13, 18 June 2024 (UTC)Reply

@Babr Hi! I added you to Wiktionary:AutoWikiBrowser/CheckPageJSON. Please let me know if this works; some users have said that this page doesn't work and you have to be added to Wiktionary:AutoWikiBrowser/CheckPage despite that page saying it's superseded. Benwing2 (talk) 21:08, 18 June 2024 (UTC)Reply

Unfortunately it would probably be disruptive and a bad idea to test this (if some users' AWB use in fact depends on the non-JSON CheckPage existing), but . . . iff Wiktionary:AutoWikiBrowser/CheckPage has in fact been superseded, I wonder if the issue might be that the page nonetheless still exists (with names on it and everything), so perhaps AWB first looks there, sees it exists, assumes it's operating on an old wiki that still uses the old name for the page, and looks for names there and doesn't find them: I wonder if not having a page with that title would force it to look for the new JSON page. - -sche (discuss) 02:09, 19 June 2024 (UTC)Reply

Hmmm, that is an interesting hypothesis. I wonder if we can check this in some other fashion, maybe by looking through the AWB docs or asking one of the AWB developers (wherever they hang out). Benwing2 (talk) 02:17, 19 June 2024 (UTC)

BTW on Wikipedia, their CheckPage is a hard redirect to CheckPageJSON. Maybe that would work for us? Benwing2 (talk) 02:25, 19 June 2024 (UTC)

I guess we could try it and revert if it turns out to cause problems. (Might as well move the useful text to WT:AWB while we're at it.) - -sche (discuss) 04:06, 19 June 2024 (UTC)Reply

We'd need the cooperation of someone who has AWB access as well as AWB installed so they could try things out to see if anything breaks. (I don't have AWB installed because (a) I'm on a Mac and (b) I have bot scripts for downloading sets of pages, editing them offline and pushing them in a batch; this is the source of those (manually assisted) notations in my bot changes.) Benwing2 (talk) 04:50, 19 June 2024 (UTC)Reply

I have AWB and can test whether it still works if the page is redirected. (If only some users find that being added to the JSON page isn't enough, my test isn't foolproof; it might be better to get one of the people who found that merely being added to the JSON page wasn't enough for them.) - -sche (discuss) 05:12, 19 June 2024 (UTC)

So far I can still edit, but next I'll try closing and restarting AWB, as I suspect it performs its check on startup. - -sche (discuss) 05:41, 19 June 2024 (UTC)

I closed AWB and started it afresh, and: "Logged in, user and software enabled", it says. - -sche (discuss) 05:43, 19 June 2024 (UTC)

OK, hmmm. Let me try redirecting the page then. Benwing2 (talk) 05:59, 19 June 2024 (UTC)

@-sche OK, I merged the two pages, copying the non-user text to WT:AutoWikiBrowser, and redirected Wiktionary:AutoWikiBrowser/CheckPage to Wiktionary:AutoWikiBrowser/CheckPageJSON. Let me know if it still works after closing, logging out explicitly (if possible), logging in and seeing if you can make an edit. Benwing2 (talk) 06:12, 19 June 2024 (UTC)

OK, I logged out in my other browsers, opened AWB, logged in, explicitly logged out in AWB, closed it, reopened it, logged back in (in AWB), hit the "refresh status" option (I figured if anything would "refresh my 'has-AWB-rights' vs 'doesn't' status", that seemed like a likely candidate, although I think it in fact refreshes some list of typos somewhere), logged out and back in again for good measure, and it still says I'm approved. - -sche (discuss) 06:32, 19 June 2024 (UTC)

OK great! So hopefully everything is sorted now. Benwing2 (talk) 06:34, 19 June 2024 (UTC)

Didn't get a chance to test it until now but it works just fine! — BABR (talk) 07:19, 20 June 2024 (UTC)

Template:univerbation

Could we change the behaviour of this template please? Currently it's a mere copy of {{affix}} / {{compound}} with a specific categorisation. But univerbations are a specific type of compound: they are originally entire phrases/syntagms which came to be joined together.

For example, French aujourd’hui is not the mere sum of au + jour + de + hui: it's the whole phrase au jour(-)d’hui (now obsolete) rewritten and felt as one word.

Imo the template shouldn't take parameters (so we'd write {{univerbation|fr|[[au]] [[jour]] [[de|d']][[hui]]}} instead of {{univerbation|fr|au|jour|de|hui}}), and it certainly should not output a "+" between the components. P U C – 17:43, 18 June 2024 (UTC)

@PUC This is a pretty major change. If we were to implement this, we'd need to figure out a strategy for migrating the 2,000 or so pages that currently use the old format to the new one. Benwing2 (talk) 02:26, 19 June 2024 (UTC)
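
A minimal sketch of what a first migration pass might look like, assuming the phrase-style syntax proposed above were adopted (nothing in this thread settles that). The function name `to_phrase_form` and the regular expression are illustrative only and are not part of any existing bot script. The sketch rewrites only calls with plain positional parameters and leaves calls with named parameters or existing links untouched. It cannot infer elisions such as d' for de, so every converted call would still need manual review.

```python
import re

# Matches {{univerbation|LANG|...}} calls that contain no nested templates.
UNIVERBATION_CALL = re.compile(r"\{\{univerbation\|([^{}|]+)\|([^{}]*)\}\}")


def to_phrase_form(wikitext: str) -> str:
    """Rewrite positional-parameter calls into a single linked phrase.

    Hypothetical helper: the target syntax is only a proposal.
    """

    def convert(match: re.Match) -> str:
        lang = match.group(1)
        parts = match.group(2).split("|")
        # Leave the call alone if it has fewer than two components, named
        # parameters (t1=, alt2=, ...), empty slots, or links already present.
        if len(parts) < 2 or any(
            "=" in p or "[[" in p or not p.strip() for p in parts
        ):
            return match.group(0)
        # Naive join with spaces; elision and punctuation need a human.
        phrase = " ".join(f"[[{p.strip()}]]" for p in parts)
        return "{{univerbation|" + lang + "|" + phrase + "}}"

    return UNIVERBATION_CALL.sub(convert, wikitext)


if __name__ == "__main__":
    print(to_phrase_form("From {{univerbation|fr|au|jour|de|hui}}."))
```

Note that the naive result for the French example links de rather than d', which is exactly the kind of case a human would have to correct by hand.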

CFI for translations?

Is there some bare level of attestability needed for translations? I ask because of the translations of Mummerset, a word that will be used vanishingly rarely - if ever - in other languages. We have Finnish and ~~Russian~~ Macedonian translations, but it doesn't look like these words have ever been used in those languages (and it's arguable whether they are right: Mummerset isn't actually a dialect, it's just a stage accent). Smurrayinchester (talk) 13:38, 19 June 2024 (UTC)

Some people insist the same CFI applies to translations as to entries. Personally, I cannot find anything in the policy to support that. At the same time, I don't know what people expect to happen when translation requests are added indiscriminately, without paying any attention to how likely it is for the term in question to ever be used in the target language, if outside English at all. — SURJECTION ^{/ T / C / L /} 15:44, 19 June 2024 (UTC)

I would support that CFI for translations be the same as other entries, and I share your confusion. Vininn126 (talk) 20:07, 19 June 2024 (UTC)

In theory, because CFI applies to entries, what you're supposed to do if you think a translation (or redlinked Derived term, etc) is wrong, is: create an entry for it. Then you RFV it and it gets removed as both an entry and a translation if it fails RFV. Because this is rather ... faffy ... there are people, as Surjection mentions, who prefer to just apply CFI directly to the translation and remove it if it fails ATTEST without creating an entry for it first, but this gets a surprising amount of pushback, so... if you think something is wrong and doesn't exist, you can always fall back on creating an entry for it. - -sche (discuss) 16:09, 19 June 2024 (UTC)

If so, there has to be a solution for the case where the only translation is removed from an entry and people then simply request that a translation be added "because it's missing". — SURJECTION ^{/ T / C / L /} 19:43, 19 June 2024 (UTC)

Also, using {{not used}} and {{no equivalent translation}} seems inappropriate sometimes. Vininn126 (talk) 20:12, 19 June 2024 (UTC)

I think there are still a lot of cases where the language may have an equivalent translation, but it's not attested/attestable. Like pretty much any place name in pretty much any LDL. Thadh (talk) 23:12, 19 June 2024 (UTC)

This also happens with some multiword terms, or something similar, where the given English word is idiomatic, a language would translate it the same way, but that exact phrase isn't attested. Vininn126 (talk) 23:14, 19 June 2024 (UTC)

@Thadh @Vininn126 This is why {{no attested translation}} exists, but it's hardly used. Theknightwho (talk) 23:16, 19 June 2024 (UTC)

Entries by Geshiza

Following up on this BP discussion back in March, after which I notified Geshiza that they'd be given time to fix their entries before they were moved out of the main space. It's been 3 months since then, and Geshiza has been completely inactive and hasn't requested a language code for Eastern Geshiza, so we cannot even fix the entries they made ourselves if we wanted to. I think the best solution now is to move the entries they made to their user space and notify them that their entries have been moved there and can be fixed and readded (if and when they return). I'd be happy to move the entries myself if there is consensus to do it, but someone would still need to delete every redirect page, so I suppose it's better that someone else does it.

Notifying Chuck Entz, Benwing2 and User:Theknightwho, who were involved in the BP discussion from March — BABR (talk) 07:14, 20 June 2024 (UTC)

@Babr Let's go for it. Can you identify a list of pages to be moved and deleted? Benwing2 (talk) 07:50, 20 June 2024 (UTC)

@Benwing2 pretty much all the entries they've made need to be moved. Luckily, it seems they've added all their entries to Category:Eastern Geshiza nouns. Though, since they tagged the category manually, it's possible they missed some (though I didn't notice any missing entries when comparing the category to their contributions). — BABR (talk) 08:17, 20 June 2024 (UTC)

@Benwing2 additionally, if you think it would be appropriate, I could move the entries myself with the extended mover right. But that's up to your discretion; if you think it's better for someone else to handle it, that's perfectly fine by me. — BABR (talk) 21:51, 20 June 2024 (UTC)

Htoklibang Pwo

See this paper. Htoklibang Pwo is apparently a Pwo lect that is not mutually intelligible with any of the other Pwo languages, and is not culturally closer to any one Pwo group than to another. It seems neither Glottolog nor Wikipedia has an entry for or even a description of this lect, and according to this, the lect was first identified only in 2008! Should we make a distinct code for this? I have currently kept the one entry as Eastern Pwo, but that doesn't seem like a good solution. Thadh (talk) 10:10, 20 June 2024 (UTC)

Support Theknightwho (talk) 03:49, 21 June 2024 (UTC)

Decluttering the altform mess

(Previous discussion.)

At the moment, part-of-speech categories are practically unusable for languages with numerous altforms. For instance, iluec and its 270 variants account for nearly half (!) of all entries in Category:Old French adverbs.

This state of affairs would be greatly improved by adding an optional parameter to {{head}} which disables the normal categorizations handled by that template and instead puts entries in categories named '[language name] alternative forms'.

Thoughts? Nicodene (talk) 03:48, 21 June 2024 (UTC)

I agree. It's worth noting that iluec is an extreme outlier, but even so, if there are a lot of examples where the same word shows up several times, a reader would waste time trying to guess which was the correct one. —Soap— 09:36, 21 June 2024 (UTC)
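
A rough sketch of the category logic being proposed, written in Python for illustration rather than as actual module code. The function `head_categories` and the flag name `altform` are hypothetical; the real {{head}} template adds other categories and takes other parameters that are ignored here.

```python
def head_categories(language_name: str, pos_plural: str, altform: bool = False) -> list[str]:
    """Return the categories an entry would be placed in.

    Hypothetical model of the proposal: when `altform` is set, the normal
    part-of-speech category is suppressed and the entry goes into
    '[language name] alternative forms' instead.
    """
    if altform:
        return [f"{language_name} alternative forms"]
    return [f"{language_name} {pos_plural}"]


if __name__ == "__main__":
    # The lemma entry keeps its usual part-of-speech category.
    print(head_categories("Old French", "adverbs"))
    # A variant spelling marked as an altform is diverted.
    print(head_categories("Old French", "adverbs", altform=True))
```

Under this model only the lemma iluec would remain in Category:Old French adverbs, while its variants would be collected in a separate alternative-forms category.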
