Hello, welcome to Wiktionary, and thank you for your contributions so far.

If you are unfamiliar with wiki-editing, take a look at Help:How to edit a page. It is a concise list of technical guidelines to the wiki format we use here: how to, for example, make text boldfaced or create hyperlinks. Feel free to practice in the sandbox. If you would like a slower introduction we have a short tutorial.

These links may help you familiarize yourself with Wiktionary:

  • Entry layout (EL) is a detailed policy on Wiktionary's page formatting; all entries must conform to it. The easiest way to start off is to copy the contents of an existing same-language entry, and then adapt it to fit the entry you are creating.
  • Check out Language considerations to find out more about how to edit for a particular language.
  • Our Criteria for Inclusion (CFI) defines exactly which words can be added to Wiktionary; the most important part is that Wiktionary only accepts words that have been in somewhat widespread use over the course of at least a year, and citations that demonstrate usage can be asked for when there is doubt.
  • If you already have some experience with editing our sister project Wikipedia, then you may find our guide for Wikipedia users useful.
  • If you have any questions, bring them to Wiktionary:Information desk or ask me on my talk page.
  • Whenever commenting on any discussion page, please sign your posts with four tildes (~~~~) which automatically produces your username and timestamp.
  • You are encouraged to add a BabelBox to your userpage to indicate your self-assessed knowledge of languages.

Enjoy your stay at Wiktionary! Ultimateria (talk) 19:18, 11 May 2020 (UTC)

Bot FailsEdit

Please check your code: {{syn}} doesn't work without a language code, and {{sense}} is not part of the synonym list. See carvallo, dobladora, Día de los Inocentes, and todo el tiempo, where I had to revert your bot due to module errors. Chuck Entz (talk) 03:21, 9 September 2020 (UTC)

Thank you catching that. For what it's worth, I made all of those mistakes by hand when running the bot in a manual edit mode, so the blame is 100% on me. I'll build in some double-checking before using it like that again. Thanks! JeffDoozan (talk) 13:08, 9 September 2020 (UTC)

Pt bot runEdit

Hi, could your bot also re-arrange synonyms for Portuguese entries? – Jberkel 19:48, 10 December 2020 (UTC)

It's running now, it should be able to clean up ~8,000 of the ~10,000 Portuguese entries with synonyms. – JeffDoozan (talk) 16:55, 17 December 2020 (UTC)
Great, thanks. So the bot doesn't make changes in ambiguous situations, which means that all remaining "Synonyms" sections (2000?) require human attention? – Jberkel 11:17, 18 December 2020 (UTC)
That's correct, the bot won't make any changes when it encounters any sort of ambiguity when matching the synonyms to the glosses (eg, when a synonym has no {{sense}} and the word has multiple glosses). I have a wild idea of looking up the synonym's glosses and matching them against the word's glosses as a way of resolving some ambiguities. From my experience cleaning up the unhandled Spanish Synonyms, I'd estimate that could resolve at least half of the remaining 2000 entries, leaving the most ambiguous cases (gustar) in need of human attention. I'll do what I can to reduce the 2000 to something more manageable. JeffDoozan (talk) 13:21, 18 December 2020 (UTC)
Ok, let me know how it goes, and maybe put a list of entries requiring attention somewhere, so the work can be distributed. – Jberkel 19:40, 18 December 2020 (UTC)
We're left with 971 articles that need manual attention.JeffDoozan (talk) 15:26, 19 December 2020 (UTC)

Inflection > DeclensionEdit

Hello,

Would you perhaps be interested in running a bot to replace all Inflection headers with Declension headers in Macedonian entries or subsections for nouns, adjectives, determiners and pronouns? Some will have three equal signs around them and others four. Since "Conjugation" has already been used for verbs, it is more logical to use "Declension" for nominal words, given that "conjugation" and "declension" are co-hyponyms of "inflection", such that having "Inflection" alongside "Conjugation" mixes degrees of specificity. Furthermore, the declension tables for nominal words use "declension" and "conjugation" in their internal title, and there is a general rule according to which nominal inflection templates are given names containing -decl-.

The change that I am proposing should only affect the bolded headers as they appear on a page when it is not being edited and not any headword-line templates or inflection templates. I am contacting you since you successfully took care of several other issues with Macedonian entries recently. Martin123xyz (talk) 08:48, 18 October 2021 (UTC)

@Martin123xyz, It's a very simple bot job to rename every Inflection header to Declension in all Macedonian entries. Is that what you're asking for, or are there some Inflection headers that should not change (eg, in subsections outside of nouns, adjectives, determiners and pronouns)?
For reference, there are currently ~18,000 Inflection headers, ~6,000 Declension headers and ~7,000 Conjugation headers. JeffDoozan (talk) 13:36, 18 October 2021 (UTC)
Yes, that is basically what I am asking for. As far as I am aware, there are no inflection headers that should remain as they are, because "Declension" and "Conjugation" cover all words and there are no cases where a single inflection table can be used for both a verb and a nominal word. I cannot think of any other scenario where "Inflection" would be the only possible choice. However, it is possible that a very small number of verb entries or subsections contain an Inflection header instead of a Conjugation header as well. In that case, those would need to be changed to "Conjugation". Could you check for the existence of Inflection headers in verb entries and subsections and program the bot based on your findings? Thank you in advance. Martin123xyz (talk) 13:54, 18 October 2021 (UTC)
@Martin123xyz, I found and fixed two verbs that used Inflection instead of Conjugation and now the bot is running to rename all remaining Inflections to Declensions.JeffDoozan (talk) 14:31, 18 October 2021 (UTC)
Thank you very much for the help. Martin123xyz (talk) 06:14, 19 October 2021 (UTC)

DRAE non-linksEdit

Hi Jeff. Perhaps you could get your bot to make a note of all pages it doesn't add a DRAE link to - this would serve a couple purposes, firstly to find some errors and secondly to smugly say "ha, we are better than the DRAE coz we have all these words and they don't" QuickPhyxa (talk) 23:15, 18 October 2021 (UTC)

@QuickPhyxa Such a list would be too large to be useful. DRAE has ~87,000 unique lemma words, we have ~82,000. We only share ~44,500 with DRAE. I'm still looking for a good way to divide that into a useful list of "hey, here are some words DRAE has that we don't" that could be of some use to human editors. JeffDoozan (talk) 23:32, 18 October 2021 (UTC)
Those are pretty good numbers. QuickPhyxa (talk) 20:24, 20 October 2021 (UTC)

botEdit

when you start your bot to creat Spanish forms Amirh123 (talk) 16:28, 21 October 2021 (UTC)

Yes. Your bot is creating lots of adjective and noun forms, that's great! MooreDoor (talk) 23:12, 23 October 2021 (UTC)

List of Macedonian comparable adjectivesEdit

Hello,

Would you by any chance like to create a list of all Macedonian adjectives which are currently formatted as comparable (with only {{mk-adj}} in the header, rather than {{mk-adj|-}}, which is for the ones lacking comparable and superlative forms) but which do not have comparative and superlative forms in the declension table, i.e. where the "c=1" parameter, which adds said forms, is missing? I could then open the pages of all such adjectives and add the c=1 parameter manually, essentially bringing the declension template in line with the headword-line template. This step should not be automatic because the adjective might currently be wrongly formatted as comparable when it is not, in which case I would add a hyphen to the headword-line template rather than adding the "c=1" parameter. Thank you in advance. Martin123xyz (talk) 12:16, 27 October 2021 (UTC)

@Martin123xyz Here's your list. 483 matches out of 2528 total adjectives. JeffDoozan (talk) 13:06, 27 October 2021 (UTC)
Thank you very much. Martin123xyz (talk) 13:09, 27 October 2021 (UTC)
I have now gone through the list and fixed just about all of the entries (barring a few obscure adjectives that I don't feel knowledgeable enough about). Martin123xyz (talk) 14:11, 28 October 2021 (UTC)

Bot added duplicate definitionEdit

See diff. I doubt this is a common error, but I figured you should know. Ultimateria (talk) 22:47, 28 October 2021 (UTC)

@Ultimateria: Thanks for the heads up. The bot doesn't currently parse {{form of}} - luckily it's only used on 39 pages and most of them are verbs forms. I'll make sure the bot learns to handle that and on the next run, it'll automatically clean up the duplicates. JeffDoozan (talk) 22:53, 28 October 2021 (UTC)

fulanasEdit

Your bot blanked the page. DTLHS (talk) 20:28, 31 October 2021 (UTC)

@DTLHS: Good catch, thank you. For now, I've reverted the bot edit on that page. It was blanked due to a bug that's been resolved: the lemma, fulano is technically a form of the Proper Noun Fulano, so the bot correctly removed the Noun form section, but failed to create the Proper Noun section. When the bot runs with the new database dump tomorrow, it should complete the page changes successfully (as well fulanos, which was also affected but not blanked because it has a Portuguese section) JeffDoozan (talk) 23:47, 31 October 2021 (UTC)

Macedonian ReferencesEdit

Hello,

Could you please fix all Macedonian reference sections where the references are not preceded by a bullet point, as you did with the IPA transcriptions? Thank you in advance. Martin123xyz (talk) 11:35, 3 November 2021 (UTC)

@Martin123xyz: All done. It fixed 94 entries. JeffDoozan (talk) 14:40, 3 November 2021 (UTC)
Thank you again. Martin123xyz (talk) 14:41, 3 November 2021 (UTC)

Macedonian verbs without labelsEdit

Hello,

Could you please compile a list of all Macedonian verbs with definitions that do not start with any label, i.e. which start directly with "# to", including those which have some definitions starting with a label and others starting without one? Until today's edit, се ороди (se orodi) belonged to this type of entry. I would like to fix them so that all verb definitions contain at least one of the labels {{lb|mk|transitive}}, {{lb|mk|intransitive}} and {{lb|mk|reflexive}}. Verb forms as well as verb lemmas with only a template in the definition (e.g. {{alternative form of}} or {{nonstandard form of}}) should be excluded. Thank you in advance. Martin123xyz (talk) 13:57, 10 November 2021 (UTC)

Here's your list. 214 verbs. JeffDoozan (talk) 22:42, 10 November 2021 (UTC)
@Martin123xyz: Oops, forgot to ping you. JeffDoozan (talk) 22:43, 10 November 2021 (UTC)
Thank you for the help. Martin123xyz (talk) 08:16, 11 November 2021 (UTC)

PreverbsEdit

What justification do you have for doing this? DTLHS (talk) 17:15, 14 November 2021 (UTC)

@DTLHS "Preverb" isn't listed as an allowed header per Wiktionary:POS#Part_of_speech, but "Prefix" is. Since there were only 87 pages using Preverb and they all ended with "-", it seemed like it was a typo or a misunderstanding. Having now read the preverb entry, I can see that may have been the wrong assumption. I'm happy to revert all pages back to Preverb if you think we should keep Preverb as a POS. JeffDoozan (talk) 18:06, 14 November 2021 (UTC)
You shouldn't change headers in languages that you don't know. Please revert your edits. DTLHS (talk) 18:07, 14 November 2021 (UTC)
@DTLHS: All fixed, thank you for catching my mistake so quickly! JeffDoozan (talk) 18:49, 14 November 2021 (UTC)

Macedonian terms with red links in their headword linesEdit

Hello,

I tried to create the category Category:Macedonian terms with red links in their headword lines with {{auto cat}} but it is empty, even though the Romanian and the Welsh equivalents, also created with that template, contain a great deal of pages. What code needs to be added and where so that the Macedonian page fills up too? I primarily need it in order to see which verbs have an aspectual pair in their headword line template for which no page has been created yet (i.e. verbs which have {{mk-verb|impf|pfn=X}} or {{mk-verb|pf|impfn=X}} where X is a redlink and n is an optional cardinal number (pf, pf2, pf3 etc.); for example, се распламти (se rasplamti) and мрази (mrazi)). If the page I have created cannot be made to work, could you create the list of verbs fitting the above description by other means? Thank you in advance. Martin123xyz (talk) 13:30, 18 November 2021 (UTC)

@Martin123xyz:, I have no idea, I haven't used the category system much, but I'd be interested in learning. If you post this on WT:BP, and get a technical explanation of the changes needed, I can help you implement them. If not, I can write something to generate the list for you. JeffDoozan (talk) 13:42, 18 November 2021 (UTC)
Thank you for the reply. I have posted my query in the Grease Pit, since it is for technical problems, whereas the Beer Parlour is for proposals. Martin123xyz (talk) 14:06, 18 November 2021 (UTC)
After receiving a reply in the Grease Pit, I looked at the language-specific headword modules and found that the redlink categories are populated by codes such as the following (for Welsh):
for _, inflection_set in ipairs(data.inflections) do
for _, inflection in ipairs(inflection_set) do
if not inflection:find("%[%[") then
local title = mw.title.new(inflection)
if title and not title.exists then
table.insert(tracking_categories, langname .. " " .. poscat .. " with red links in their headword lines")
end
end
end
end
I do not feel comfortable using such advanced code. Could you implement something like this for Macedonian at Module:mk-headword? Martin123xyz (talk) 09:32, 19 November 2021 (UTC)
@Martin123xyz: Thank you, that's exactly the code that it needed! I created Category:Macedonian verbs with red links in their headword lines, which is just now starting to populate. You may need to create additional categories for other parts of speech that have inflections or plurals. JeffDoozan (talk) 02:02, 22 November 2021 (UTC)
Thank you very much. For the time being, I do not need categories for parts of speech that have inflections in their headword lines, since I am focusing on creating new lemma pages, rather than pages for noun plurals, for example. I will leave those to a bot, when someone agrees to program one for Macedonian. Martin123xyz (talk) 07:45, 22 November 2021 (UTC)

Idioms → IdiomEdit

Have seen the latest change by your bot removing the "s" on the Idioms header on the Japanese sumo page. Weird, but doesn't this break the flow of other headers like Derived terms, Proverbs, and the like (may apply to other languages other than Japanese), that are in the plural? ~ POKéTalker) 11:05, 24 November 2021 (UTC)

@Poketalker: Before the bot edits, there were 7000+ pages using "Idiom" and only 400 using "Idioms", so it seems that the majority has already favored the singular spelling. Per WT:ELE, the rule seems to be that we prefer the singular for all parts of speech or, more broadly, for all things that would have definitions (including Proverb, without the s) and the plural for everything else (Derived terms, Alternative forms, Translations, Usage Notes, References). If you think if it as "Here are definitions for term when used as a noun/proverb/idiom" it makes sense to use the singular.
Per WT:POS, "Idiom(s)" is an explicitly disallowed section. Given its widespread usage that seems like something worth changing and while we're doing that we can decide which spelling should be used. If you have strong feelings about the category and its naming and you can get consensus to permit it and to use the other spelling, and I'll be more than happy to run the bot again to rename Idiom -> Idioms. JeffDoozan (talk) 12:24, 24 November 2021 (UTC)
@Poketalker:. I see now that the "Idioms" section in 相撲 was actually a L4 section below the L3 Noun section, so it's not subject to the some POS rules. I will fix the bot to prevent future mistakes and undo the mistakes it already made. JeffDoozan (talk) 22:28, 24 November 2021 (UTC)

Facilitating the categorization of Macedonian nounsEdit

Hello,

New categories have been created for the classification of Macedonian nouns by inflection type, and in all cases in which it was possible, I have incorporated the categories into the declension templates. For example, {{mk-decl-noun-f-e}} now automatically fills Category:Macedonian feminine nouns with vocatives in -е, which is why there are already 901 pages there (and the number is growing as I write). However, there are other inflectional categories for which a single generic template is used, the inflectional differences being dealt with by means of individual parameters. For example, masculine nouns with plurals in -ови and those with plurals in -еви use the same template, and the correct forms are generated by indicating the plural stem, including -ов- or -ев- in the second parameter. In such cases, the inflectional categories need to be added manually. Is there a way to generate lists of all the noun pages to which a particular category needs to be added so that I don't have to go through the list of all Macedonian nouns and spot the relevant cases one by one? For example, could a list be generated for Macedonian masculine nouns where the indefinite singular ends in a consonant X, with an undetermined number of characters before it, and the indefinite plural ends in Xови (to be checked in the output of {{mk-decl-noun-m}}, e.g. столб), so that I can open those one by one and add the category Category:Macedonian masculine nouns with plurals in -ови? I have written a code at {{mk-decl-noun-m}} which adds the category through pov=1.

For plurals in -еви, the singular will end in X and the plural in Xеви, but there are also cases where the singular will end in XYеви and the plural in Xеви (with deletion of the final consonant, e.g. рој), so those might be more difficult to collect into a list. Finally, for Macedonian nouns with final palatalization, the singular will end in -к, -г or -х and the plural in -ци, -зи or -си respectively (e.g. ученик). If it is not possible to compile lists for these, I will just go through all masculine nouns - it's not a very important issue.

Thank you in advance. Martin123xyz (talk) 08:22, 13 December 2021 (UTC)

Does this search have the results you're looking for with your first example? If so, you can adjust it to find the other endings you're looking for. JeffDoozan (talk) 13:01, 13 December 2021 (UTC)
I changed it to search for -ов- rather than -б- (the consonant before the plural morpheme does not need to be specified), but it also includes many false positives from Bulgarian and Russian, e.g. радон, because it has the related adjective радонов, even though it does not have a Macedonian plural *радонови. What can be written so that it looks for -ов- only within the curly brackets of {{mk-decl-noun-m}}, with an undetermined number of characters to the left and to the right (to cover cases like зет, with {{mk-decl-noun-m|зет|зетов|voc_sg2=зете}} in the declension section)?
The solution will work similarly for -еви, but not for cases where -к, -г, and -х pluralize in -ци, -зи and -си, because the plural -и will not be in the source code and because simply searching for the consonants ц, з and с in the source will return too many false positives, including Macedonian masculine nouns which already end in -ц, -з and -с, like светец, with the plural светци.
Martin123xyz (talk) 13:34, 13 December 2021 (UTC)
I have another idea: to search for -овите and -евите in pages (the definite forms, which are distinctive enough to exclude many false positives). However, when I write *овите or *евите, with the asterisk representing an indeterminate number of letters to the left, I get no results, even though this works in the other direction: ов* gives me complete results for pages containing words starting with ов-. Martin123xyz (talk) 13:37, 13 December 2021 (UTC)
@Martin123xyz: If you search for insource: /\{\{mk-noun\|m/ insource: /\{\{mk-decl-noun-m[^}]*бов/, you'll get results only inside {{mk-decl-noun-m}} JeffDoozan (talk) 13:42, 13 December 2021 (UTC)
Thank you for the help. This still gives me false hits like вдовец which contains -ов- inside the root, but I think these are few enough to be ignored. Do tell me if you come up with a solution for the plurals in -ци, -зи- and си with palatalization. Martin123xyz (talk) 13:45, 13 December 2021 (UTC)
I have come up with a solution for all the categories where insource: is not specific enough: I export Macedonian nouns into Excel and use Excel functions to order them by their final letter or letter sequence. This enabled me to identify all Macedonian nouns ending in к, г and х. I then used more Excel functions to wrap them in - (-), so that I could copy paste them into a test page on Wiktionary, efficiently open them as links, and add the parameters assigning them to the relevant category. The whole categorization process is now advancing well. I just thought I'd post this in case it gives someone else useful ideas. Martin123xyz (talk) 09:19, 14 December 2021 (UTC)

User:JeffDoozan/es forms/unexpected formEdit

Good job making this list, BTW. Could you rerun it, as I think we've corrected most of them already? Br00pVain (talk) 22:06, 16 December 2021 (UTC)

It's generated from the database exports that run on the 1st and 20th of each month. When I have new data, I'll update the list. JeffDoozan (talk) 00:14, 17 December 2021 (UTC)

Category:Requests for translations of Spanish usage examplesEdit

Also, Category:Requests for translations of Spanish usage examples is another useful cleanup list, but could be better - can you make a list of any untranslated quotations that appear more than once? I mean things like a costa de and coordinadamente, which have exactly the same quote. They should probably be done first to avoid repeating work, and I probably was responsible for most of them when adding them to help me study conectores for the C2 Spanish exam I did earlier this year (I passed, BTW) Thanks again for generally not being a substandard user. Br00pVain (talk) 22:13, 16 December 2021 (UTC)

That was a compliment, BTW Br00pVain (talk) 22:13, 16 December 2021 (UTC)
@Br00pVain: I appreciate the high praise. Here's your reward for passing the C2: Congratulations! JeffDoozan (talk) 00:14, 17 December 2021 (UTC)

Demonic demonymsEdit

Another Spanish list that would be useful is one containing "someone from ..." or "of from from" but without being in Category:es:Demonyms. Any chance of whipping that up? Br00pVain (talk) 22:16, 16 December 2021 (UTC)

How do they currently get classified into Category:es:Demonyms? Is it an explicit tag, or a template that includes them? I'm generating reports based on the wiki source code so I have to break it down into something like "page source contains this and does not contain this" JeffDoozan (talk) 00:14, 17 December 2021 (UTC)
I always just manually classify it, either with [[:Category:es:Demonyms]] or {{C|es|Demonyms}} Br00pVain (talk) 11:14, 17 December 2021 (UTC)
@Br00pVain: Here's your list. JeffDoozan (talk) 16:00, 17 December 2021 (UTC)
Pretty good. We don't want any verbs in their, of course. Or stuff already under Category:es:Nationalities. I'll work through these... Br00pVain (talk) 20:18, 18 December 2021 (UTC)

Your bot is making mistakesEdit

Hi. Your bot wrongly corrected several Yoruba entries with ==Alternative terms== to ==Related terms== when they should have been corrected to ==Alternative forms==. E.g. Ọwọ, ọna, ẹnu, etc. Benwing2 (talk) 05:26, 5 January 2022 (UTC)

@Benwing2: Thank you, I'll adjust the code and fix the 20 affected entries. While you're here, it looks like our bots have different ideas about the appropriate level of that section at ao, ua, pari, airo, ilu, agho. My bot adjusts section depth relative to the parent section so that any child with a section level > parent+1 is adjusted to parent+1 and will make this adjustment to all sections on a page whenever it's already making a more substantial edit. From your edit at aio, it looks like you might agree with the N+1 depth in this specific circumstance, but I wanted to get your feedback so we can avoid any unintentional bot edit warring before I fix the titles and depth of the other entries my bot edited at atupa, Ọwọ, ẹdun, ẹnu, ọna, aghọfẹn, aafin, ọrọkọrọ, sọrọ, ẹni, eewọ, ariro and aọn. JeffDoozan (talk) 13:56, 6 January 2022 (UTC)
What my bot does is indent ==Related terms==, ==Derived terms==, ==Declension==, etc. to L4 unless there are multiple etymology sections (==Etymology 1== etc.), in which case they are indented to L5. In some circumstances they are left alone at whatever depth they are currently at, e.g. if there are two or more parts of speech above it in the same etymology section or if it's at L3 and after all etymology sections. This is because in some circumstances people want these headers to apply to all parts of speech in a given etymology section or to all etymology sections. I did not anticipate having these headers above the part of speech, which is why they are getting intended to L4 even in that case. My bot handles ==Alternative forms== specially and recognizes that it may occur above all parts of speech, in which case it should be L3. If you fix those entries to use ==Alternative forms== and put them at L3, my bot will leave them alone, but I will see about fixing it to recognize other headers above parts of speech (where they should not be ...) and leave them alone as well. Benwing2 (talk) 02:00, 7 January 2022 (UTC)

User:JeffDoozan/lists/Indo-Aryan with template borEdit

Hello. Unfortunately, WingerBot has converted all {{bor}} templates to {{bor+}} in Indo-Aryan language entries: diff, thereby making this project page empty. What should I do now 😬? Is it possible for you to regenerate a list of all pages that formerly employed {{bor}}— possibly by tracking down archived revisions? Really sorry for the trouble cause, I should have noticed this development beforehand. Thank you. ·~ dictátor·mundꟾ 16:01, 17 January 2022 (UTC)

Perhaps the revision history could be of some help… ·~ dictátor·mundꟾ 16:32, 17 January 2022 (UTC)
@Inqilābī: Updated JeffDoozan (talk) 18:06, 17 January 2022 (UTC)
Thanks a lot :) !! ·~ dictátor·mundꟾ 16:50, 18 January 2022 (UTC)

Bot requestEdit

Could you use your bot to remove inflectional info from translation tables? Specifically under adjective headers, I see e.g. flojo m, floja f, flojos m-p, flojas f-p. I would limit the search to Spanish to start and only those that include a masculine or epicene gender. Also, I may be the only one who is bothered by this, but I believe in cases like e.g. flojo m, suelto m, etc, the gender should be removed. This would be consistent with our headword templates for Romance languages. Ultimateria (talk) 06:53, 21 January 2022 (UTC)

@Ultimateria: It should be possible. Just to clarify, using smooth as an example, the bot should make the following change:
{{t+|es|sofisticado|m}}, {{t+|es|sofisticada|f}} -> {{ t+|es|sofisticado}}
and in nominative:
{{t+|es|nominativo|m}} -> {{t+|es|nominativo}}
JeffDoozan (talk) 23:35, 21 January 2022 (UTC)
Yes, exactly. Ultimateria (talk) 03:35, 22 January 2022 (UTC)
@Ultimateria: Here's a list of the easiest changes, note that the bot will aggressively resolve forms to their "core" lemma, including forms that are "alternative spellings" (see Beige: beige -> beis, and Good: buen -> bueno, güeno -> bueno). JeffDoozan (talk) 22:38, 25 January 2022 (UTC)
Looks good to me! I hadn't thought to remove alt spellings but I think we're better off with fewer of them in t-tables. Ultimateria (talk) 01:00, 26 January 2022 (UTC)

Macedonian verbal nouns and adjectival participlesEdit

Hello,

Could you please create a list of Macedonian verbs whose verbal nouns and adjectival participles are red links (in the conjugation tables, e.g. the slots occupied by the blue links носен and носење in the table for носи), so that I can create them? I have been creating these forms manually for all verbs but I have probably missed some. Verbs where no verbal noun and/or adjectival participle exists, i.e. where the corresponding slots are filled by a dash rather than a red link, should not be taken into consideration.

Thank you in advance Martin123xyz (talk) 14:10, 6 February 2022 (UTC)

@Martin123xyz:, My tools mostly use the wikidumps of the page source, so I just see {{mk-conj-и-и|impf|нос}}, and have no way of knowing if should generate носен and носење or when it would generate a dash. If there's a simple rule like always take the paramater нос + ен or ење, I can make something for you. If it's more complex than that, it might be easier to have someone modify {{mk-conj-и-и}} to generate redlink categories for those slots. JeffDoozan (talk) 16:10, 6 February 2022 (UTC)
There are unambiguous rules, but they are complex and differ depending on the conjugation template. The generation of verbal nouns and adjectival participles is also regulated by special parameters. I was hoping that your tools could straightforwardly look at the output rather than the source code, but if that is not the case, I think it would be best for me to generate my own lists of red links manually. Thank you anyway. Martin123xyz (talk) 20:39, 6 February 2022 (UTC)

Small bot requestEdit

The category Category:Buddhas wrongly contains some English entries— they should be relocated to Category:en:Buddhas. Thanks in advance! ·~ dictátor·mundꟾ 18:41, 20 March 2022 (UTC)

@Inqilābī: I don't have a good way of doing that with my bot tools, sorry. JeffDoozan (talk) 18:47, 20 March 2022 (UTC)
You just need to change {{cat}} to {{c}}, as in diff. Seems quite simple… ·~ dictátor·mundꟾ 18:52, 20 March 2022 (UTC)
@Inqilābī: So '{{cat|en|Buddhas}}' should be replaced with '{{c|en|Buddhas}}' on these pages? JeffDoozan (talk) 19:01, 20 March 2022 (UTC)
Yes. ·~ dictátor·mundꟾ 19:02, 20 March 2022 (UTC)
@Inqilābī: Done. JeffDoozan (talk) 23:10, 20 March 2022 (UTC)

Der/Rel/AfEdit

Hey, sorry to bring this up again, it seems like some of the pages with {{l}} and {{affix}} were skipped for no reason? I took care of the lists, could you send you bot again? Vininn126 (talk) 11:33, 8 April 2022 (UTC)

@Vininn126: Done. I also updated the list of skipped pages. JeffDoozan (talk) 17:20, 8 April 2022 (UTC)
Awesome thank you. I'll do these. Vininn126 (talk) 17:28, 8 April 2022 (UTC)

Spanish noun and adj formsEdit

Hi JD. I hope your bot can mass-create lots of the forms soon - it'll make finding missing lemmas to create a lot easier! Notusbutthem (talk) 13:24, 15 April 2022 (UTC)

Rhinos → RhinocerosesEdit

Hi. Per an RFM discussion, Category:Rhinos is to be moved to Category:Rhinoceroses. In order to do that the former category would have to be emptied; for that, would it be possible for you to change all [[Category:LANG:Rhinos]] to [[Category:LANG:Rhinoceroses]], {{topics|LANG|Rhinos}} to {{topics|LANG|Rhinoceroses}} (similarly for {{c}}, {{C}}, {{top}})? —Svārtava (t/u) • 09:20, 27 April 2022 (UTC)

List requestEdit

Can you create a list of Spanish lemmas ending in [x]ismo that don't link to [x]ista (where [x] is an identical root) and vice versa? Ideally the -ismos and -istas would be sorted separately. I'll probably request more of these but I wanted to start with a short-ish list. Ultimateria (talk) 04:32, 29 April 2022 (UTC)

Here are -ismos without -istas and vice versa. I don't have a good way a checking links, so it just checks for any occurrence of the text, let me know if you need a better filter. The lists will update automatically twice a month when a new database dump is available. JeffDoozan (talk) 15:54, 29 April 2022 (UTC)
@Ultimateria: and here's a ping that I forgot in the above message. JeffDoozan (talk) 15:56, 29 April 2022 (UTC)
Thanks! I forgot to specify though, I was looking for all existing pages to make connections rather than find redlinks. Could you filter out redlinks? Ultimateria (talk) 16:19, 29 April 2022 (UTC)
@Ultimateria: Done. JeffDoozan (talk) 19:48, 29 April 2022 (UTC)

Missing Ety ListEdit

I finally migrated your list over to my own userpage, in case you wanted to remove it from yours! It's something I'm still gonna have to slowly chip away at. Thanks again! Vininn126 (talk) 13:00, 20 June 2022 (UTC)

bot change to refugiarse -- probably not helpfulEdit

Hi. Back in January, your bot made this change, from:

{{es-verb}}

# {{reflexive of|es|refugiar}}

to

{{head|es|verb form}}

# {{es-compound of|refugi|ar|refugiar|se|mood=infinitive}}

I'm not sure why you made this change, but it's not really correct. refugiarse is a pronominal verb meaning "to take shelter"; it's not literally refugiar + -se. (refugiar is a less common transitive verb meaning "to shelter, to provide shelter for".) The current convention is to place pronominal meanings like this on the non-reflexive page, and have the reflexive equivalent use {{reflexive of}}. Benwing2 (talk) 03:27, 22 July 2022 (UTC)

@Benwing2: At the time of that edit, refugiar didn't have any senses tagged as pronominal or reflexive, so the bot made the change using the best available information. Now that refugiar has a sense tagged as reflexive, the next time the bot runs against a database dump containing the new changes, it will re-edit refugiarse and replace {{es-compound of}} with {{reflexive of}}. JeffDoozan (talk) 14:25, 22 July 2022 (UTC)

User:JeffDoozan/lists/es missing drae phrases3...Edit

... and friends are excellent lists! Some expressions are so basic that it's shameful we didn't have them until now. I'll duly work through the lists, doing the easy ones first. GreyishWorm (talk) 22:46, 27 September 2022 (UTC)

Section reorderingEdit

I have a bot request to move lemma sections above non-lemma sections in Spanish entries (or any other languages you see fit). The worst offenders for Spanish are female equivalent nouns under adjective forms, and adjectives under past participles. Ultimateria (talk) 02:47, 29 September 2022 (UTC)

I like that idea. In the event that the sections are not alphabetized, should it preserve the original order, or should it sort them? I'm slightly in favor of sorting alphabetically, but there are cases like leche where the non-alphabetic sorting might be intentional. For reference, here's a list of pages with forms before lemmas, and a list of pages where the sections are not alphabetical. JeffDoozan (talk) 15:43, 30 September 2022 (UTC)
Lemma sections will be more complicated to sort. If possible they should follow the "primary" sense which can be difficult to determine. E.g. In Spanish demonyms, it's clear that the noun senses are extensions of adjective senses. The current format at leche seems ideal to me because the noun is much more common than the interjection, and also more "relevant" (whose exact meaning I can't pin down). In English dive, it's hard to say the verb is much more relevant than the noun, but the verb does predate the noun at least. Unfortunately these might be best handled on a case-to-case basis. Ultimateria (talk) 00:07, 3 October 2022 (UTC)
@Ultimateria: I could sort by the most frequently used POS for a given entry, so that van#Spanish would sort the verb form before the noun lemma, although it would likely produce inconsistent results for demonyms. Otherwise, I can preserve the lemma ordering and sort all forms alphabetically by POS beneath them. JeffDoozan (talk) 00:37, 3 October 2022 (UTC)
The latter option sounds better one to me. I'm not sure non-lemmas should go above lemmas even when they're more frequent. Ultimateria (talk) 01:39, 3 October 2022 (UTC)
@Ultimateria: This should be done now. Let me know if you notice anything else that could use some bot work. JeffDoozan (talk) 23:16, 3 October 2022 (UTC)
Great, thank you! Ultimateria (talk) 01:27, 4 October 2022 (UTC)

Portuguese headwordsEdit

Can you replace the headwords of these past participles to {{pt-pp}}? Ultimateria (talk) 00:52, 6 October 2022 (UTC)

@Ultimateria: All done. JeffDoozan (talk) 02:08, 6 October 2022 (UTC)
I undid the change on derroto that looked like an exception. JeffDoozan (talk) 02:10, 6 October 2022 (UTC)