Wiktionary:Information desk/2017/June

Are these valid citations of Windows?

  • [1]: As a MAC girl, I lose sleep over why people use Windows.
  • [2]: There's a reason people use Windows. It just werks.
  • [3]: ... have either vandalized or enhanced (depending on who you are) an upcoming Apple Store with a Windows logo.
  • [4]: Windows users are generally regarded as either thieves, people who lost their keys, or just complete morons too stupid to use doors.
  • [5]: If Mac Users Care About The Environment More Than Windows Users?

--2001:DA8:201:3512:BCE6:D095:55F1:36DE 23:05, 3 June 2017 (UTC)

  • Looks reasonably good to me. Not all of us agreed to the deletion of a word that obviously exists. SemperBlotto (talk) 05:54, 4 June 2017 (UTC)
    • It seems Firefox (deleted previously) can also be cited:
  • [6]: I use Firefox and MacOS, so I'm not afraid of viruses,
  • [7]: But I've realized that many Meezers dont use Firefox
  • [8]: You have to use Firefox and Yahoo for the rest of your life.
  • [9]: And with the right accessories, they can do a spot-on Firefox impression.
  • [10]: They also have a Firefox.

--2001:DA8:201:3512:BCE6:D095:55F1:36DE 06:41, 4 June 2017 (UTC)

  • [11]: For those XP users, ever wanted to muted certain applications like vista can??
  • [12]: Avast! AntiVirus in XP VM under Fusion
  • [13]: Check it out, I found something that does what CDA does and works with XP!
  • [14]: you got to limit your game because people still use XP with slow processors
  • [15]: although I can't use it because I still run XP.

--2001:DA8:201:3512:BCE6:D095:55F1:36DE 07:09, 4 June 2017 (UTC)

West (of) England vs West Country

Is "West England"/"West of England" synonymous with "West Country"? In particular, do they both include Cornwall? If "West England" doesn't include Cornwall, then perhaps it shouldn't be listed as a synonym of "West Country" in Module:labels. - -sche (discuss) 21:18, 5 June 2017 (UTC)

Latin vowel length

Hi guys! I really appreciate the time and effort that you have put into the Latin entries, especially the use of macrons for vowel length! One thing though: is it possible to create a list of all of the Latin words with macrons? When I used Special:Export and added the category Latin lemmas, the result did not have the macrons. Any help will be appreciated, as it would make my experience with Latin easier, especially when I have no internet. Thank you all! — This unsigned comment was added by 203.111.89.251 (talk) at 10:11, 6 June 2017 (UTC).

That's a bit like asking for all the Latin words that contain the letter r. —Tamfang (talk) 08:18, 8 June 2017 (UTC)
I'm not certain, but I think you can find such information in the latest database dump (SQL dump, XML dump, or whatever it's called). You should ask -sche. I think he knows about it. —Stephen (Talk) 05:05, 13 June 2017 (UTC)
Macrons are not used in entry names (page titles), only in the text in the entry. Latin headwords (the line of text at the top of the part-of-speech section) that contain macrons could be put in a category, using Module:headword. However, the macrons wouldn't display on the category page, and I don't know how to get a complete list of entry names, much less a complete list where the list entries have macrons. But perhaps a bot could do it. @DTLHS has made quite a few lists with his bot before. — Eru·tuon 05:21, 15 June 2017 (UTC)
I would need a list of Latin headword templates and the parameters they could take. DTLHS (talk) 05:23, 15 June 2017 (UTC)
It looks like all the templates use either |1= or |head= for the displayed headword. (Additional headwords will appear in |2= or |head2=, |3= or |head3=, etc. Those should probably be ignored if they are not equal to the headword when macrons are removed.) See the list of headword templates below. {{head}} might be used too. — Eru·tuon 05:49, 15 June 2017 (UTC)
list of Latin headword templates
Template:la-adj-1&2
Template:la-adj-3rd-1E
Template:la-adj-3rd-2E
Template:la-adj-3rd-3E
Template:la-adj-comparative
Template:la-adj-form
Template:la-adj-superlative
Template:la-adv
Template:la-comp-form
Template:la-diacritical mark
Template:la-gerund
Template:la-gerund-form
Template:la-gerundive
Template:la-interj
Template:la-phrase
Template:la-suffix
Template:la-letter
Template:la-location
Template:la-noun
Template:la-noun-form
Template:la-proper noun-form
Template:la-proper noun
Template:la-num-1&2
Template:la-num-card
Template:la-num-form
Template:la-part-form
Template:la-future participle
Template:la-perfect participle
Template:la-present participle
Template:la-pronoun
Template:la-pronoun-form
Template:la-punctuation mark
Template:la-suffix-1&2
Template:la-suffix-3rd-2E
Template:la-suffix-adv
Template:la-suffix-form
Template:la-suffix-noun
Template:la-verb
Template:la-verb-form
Here you go: tab delimited, or json if you prefer. Here are the rules that I used: {'la-adj-1&2': [1, 'adjective'], 'la-adj-3rd-1E': [1, 'adjective'], 'la-adj-3rd-2E': [1, 'adjective'], 'la-adj-3rd-3E': [1, 'adjective'], 'la-adj-comparative': [1, 'adjective'], 'la-adj-superlative': [1, 'adjective'], 'la-adv': [1, 'adverb'], 'la-interj': [1, 'interjection'], 'la-phrase': ['head', 'phrase'], 'la-location': [1, 'proper noun'], 'la-noun': [1, 'noun'], 'la-proper noun': [1, 'proper noun'], 'la-num-1&2': [1, 'numeral'], 'la-verb': [1, 'verb']} DTLHS (talk) 18:11, 15 June 2017 (UTC)
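For anyone wanting to reproduce a list like this, here is a rough Python sketch of how rules of that shape might be applied to an entry's wikitext. It is not DTLHS's actual bot code. The function names (strip_macrons, parse_params, macron_headwords), the regular expression, and the shortened RULES table are all assumptions made for illustration, and nested templates are not handled. It follows the suggestion above of keeping a headword only when it equals the page title once macrons are removed.

```python
import re
import unicodedata

# Subset of the rules quoted above: template name -> (headword parameter, part of speech).
RULES = {
    "la-adj-1&2": (1, "adjective"),
    "la-adv": (1, "adverb"),
    "la-phrase": ("head", "phrase"),
    "la-noun": (1, "noun"),
    "la-verb": (1, "verb"),
}

# Matches a simple, non-nested template call such as {{la-noun|rosa|rosae|f|first}}.
TEMPLATE_RE = re.compile(r"\{\{([^{}|]+)\|([^{}]*)\}\}")


def strip_macrons(text):
    """Remove combining macrons (U+0304) so a headword can be compared with a page title."""
    decomposed = unicodedata.normalize("NFD", text)
    return unicodedata.normalize("NFC", decomposed.replace("\u0304", ""))


def parse_params(raw):
    """Split 'a|b|head=c' into {'1': 'a', '2': 'b', 'head': 'c'}."""
    params = {}
    position = 0
    for part in raw.split("|"):
        if "=" in part:
            key, value = part.split("=", 1)
            params[key.strip()] = value.strip()
        else:
            position += 1
            params[str(position)] = part.strip()
    return params


def macron_headwords(title, wikitext):
    """Return (headword, part of speech) pairs whose macron-less form equals the title."""
    results = []
    for match in TEMPLATE_RE.finditer(wikitext):
        name = match.group(1).strip()
        if name not in RULES:
            continue
        param, part_of_speech = RULES[name]
        head = parse_params(match.group(2)).get(str(param))
        if head and strip_macrons(head) == title:
            results.append((head, part_of_speech))
    return results
```

Calling macron_headwords once per page of a dump and writing the pairs out tab-separated would give a list in roughly the format described above.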

bačičobi

What language has a word for 'right (turn)' that sounds like bačičobi? —Tamfang (talk) 16:59, 10 June 2017 (UTC)

Term black people use for white people in films

In German dubs of Hollywood films, the go-to slang term for black characters to refer to white people, whether threatening or affectionate, is Weißbrot (wheat bread). I can't find what they say in the originals. My assumption would be 'cracker', but that seems harsher than it comes across in the dubs. Korn [kʰũːɘ̃n] (talk) 12:35, 12 June 2017 (UTC)

A few ideas: Wikisaurus:white person, as well as the synonyms in whitey --Barytonesis (talk) 12:42, 12 June 2017 (UTC)
Actually it might be a direct calque from white bread. --Barytonesis (talk) 12:49, 12 June 2017 (UTC)
I think honky is the term usually used in films and TV shows. —Aɴɢʀ (talk) 21:01, 12 June 2017 (UTC)

Hello. How can I get the entire list of words in a category?

For example, at the bottom of this page, it says there are 35,830 English verbs in total, but I can only see 200 of them at a time.

https://en.wiktionary.org/wiki/Category:English_verbs

How can I get the entire list of 35,830 verbs?

I am making a word list for learners of English and I'll credit Wiktionary in it. Thanks. Eopluo (talk) 10:17, 14 June 2017 (UTC)

@Eopluo: You can use w:en:WP:AWB to generate lists from categories. —Justin (koavf)TCM 15:22, 14 June 2017 (UTC)
Wow. I'll try that. You're a lifesaver. I was seriously planning to download all the words manually. Eopluo (talk) 04:14, 15 June 2017 (UTC)
@Eopluo: No problem. Let me know how else I can help. —Justin (koavf)TCM 04:19, 15 June 2017 (UTC)
AWB has a painful upper limit of pages to fetch tho. AFAIK it is 200. Dixtosa (talk) 04:57, 15 June 2017 (UTC)
@Dixtosa: It's 25,000. If the OP needs more, he will have to use some other tool or a database dump. —Justin (koavf)TCM 05:10, 15 June 2017 (UTC)
Yes, I was wrong. Thank you for the clarification. --Dixtosa (talk) 17:30, 16 June 2017 (UTC)
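As one possible "other tool" for categories this large, the standard MediaWiki web API can list category members page by page (action=query with list=categorymembers, following the returned continue values). The Python sketch below shows the idea. The function names fetch_json and category_members, the User-Agent string, and the choice to restrict results to the main namespace are assumptions for illustration, not an official client.

```python
import json
import urllib.parse
import urllib.request

API_URL = "https://en.wiktionary.org/w/api.php"


def fetch_json(params):
    """Send one GET request to the API and decode the JSON reply."""
    url = API_URL + "?" + urllib.parse.urlencode(params)
    request = urllib.request.Request(
        url, headers={"User-Agent": "category-list-sketch/0.1"}  # hypothetical agent name
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def category_members(category, fetch=fetch_json):
    """Collect every page title in a category, following API continuation."""
    params = {
        "action": "query",
        "list": "categorymembers",
        "cmtitle": category,
        "cmnamespace": "0",  # assumption: only main-namespace entries are wanted
        "cmlimit": "500",
        "format": "json",
    }
    titles = []
    while True:
        data = fetch(params)
        titles.extend(member["title"] for member in data["query"]["categorymembers"])
        if "continue" not in data:
            return titles
        # Merge the continuation values into the next request.
        params = {**params, **data["continue"]}
```

A call such as category_members("Category:English verbs") would issue one request per batch of up to 500 titles, so a category of this size takes several dozen requests. A database dump remains the gentler option for bulk work.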

It isn't obvious how to add categories to an entry

I think I have looked in quite a few places in the Help section to see how to add a category to an entry, but there don't seem to be any instructions. I ended up doing the same thing I do in Wikipedia edits, i.e. putting something like [[Category:...]] at the very end (in this case [[Category:en:Animal foods]] at the end of the entry for swill), but I'm not sure how the general newcomer will know what to do, and especially little details like including "en:" for English (and perhaps I am wrong? Shouldn't I be using some curly-bracket structure? Yet it seemed to work).

So I think some more documentation is needed somewhere, and maybe with some help about using categories... e.g. answer questions like:

  • where to put the categories, and
  • what categories are added automatically;
  • is "Animal foods" also for birdseed - some ways of thinking of "Animal" include birds, some don't... is there a place to look at what people think a category should be for?
  • Should something also be added to some Wikisaurus entry when something is added to a set? Is there a fast way to do both at once (there are a lot of gaps, e.g. "Animal foods" only has a handful of entries at the moment).
  • What if there are several words for exactly the same thing (e.g. alfalfa and lucerne) or even alternative ways of writing the same thing (e.g. with or without a hyphen or space)... should they all be listed for a given category (I presume not, but where do I find guidelines?)?

Thanks, Maitchy (talk) 02:13, 15 June 2017 (UTC)

1. Yes, I think most people would consider birds to be animals. 2. I wouldn't bother with Wikisaurus pages at all; the whole thing is not well thought out. 3. If there are different words for the same thing, they can both be in the same category. For alternative spellings, however (spaces / hyphens), just categorize the main form. DTLHS (talk) 02:22, 15 June 2017 (UTC)
@Maitchy: I agree that sometimes this project can be a little confusing, especially in regards to categorization. E.g. many Portuguese lemmas are categorized by templates. It's probably preferable to add categories to the part of a definition that matches that language. I.e. If you have a category like [[Category:en:Birds]] put that code near the English definition, immediately before any horizontal lines which separate languages. Do you have any more specific questions? —Justin (koavf)TCM 03:57, 15 June 2017 (UTC)

List of ideophones by language?

I noticed today that Wikipedia used to have a 'List of ideophones by Language', which was deleted in January 2017 - in part on the grounds that it had a list of words which would be better off here on Wiktionary. I agree with the sentiment, but can't find anything like this list on WT. Some of the words that were there have entries on here in various languages, but most of the handful that I checked seem not to.

I'm not very wiki-savvy, and almost entirely new to Wiktionary, so I don't know how/if one can access deleted WP pages. But, at least some version of the page is cloned on this terrible website. As far as I understand the spirit of the criteria for inclusion, most of those words ought to be included. Does anyone with more knowhow want to take this on?

I'm putting this here rather than in requests partly because it's not a request for a specific word-level entry, and partly because I'm a noobie. — This unsigned comment was added by WmGB (talkcontribs).

You can ask the admin who deleted it on en.wp (or any other admin for that matter) for a copy of the file exported as XML and then it can be imported here. —Justin (koavf)TCM 19:15, 22 June 2017 (UTC)
The List of ideophones by language was deleted by Timotheus Canens. He can easily access the deleted page if you want to save its contents. Note that entries in any given language must be in the proper script, so there will probably be a bit of work for you to convert these to the correct script and spelling. —Stephen (Talk) 07:08, 23 June 2017 (UTC)
@WmGB: I'm an admin on en.wiki so I took the liberty of copying the contents of the page prior to its deletion to my userpage: User:BigDom/List of ideophones by language. BigDom 10:31, 27 June 2017 (UTC)

Wiktionary and Wikipedia dump files understanding for parsing and indexing

Discussion moved from Wiktionary talk:Information desk/Archive 2014/January-June.

I need a detailed understanding of the Wiktionary and Wikipedia dump files for parsing and indexing, as it is not clear which dump file to download. For Wiktionary I downloaded the XML dump enwiktionary-20170501-pages-articles.xml, but my parser is fetching only 23742 English words from it, and those do not include basic words. Can you help me with the complete dump, so that I will be able to make a dictionary of almost 1 lakh (100,000) words? It is the same with Wikipedia: from the downloaded file enwiki-20170601-pages-articles.xml I am able to fetch only ~851000 articles that contain an Infobox, which I guess is too few for Wikipedia. Awaiting your reply. Thanks, Himanshu.k (talk) 16:05, 27 June 2017 (UTC)

@Himanshu.k pages-meta-current.xml and pages-articles.xml work well for parsing. Maybe your parser is crashing before it finishes reading the whole file. — Ungoliant (falai) 16:15, 27 June 2017 (UTC)
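One way to rule out a parser that gives up partway through is to read the dump as a stream instead of loading it whole. The Python sketch below uses the standard library's xml.etree.ElementTree.iterparse for this. The helper names (local_name, english_titles) and the rule "main namespace plus a line reading ==English==" are assumptions about what counts as an English entry, not a description of any existing tool.

```python
import xml.etree.ElementTree as ET


def local_name(tag):
    """Drop the '{namespace}' prefix that the export schema puts on every tag."""
    return tag.rsplit("}", 1)[-1]


def english_titles(source):
    """Stream a pages-articles dump and return titles of pages with an English section.

    `source` may be a file path or a binary file object.
    """
    titles = []
    for _event, elem in ET.iterparse(source, events=("end",)):
        if local_name(elem.tag) != "page":
            continue
        title, namespace, text = None, None, ""
        for child in elem.iter():
            name = local_name(child.tag)
            if name == "title":
                title = child.text
            elif name == "ns":
                namespace = child.text
            elif name == "text":
                text = child.text or ""
        # Assumption: an English entry is a main-namespace page with an ==English== heading.
        has_english = any(line.strip() == "==English==" for line in text.splitlines())
        if namespace == "0" and has_english:
            titles.append(title)
        elem.clear()  # discard this page's contents before moving on
    return titles
```

If the number of titles returned this way is far above 23742, the shortfall was probably in the original parser rather than in the dump file.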

Wiktionary and Wikipedia dump files understanding for parsing and indexing

Hello,

I need a detailed explanation of the Wikipedia and Wiktionary dumps. Himanshu.k (talk) 07:10, 28 June 2017 (UTC)