tag translation and translations starting with capital letters
Join LibraryThing to post.
This topic is currently marked as "dormant"—the last message is more than 90 days old. You can revive it by posting a reply.
Hi! /tag/dialect lists today the following translations:
Language Translation Votes Translator
Belarusian Дыялект 3 0
Bulgarian Диалект 3 0
Croatian Dijalekt 3 0
Dutch Dialect 1 1
Finnish Murre 3 0
French Dialecte 3 0
German Dialekt 4 0
Hungarian Dialektus 3 0
Icelandic Mállýska 3 0
Irish Dialecte 3 0
Italian Dialetto 4 0
Latvian Dialekts 3 0
Lithuanian Tarmė 3 0
Norwegian Dialekt 4 0
Polish Dialekt 4 0
Portuguese (Portugal) Dialeto 3 0
Romanian Dialect 1 1
Russian Диалект 3 0
Serbian Дијалект 3 0
Slovak Nárečie 3 0
Spanish Dialecto 3 0
Swedish Dialekt 4 1
Swedish dialekt 2 0 anglemark
Turkish Lehçe 3 0
Yiddish אקצענט (the last votes are:) 4 0 (editid because of BiDirectional rendering issues).
The translations of "dialect" is in most languages (which are using lower case and upper case letters) mainly using lower case variants. Only German is using a translation staring with a upper case letter: "Dialekt".
I don't understand. Are you saying that all the other languages than German SHOULD start with a lower case letter?
No! My conjecture is that these translations are based on Wikipedia / are taken from Wikipedia article names. Please compare with http://en.wiktionary.org/wiki/dialect#translations . Both should use the same "license"
Okay, I really don't understand. Because the translations at the wiki page are all starting with lower case "d", except for German, which starts with upper case "D". That is what I said in #2, but you are now saying no. Perhaps you are misreading my post?
Looking at the translation from /tag/Einstein and /tag/Benedict+XVI one can see that for persons, countries, cities most Wikipedia titles are usefull. The only (or main) problem there is the usage of the COMMA character.
What I want to point is that the translation for "dialect" (for /tag/language probably for also for "cat" etc.) is not the "optimum".
You seem to be complaining that all the first letters of whatever dialect should be in each language is lower case when it should be upper case, except for German. But looking at that list, they're all in upper case already (aside from the ones that I can't read) except for Swedish.
In other words, your complaint and your example seem to be opposite.
#7 by jjwilson61> I think he's saying the opposite. I think when he says "is in most languages ... mainly using lower case", he means in in the "real world", not what's set in LT.
But then when I tried to say that, he told me "No!" I'm still guessing he misunderstood my comment much like I think you misunderstood his.
I think that the point is that originally Tim populated the tag translation pages with help from the existing linkages in wikipedia between articles in different languages. Unfortunately, wikipedia article names start with capital letters, and so all the tags translations that came from there also have upper case starting letters. Which is not "correct" for most languages when applied to normal words (German being the major exception for nouns).
(Gangleri is talking about the general case, and not just for "dialect" and wants an overall fix of some kind, I presume.)
I'm not entirely certain what is supposed to be done about it, and it doesn't bother me too much.
Wenn ich den Problem auch falsch verstanden habe, kannst Du es nochmal auf Deutsch schreiben? Vielleicht verstehe es besser in dem Fall...
I don't see a big problem here. It is rather hard to find members at random using languages other than English for tagging, but just as many of us in English use capitals for our tags, so do others. Certainly in French I have found both upper and lower case at the beginning of tags, depending on the member. For Spanish and Portuguese I also found members using upper case for the first letter.
As far as I know - but obviously I can't vouch for all those languages - using the uppercase isn't actually wrong. Alright, dictionaries usually avoid doing so, but encyclopedias usually do (hence also the usage on Wikipedia).
At the risk of teaching grandparents to suck eggs, I'd point out that tags are case sensitive. This was an issue for me, as some of my tags are proper nouns, and some are German common nouns. Plus I had an ingrained habit of writing my first tag for any given work entry with a capital letter because I insisted on thinking of the tag field as a sentence. Foolish fellow.
So far the only answer I have found for human ignorance or folly is to have a mammoth tag editing session every so often...
I'd point out that tags are case sensitive
Now if it were the other way around: all German nouns should indeed start with a capital letter - and since tag translation can make it do so for German, it should.
ETA: Sorry, I somehow misread the statement I was responding to. The are NOT case sensitive. (At least they're not supposed to be. I only occasionally see examples to the contrary, which IMHO is a bug.)
> 3 & 5: No, pre-populating everything with Wikipedia derived info isn't "optimal". But it's a basis, and those of us actually speaking the language(s) are kindly requested to take it from there.
I thought that Tim made tags case-insensitive when he re-did tag combination.
Yes the tags themselves. Tag translations seem to be just bits of text though. We can easily use correct spelling for each language there. (Not that that would change anything regarding combination or search, I think).
#17 by @BarkingMatt> I think maybe you misread jjwilson. He said "case INsensitive" (emphasis mine). I say this because you quoted RobertDay saying that tags were case sensitive and said the same thing you said about jjwilson's post. Correct me if I'm wrong here.
18> His answer can be the same to both of us because his answer boils down to that case doesn't matter for tag translations. I haven't really been following this feature since it seems to have minimal affect on the English site but I suspect he's right. If you think the translation is wrong you should propose a different translation. So there is no bug here.
I spend one week in a hospital. In my room a professor of arts was / is waiting the last breath. His consolation is Marc Aurel.
re: subject of this topic
>14 Nicole_VanK: "Now if it were the other way around: all German nouns should indeed start with a capital letter - and since tag translation can make it do so for German, it should."
Thanks for the clarification.
I assume that we all agreee that the tag translation proposals made by Tim are a very good starting point. All proposals have implemented votes. Maybe the easiest way would be to add two icons one for requesting the proposal starting with (an) upper case letter(s) and one icon for requesting the proposal starting with (an) lower case letter(s) .
For proper names the Wikipedia page titles can be used in most cases:
http://en.wikipedia.org/wiki/Pierre_de_Fermat i.e. Fermat, Pierre de
please note the page title at http://ru.wikipedia.org/?curid=14933 "Ферма, Пьер"
Please compare with /tag/Pierre+de+Fermat.
Sorry I contributed to confusion.
> 18 brightcopy: I somehow misread message #13. Edited my message #14.
> 13 RobertDay : The tags are only case sensitive in your own tag list. Yeah, I semi-regularly need to dive into mine too in order to make things uniform. But even before correction upper and lower case variants will lead to the same tag page, and behave the same in search.
>20 gangleri:: Sorry, I may be missing your point. But the process of importing Wikipedia supplied info has run its course. What we are left with is checking (and if necessary correcting) that, and supplying translations where none were found in Wikipedia.
Maybe the easiest way would be to add two icons one for requesting the proposal starting with (an) upper case letter(s) and one icon for requesting the proposal starting with (an) lower case letter(s).
But why should one wish to force entries to start lower case? Surely even in German using upper case to start "key words" other than names and nouns is not incorrect?
>23 Nicole_VanK: "But why should one wish to force entries to start lower case? Surely even in German using upper case to start "key words" other than names and nouns is not incorrect?"
LT has no rules about tags starting upper or lower case letters. Tags are subject to consens and votes. However languages have a long history and tradition. The nature of the problem addressed is technical. The scope of this bug report is to improuve the tag proposals in order to complay to the usage established in the corresponding languages; i.e. not the usage in LT but in the outside world.
P.S. Sorry about the confusion in the first posts. I did not want to reduce the subject to two rules only. So far two issues have been addressed. More tag related issues may arrive.
Example: handling of non letter characters in tags; see /tag/Al-Biruni and /tag/Al Biruni.
Yes, languages have long history and traditions. But is there any language zone where starting lemmata (key words as in encyclopedias or library catalogues) with an upper case letter is actually incorrect - as in unallowed - spelling?
Anyway: much as I find this interesting, I don't see any bug.
#25: I was going to say Zulu, but despite the camelCase words, it looks like the Zulu Wikipedia still capitalizes the start of words. So ... Klingon.
Klingon has neither a wikipedia site nor an LT site, so that's not much of a concern, I think.
> 26: Thanks for the info on Zulu and Klingon though. Fascinating.
Some answers, then I close this:
1. The data did in fact come from Wikipedia. That's why upper-case predominates.
In an ideal world, we'd have used a source that used lower-case, because lower-case is the more common way of tagging things (German nouns excepted, of course). Because it's the more common way, lower-case tags also tend to "win" on the global tag level. Having the US site mostly lowercase and the non-English sites uppercase presents problems. But the problems are not serious—upper case is never wrong for these languages—and they came as the byproduct of a huge win, millions of tag translations we simply could not have otherwise come by.
2. Your catalog's tags are case sensitive to the maximum degree.
3. Tag combination is NOT case sensitive. That is, "Dog" and "dog" are automatically combined, and can't be separated. I can't see many (any?) cases this misfires.
4. If you want lower-case in a language, you can suggest new translations. But it's going to be an enormous slog! I suggest we engage in more fruitful pursuits.
#29: I can't see many (any?) cases this misfires.
Smith and smith combines some works about metalworking into a tag that's largely about people named Smith. Mark and mark (alternate tagmash) combines some works with a mark in them with books written by or about Mark. I can't find works on polish in the Polish tag, but it's certainly conceivable.
Except that many of us do use the capitalized tags anyway, so there would always be this problem.
Seems like a cost-benefit thing. You could always change your "smith" tags to "smithing" to avoid the problem. Or "mark" to "marked". Would hate to throw the baby out with the bathwater.
Let's also not forget that, originally, all tags were lower-case. It was impossible to capitalize them. So separating tags on the basis of whether they're lower-case or upper-case would require a lot of people to edit a lot of tags.
This topic is part of LibraryThing's in-talk bug tracking.
Join or watch Bug Collectors to get "Bug Tracking" under "The World" in Talk all the time.
Category: Non-English LibraryThing
Assigned to all
Reported by gangleri
Status: Closed by staff
Oct 16, 2012, 11:35pm
6 years since last change
This topic is not marked as primarily about any work, author or other topic.