Quarry links for author info

Recommend Site Improvements

Oct 22, 2013, 8:24am Top

gangleri came up with an interesting idea (http://www.librarything.com/topic/160312). Would it be possible to quarry sites like Wikipedia, IMDb, VIAF, etc. for links to author pages? Sure, we can (and do) add them one by one - but this could give a boost to connecting LT author pages to relevant pages elsewhere.

I do see a few snags, and also I have no idea if this is technically possible.

Oct 22, 2013, 8:38am Top

At least some Wikipedia sites were put in automatically. Most were right, but not all.

Oct 22, 2013, 10:04am Top

I'm not convinced that automatically entering the sites is necessarily a good idea. I have found tons of Wikipedia links that go, not to the author page, but to a Wikipedia disambiguation page, or, worse, to the page of some totally irrelevant person who happens to have the same name as the author.

Oct 22, 2013, 10:10am Top

The other problem with it was often the same page ended up several times on the same author page. I have also deleted many of these pages that were either wrong or duplicates. It is sort of like using Amazon to enter books. Looks fast until you figure in the time it takes to correct things.

Edited: Oct 22, 2013, 10:14am Top

>3 lilithcat:/4: That's why I say: I see a few snags.

Oct 22, 2013, 10:21am Top

> 4

The other problem with it was often the same page ended up several times on the same author page.

That generally happens when author pages are combined. Why people don't clean up after themselves, I cannot fathom.
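
For what it's worth, the clean-up after combining could in principle be automated. This is only a sketch under my own assumptions: that each author page's links are available as a plain list of URLs, and that two URLs count as "the same page" when host, path and query match (ignoring http/https and a trailing slash). The names `normalize` and `merge_links` are made up for illustration and are not anything LT actually exposes.

```python
from urllib.parse import urlsplit


def normalize(url: str) -> str:
    """Build a comparison key: lowercased host + path (no trailing slash) + query."""
    parts = urlsplit(url.strip())
    host = (parts.hostname or "").lower()
    path = parts.path.rstrip("/")
    query = f"?{parts.query}" if parts.query else ""
    return f"{host}{path}{query}"


def merge_links(*link_lists):
    """Merge several link lists, keeping the first occurrence of each page."""
    seen = set()
    merged = []
    for links in link_lists:
        for url in links:
            key = normalize(url)
            if key not in seen:
                seen.add(key)
                merged.append(url)
    return merged


# Two hypothetical author pages being combined; the Wikidata URLs are from this thread.
page_a = ["https://www.wikidata.org/wiki/Q102178"]
page_b = ["http://www.wikidata.org/wiki/Q102178/", "https://www.wikidata.org/wiki/Q740308"]
print(merge_links(page_a, page_b))
```

This only catches exact repeats of the same address. Links that are simply wrong would still need a human to look at them.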

Oct 22, 2013, 11:56am Top

Please see:
UNICEF (Q740308) https://www.wikidata.org/wiki/Q740308
International Labour Organization (Q54129) https://www.wikidata.org/wiki/Q54129

Wikidata is the central place to add information for Wikimedia Foundation projects. There you find
a) the interlanguage links for a page
b) properties for these pages (aliases, date of birth, place of birth etc.)
c) authority control identifiers such as VIAF, LCCN, BnF, SUDOC, NLI, BNE, LIBRIS, GND (DNB) etc.
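
To make point c) concrete, here is a rough sketch of how the identifiers could be read out of a Wikidata item once its JSON has been fetched. It assumes the entity layout used by Wikidata's JSON output (`claims` → property ID → list of statements → `mainsnak` → `datavalue`). The property IDs in the table (P214 for VIAF, P244 for the Library of Congress ID, P227 for GND) are quoted from memory and should be checked on wikidata.org before relying on them. The function name and the sample item are invented for illustration; the sample is not a real record.

```python
# Assumed mapping of Wikidata property IDs to identifier names; verify before use.
AUTHORITY_PROPERTIES = {"P214": "VIAF", "P244": "LCCN", "P227": "GND"}


def authority_ids(entity: dict) -> dict:
    """Collect authority control identifiers from one Wikidata entity dict."""
    found = {}
    claims = entity.get("claims", {})
    for prop, label in AUTHORITY_PROPERTIES.items():
        values = []
        for statement in claims.get(prop, []):
            datavalue = statement.get("mainsnak", {}).get("datavalue")
            # Statements marked "no value" / "unknown value" carry no datavalue.
            if datavalue is not None:
                values.append(datavalue["value"])
        if values:
            found[label] = values
    return found


# Hand-written placeholder item (NOT real data), shaped like the Wikidata JSON.
sample_entity = {
    "id": "Q0",
    "claims": {
        "P214": [
            {"mainsnak": {"snaktype": "value", "property": "P214",
                          "datavalue": {"value": "0000000", "type": "string"}}}
        ],
        "P227": [{"mainsnak": {"snaktype": "novalue", "property": "P227"}}],
    },
}
print(authority_ids(sample_entity))
```

The result could then be compared with whatever links an LT author page already has, which is the comparison asked about in this thread.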

I started working at wikidata.org two weeks ago. Before that I was absent from the internet for more than one year.

My short Wikidata user url is https://www.wikidata.org/?curid=16710847#

Please take a look at that work. I read that the number of edits (mainly by bots) has exceeded 150,000,000. Best regards, gangleri

In order to transclude the data from Wikidata to en.Wikipedia I made this edit:

14:59, October 22, 2013 (diff | hist) . . (+101)‎ . . FIDE ‎ (→‎External links: {{Authority control|NOTES=transcluded_values_from_wikidata|REMARK=testcase|TIMESTAMP={{SUBST:CURRENTTIMESTAMP}}}}) (current)

Please look at the bottom of the page in en.Wikipedia at https://en.wikipedia.org/?curid=11146

Look at the link I added today to
http://www.librarything.com/author/fdrationinternationa FIDE (World Chess Federation)

I added there https://www.wikidata.org/wiki/Q102178 .

Edited: Oct 22, 2013, 12:10pm Top

> 7 I added lots of links to different sites related to chess players, Yiddish writers, Esperanto writers etc. I would like to query which authors have a link to ro.Wikipedia, to is.Wikipedia, to he.Wikipedia etc.

Then I would like to compare the values in Wikidata and LibraryThing.
Please note that we have in LibraryThing a lot of pages related to translators. Many of them do not have a page in any Wikipedia. Some have never been mentioned in the books and many do not have authority control records (yet). In Wikidata it is possible to create such entries even if no article exists.

Please see http://www.librarything.com/work/11767670 Alles über Wikipedia und die Menschen hinter der größten Enzyklopädie der Welt edited by Boris Marinov (Boris Marinov) and the LT author Markus Cyron. Please look at the "Other authors" section for that work.

Oct 22, 2013, 12:18pm Top

Well, we can actually put up such links manually. Anywhere we wish. In itself that's no problem. The problem (as I see it) is that with the gazillions of authors, editors, illustrators, translators, etc. it's a huge job.

Edited: Oct 22, 2013, 1:18pm Top

As to possible problems: to use just one practical example - myself. If you google my name (Matthijs van Klaveren) you will also find info about a guitarist / music journalist who happens to share my name (https://twitter.com/MonsieurKlaver, for instance). Also I wasn't born in 1752 (http://www.hjmwijers.nl/le-26.htm)

Oct 22, 2013, 1:01pm Top

> 9 and > 10 I see. I know that it is a Sisyphean task. I have a JavaScript tool and added 7,000 CK entries in two weeks' time.


I looked up the authority control identifiers for

Organisation for the Prohibition of Chemical Weapons
There I learned about
award received Nobel Peace Prize
point in time 2013

Could somebody check whether there is an LT author page? Thanks in advance.

Edited: Oct 22, 2013, 1:32pm Top

Organization for the Prohibition of Chemical Weapons

Based in my home town. But, as far as I can see*, not an LT author yet.

*One has to be careful, with all the possible translations.

Oct 22, 2013, 1:32pm Top

A number of years ago, Tim did create a program that scoured Wikipedia (English) for author pages. If you see a link that says Wikipedia (unconfirmed) it's from that process.

The reason he set up the results the way he did (confirm/edit/delete) is because there were so many false hits.

There's really no way to remove the false hits except by human intervention, one at a time. As I said, it's been years, but even with crowdsourcing, there's still plenty of them that have never been confirmed or deleted.

Unfortunately, databases are created by humans and scouring programs are written by humans. There's no way to verify the results without human intervention. Even WorldCat data can be wrong or muddled.

Oct 22, 2013, 1:33pm Top

Note: There is a bug in the links in the helpers log for links. Authors specified in a disambiguation notice are listed without the -digit identifier.
It is a pretty old bug.

Edited: Oct 25, 2013, 6:52am Top

Hi! I started to add links to the Romanian National Library. Please search for the string alephnew.bibnat.ro in the 7-day helpers log
Ausländer, Rose , Celan, Paul == Paul Celan , Sterbling, Anton == Anton Sterbling etc.

I wonder if such links (links to this site) are available elsewhere. Thanks for any help.

P.S. I experimented with "Touchstones".

Feb 20, 2014, 3:59pm Top

Bump. The original title of this thread is "query". The links were added via crowdsourcing. The question is:
How can a list of author pages having links to a particular.subdomain.domain be generated? The helpers log can be used for at least the last 7 days.

P.S. This is not the place to discuss / comment adding particular links in batch mode / via (ro-)bots, imports, etc.
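
To illustrate the kind of list meant here: if the links could somehow be obtained as (author page, URL) pairs, the filtering itself would be simple. The sketch below assumes exactly that. No such export is described in this thread, so the input format, the function name `pages_linking_to` and the sample records (author names and the record path) are all hypothetical; only the host alephnew.bibnat.ro and the Wikidata URL come from the posts above.

```python
from collections import defaultdict
from urllib.parse import urlsplit


def pages_linking_to(host, link_records):
    """Group URLs by author page, keeping only links to `host` or its subdomains."""
    host = host.lower()
    result = defaultdict(list)
    for author_page, url in link_records:
        link_host = (urlsplit(url).hostname or "").lower()
        if link_host == host or link_host.endswith("." + host):
            result[author_page].append(url)
    return dict(result)


# Hypothetical records, standing in for whatever the helpers log would provide.
records = [
    ("author-a", "http://alephnew.bibnat.ro/example-record"),
    ("author-a", "https://www.wikidata.org/wiki/Q102178"),
    ("author-b", "https://www.wikidata.org/wiki/Q740308"),
]
print(pages_linking_to("alephnew.bibnat.ro", records))
```

Passing a shorter name such as `bibnat.ro` would also match its subdomains, which covers the "particular.subdomain.domain" case in the question.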

Feb 20, 2014, 4:43pm Top

Gangleri, once again, can I ask that you please use only one of your accounts for posting to any given thread?

