{"id":100890,"date":"2011-03-09T11:27:22","date_gmt":"2011-03-09T16:27:22","guid":{"rendered":"http:\/\/dlibwwwcit.services.brown.edu\/cds\/?p=890"},"modified":"2011-03-09T11:27:22","modified_gmt":"2011-03-09T16:27:22","slug":"infrastructure-for-digital-humanities-challenges-for-computational-linguistics-in-mining-million-book-collections","status":"publish","type":"post","link":"https:\/\/library.brown.edu\/create\/cds\/infrastructure-for-digital-humanities-challenges-for-computational-linguistics-in-mining-million-book-collections\/","title":{"rendered":"Infrastructure for Digital Humanities: Challenges for Computational Linguistics in Mining Million Book Collections"},"content":{"rendered":"<p>The Computers in the Humanities Users Group and the Brown University Library present:<br \/>\nInfrastructure for Digital Humanities: Challenges for Computational Linguistics<br \/>\nin Mining Million Book Collections<\/p>\n<p>David Smith<br \/>\nDepartment of Computer Science<br \/>\nUniversity of Massachusetts, Amherst<\/p>\n<p>2:00 PM Tuesday, March 15<br \/>\nBopp Room, John Hay Library<\/p>\n<p>Concerted scanning projects are making significant amounts of data &#8212;<br \/>\nhistorical data in particular &#8212; increasingly available to readers and<br \/>\nresearchers in many disciplines. To make this data useful, researchers<br \/>\nat UMass Amherst are working on improving OCR, language modeling,<br \/>\nmultiple-version alignment, syntactic analysis, information<br \/>\nextraction, and information retrieval. I will focus in particular on<br \/>\ninferring the relational structure latent in books: which books or<br \/>\npassages quote, translate, paraphrase, and cite each other? 
This<br \/>\nresearch requires improvements in modeling translation and other forms<br \/>\nof similarity, as well as improvements in efficiently comparing large<br \/>\nnumbers of passages.<\/p>\n<p>David Smith is a Research Assistant Professor in the Computer Science<br \/>\nDepartment at the University of Massachusetts, Amherst, where he is<br \/>\naffiliated with the Center for Intelligent Information Retrieval. He<br \/>\nholds a Ph.D. in computer science from Johns Hopkins and an A.B. in<br \/>\nclassics from Harvard.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>The Computers in the Humanities Users Group and the Brown University Library present: Infrastructure for Digital Humanities: Challenges for Computational Linguistics in Mining Million Book Collections David Smith Department of Computer Science University of Massachusetts, Amherst 2:00 PM Tuesday, March 15 Bopp Room, John Hay Library Concerted scanning projects are making significant amounts of data <a href=\"https:\/\/library.brown.edu\/create\/cds\/infrastructure-for-digital-humanities-challenges-for-computational-linguistics-in-mining-million-book-collections\/\" class=\"more-link\">&#8230;<span class=\"screen-reader-text\">  Infrastructure for Digital Humanities: Challenges for Computational Linguistics in Mining Million Book 
Collections<\/span><\/a><\/p>\n","protected":false},"author":17,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[179,177,211],"tags":[230,58],"class_list":["post-100890","post","type-post","status-publish","format-standard","hentry","category-announcements","category-blog","category-staff-2","tag-computational-linguistics","tag-digital-humanities"],"_links":{"self":[{"href":"https:\/\/library.brown.edu\/create\/cds\/wp-json\/wp\/v2\/posts\/100890","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/library.brown.edu\/create\/cds\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/library.brown.edu\/create\/cds\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/library.brown.edu\/create\/cds\/wp-json\/wp\/v2\/users\/17"}],"replies":[{"embeddable":true,"href":"https:\/\/library.brown.edu\/create\/cds\/wp-json\/wp\/v2\/comments?post=100890"}],"version-history":[{"count":0,"href":"https:\/\/library.brown.edu\/create\/cds\/wp-json\/wp\/v2\/posts\/100890\/revisions"}],"wp:attachment":[{"href":"https:\/\/library.brown.edu\/create\/cds\/wp-json\/wp\/v2\/media?parent=100890"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/library.brown.edu\/create\/cds\/wp-json\/wp\/v2\/categories?post=100890"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/library.brown.edu\/create\/cds\/wp-json\/wp\/v2\/tags?post=100890"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}