HathiTrust科研中心:开拓新的合作机遇
时间: 2016-05-24  作者:   浏览次数: 935

报告人:杰·斯蒂芬 唐尼教授


报告人简介:

杰·斯蒂芬 唐尼教授是伊利诺伊大学香槟厄巴纳分校图书馆与信息科学研究生院主管科研的副院长,也是伊利诺伊HathiTrust科研中心(HathiTrust Research Center简称HTRC)的联合主管、国际音乐信息检索系统评估实验室(InternationalMusic Information Retrieval Systems Evaluation Laboratory简称IMIRSEL)主管、以及音乐信息检索技术评估交流竞赛(MusicInformation Retrieval Evaluation eXchange简称MIREX)的创始人及现任主管。唐尼博士也曾是音乐分析网络平台项目(Networked Environment for Music Analysis简称NEMA)的项目负责人,该项目是由Andrew W. Mellon基金会资助的一项项目。同时,他也曾是大量音乐信息结构化分析项目(Structural Analysis of Large Amounts of Music Information简称SALAMI) 的联合项目负责人,该项目是由国家自然基金会(NationalScience Foundation简称NSF)、加拿大社会科学及人文研究协会(Canadian Social Science and Humanities Research Council简称SSHRC)以及英国联合信息系统委员会(Joint InformationSystems Committee简称JISC)联合资助的一项项目。唐尼博士是HathiTrust书虫文本分析项目(Hathitrust+Bookworm简称HT+BW)的领导人,这是一个创建视觉化效果工具来展示词汇用法是如何随时间演变的项目。唐尼教授还曾是HTRC项目中NOVEL(TM)文本挖掘项目以及创建乐谱搜索与分析的统一界面项目(Single Interface for Music Score Searching and Analysis简称SIMSSA)的首席代表,这两项项目都是由SSHRC Partnership Grantprogramme资助。所有以上提到的项目都是致力于为受版权保护的文化资源数据提供大规模的分析性数据接口。唐尼教授通过与国际音乐信息检索学术讨论会(International Society for Music Information Retrieval简称ISMIR)相关的工作,一直致力于音乐信息检索领域(Music InformationRetrieval简称MIR)的组织建设。他曾是ISMIR的创始理事长,现任ISMIR理事。在过去的一年里,唐尼教授一直与敦煌研究院合作,在“数字化敦煌”项目中帮助建立与促进数字化人文学科学者与莫高窟高分辨率数字化资源间的交流。唐尼博士是数字化图书馆联合会议(Joint Conference on Digital Libraries简称JCDL)指导委员会成员。他曾在加拿大西安大略大学(University of Western Ontario, London, Canada)取得音乐理论与作曲学士学位、图书馆与信息科学研究生学位以及图书馆与信息科学博士学位。

 

讲座简介:

这次讲座将介绍HathiTrust科研中心(HathiTrust Research Center简称HTRC)的最新发展情况。HTRCHathiTrust项目的科研分支,而HathiTrust则是一个综合性书籍文献资源仓储,它致力于为学术研究及教育领域就这些资源提供数据接口。HathiTrust目前已与超过100个重要科研机构及图书馆取得合作,共同致力于为文化资源的永久性保存及获取提供保障。HathiTrust的合作成员遍布全球。

    HathiTrust的数字档案拥有超过1420万册书籍文献(合49亿页),这些书籍文献来自于谷歌书籍(Google Books)、各个成员图书馆、因特网档案数字图书馆(the Internet Archive)以及许多私人收藏。HTRC为学术研究提供了许多便利,其途径包括:为非损耗性分析(non-consumptive analyses)提供资源数据接口、开发科学研究工具、促进科研项目及科研组织的形成、提供协助学者开发HathiTrust书籍文献的其他帮助性资源如更加完善和强化的元数据及索引等。

    本次讲座还将介绍HTRC的使命、目标及组织架构,也将会简要介绍近期的各种项目、合作及新方案。将要介绍的项目包括用于学术分析的工作集创建+数据容器(Workset Creation for ScholarlyAnalysis+Data Capsules简称WCSA+DC,由Andrew W. Mellon基金会资助)、HathiTrust+书虫项目(Hathitrust+Bookworm简称HT+BW,由National Endowment for the Humanities资助)。讲座也会介绍HTRC参与的两个由SSHRC Partnership Grant programme资助的项目,即NOVEL(TM)文本挖掘项目以及创建乐谱搜索与分析的统一界面项目(SingleInterface for Music Score Searching and Analysis简称SIMSSA)。由于在HathiTrust的资源中有将近800万的书籍文献无法被获取因为他们仍受版权保护,HTRC一直致力于实现对这些无法直接获取的资源进行非损耗性分析。HTRC的特征提取方案(feature extraction)以及数据容器方案(Data Capsule)就是为实现这一可能而做出的部分努力,这两项新方案也将在讲座中介绍。在讲座的最后还将与与会者讨论交流日后与HTRC可能的合作机遇。

 

报告时间:2016年6月3日(周五)下午2:00

报告地点:独墅湖校区文综楼1005幢太阳成集团tyc234cc主页网站学术报告厅5448

 

欢迎大家听讲!

 

 

 

附:英文介绍

 

Biography

J.Stephen Downie is the Associate Dean for Research and a Professor at theGraduate School of Library and Information Science at the University ofIllinois at Urbana-Champaign. Dr. Downie is the Illinois Co-Director of theHathiTrust Research Center (HTRC). He is also Director of the InternationalMusic Information Retrieval Systems Evaluation Laboratory (IMIRSEL) and founderand ongoing director of the Music Information Retrieval Evaluation eXchange(MIREX). He was the Principal Investigator on the Networked Environment forMusic Analysis (NEMA) project, funded by the Andrew W. Mellon Foundation. Hewas Co-PI on the Structural Analysis of Large Amounts of Music Information(SALAMI) project, jointly funded by the National Science Foundation (NSF), theCanadian Social Science and Humanities Research Council (SSHRC), and the UK’sJoint Information Systems Committee (JISC). Dr. Downie is the leader of theHathitrust + Bookworm (HT+BW) text analysis project that is creating tools tovisualize the evolution of term usage over time. Professor Downie representsthe HTRC on the NOVEL(TM) text mining project and the Single Interface forMusic Score Searching and Analysis (SIMSSA) project, both funded by the SSHRCPartnership Grant programme. All of these aforementioned projects share acommon thread of striving to provide large-scale analytic access tocopyright-restricted cultural data. Stephen has been very active in theestablishment of the Music Information Retrieval (MIR) community through hisongoing work with the International Society for Music Information Retrieval(ISMIR) conferences. He was ISMIR's founding President and now serves on theISMIR board. Over the past year, Professor Downie has been working withDunhuang Academy on the "Digital Dunhuang" project to help connectDigital Humanities scholars with the high-resolution digital materialscapturing the Mogao Caves. Dr. Downie currently serves on the JCDL steeringcommittee. Professor Downie holds a BA (Music Theory and Composition) alongwith a Master's and a PhD in Library and Information Science, all earned at theUniversity of Western Ontario, London, Canada.

 

The HathiTrust ResearchCenter: Exploring New Collaboration Opportunities

Thislecture provides an update on the recent developments and activities of theHathiTrust Research Center (HTRC). The HTRC is the research arm of theHathiTrust, an online repository dedicated to the provision of access to acomprehensive body of published works for scholarship and education. TheHathiTrust is a partnership of over 100 major research institutions andlibraries working to ensure that the cultural record is preserved andaccessible long into the future. Membership is open to institutions worldwide.

Over14.2 million volumes (4.9 billion pages) have been ingested into the HathiTrustdigital archive from sources including Google Books, member universitylibraries, the Internet Archive, and numerous private collections. The HTRC isdedicated to facilitating scholarship by enabling non-consumptive analyticaccess to the corpus, developing research tools, fostering research projectsand communities, and providing additional resources, such as enhanced metadataand indices, that will assist scholars to more easily exploit the HathiTrustmaterials.

Thistalk will outline the mission, goals and structure of the HTRC. It will alsoprovide an overview of recent work being conducted on a range of projects,partnerships and initiatives. Projects include the Workset Creation forScholarly Analysis + Data Capsules (WCSA+DC, funded by the Andrew W. MellonFoundation) and the HathiTrust + Bookworm project (HT+BW, funded by theNational Endowment for the Humanities). HTRC’s involvement with the NOVEL(TM)text mining project and the Single Interface for Music Score Searching andAnalysis (SIMSSA) project, both funded by the SSHRC Partnership Grantprogramme, will be introduced. The HTRC’s new feature extraction and DataCapsule initiatives, part of its ongoing work its ongoing efforts to enable thenon-consumptive analyses of the approximately 8 million volumes under copyrightrestrictions will also be discussed. Conversations with attendees aboutpossible collaboration opportunities with the HTRC will wrap up thepresentation.