arrowCIRSS Home arrow Events arrow CIRSS Seminars arrow Event Details


CIRSS Seminar - Extracted Features: Opening Access to HathiTrust and Beyond


Friday, September 27, 2019
4:00pm - 5:00pm

126 IS

Event Details

Session leaders: J. Stephen Downie, iSchool Professor
Description: The HathiTrust Digital Library (HTDL) contains nearly 17 million volumes (approximately 6 billion pages). Unfortunately, around 11 million HTDL volumes are under copyright restrictions and cannot be shared directly with users. To overcome this problem, the HathiTrust Research Center (HTRC) is creating a set of “non-consumptive research” services to make these closed materials more open and thus useful to scholars. This talk introduces such non-consumptive services as “Data Capsules,” “Extracted Features” and the “Bookworm + HathiTrust” tool. It will pay particular attention to the opportunities and challenges surrounding the creation, use and sharing of Extracted Features. Each HTRC service is designed to open new points of access to otherwise closed data, while still respecting all copyright limitations. Examples of real-world Digital Humanities research projects and services that have been using the HTRC Extracted Features data will highlighted.

[Based on a series of invited talks given at Peking University (China), the University of Tsukuba (Japan), the University of Wuppertal (Germany), and the University of Göttingen (Germany).]

* Biosketch:
J. Stephen Downie is Associate Dean for Research and a Professor at the School of Information Sciences at the University of Illinois at Urbana-Champaign.  He is the Illinois Co-Director of the HathiTrust Research Center (HTRC), and leads the Hathitrust + Bookworm (HT+BW) text analysis project, which develops tools to visualize the evolution of term usage over time. Professor Downie represents the HTRC on the NOVEL(TM) text mining project and the Single Interface for Music Score Searching and Analysis (SIMSSA) project, both funded by the SSHRC Partnership Grant programme. These projects share a common thread of striving to provide large-scale analytic access to copyright-restricted cultural data.

Professor Downie has been active in the establishment of the Music Information Retrieval (MIR) community through his ongoing work with the International Society for Music Information Retrieval (ISMIR) conferences. He was ISMIR's founding President and recently served on the ISMIR board. Professor Downie holds a BA (Music Theory and Composition) along with a Master's and a PhD in Library and Information Science, all from the University of Western Ontario, London, Canada.


Related People