arrowCIRSS Home arrow Research Areas arrow Data Analytics

Data Analytics

Cirss - SODA

Research and education initiatives focused on the design, development and evaluation of new technologies to better understand the dynamic interplay between information, people and systems.  Areas of expertise include data and text mining, knowledge discovery and collaboration, computational social science, and human-computer interaction.


Catherine Blake (

Associate Director, Center for Informatics Research in Science and Scholarship

Current People

Recent Publications

Guo, J., Blake, C., & Guan, Y. (2019). Evaluating automated entity extraction with respect to drug and non-drug treatment strategies. Journal of biomedical informatics, 94, 103177. Read more

Blake, C., & Kehm, R. (2019). Comparing breast cancer treatments using automatically detected surrogate and clinically relevant outcomes entities from text. Journal of Biomedical Informatics: X, 1, 100005. Read more

Blake, C., Kim, J., Mathur, M., & Aggarwal, A. (2019). A gap analysis of survivorship terminology: Knowledge resources versus literature usage. Proceedings of the Association for Information Science and Technology, 56(1), 612-614. Read more

Lee, J., Blake, C., & McInnes, B. T. (2019). Evaluating ontology coverage and internal structure to better align patients with clinical trials. Proceedings of the Association for Information Science and Technology, 56(1), 157-165. Read more

All related publications

Current Projects

Information Forum Requirements to Promote Knowledge Capture Knowledge Sharing and Community Interaction on the VHA Data Portal

CIRSS researchers are collaborating with the US Department of Veterans Affairs (VA) Information Resource Center (VIReC) on a project to analyze the socio-technical aspects of VA’s HSRData-L Listserv. VIReC is a VA Health Service Research & Development Service (HSR&D) resource center that supports VA researchers in need of information about data resources specific to their research. HSRData-L is a virtual community of VA researchers who share their collective knowledge and experience about VA data and information systems for the betterment of research focused on Veteran’s issues.

The project aims to better understand essential socio-technical aspects of the listserv and the information ecology in which it sits to maximize the findability and re-use of information generated by VA data users and to provide recommendations for improving its infrastructure.

PIs: Catherine Blake, Maria Souden (Veterans Affairs); Co-PI: Mike Twidale

All related projects

Recent News

September 24, 2019

iSchool researchers present at ro2019

CIRSS researchers will present their work at the Workshop on Research Objects 2019 (ro2019), which will be held in conjunction with eScience 2019 on September 24-27 in San Diego, California. The Research Objects approach proposes a way to "packa…

September 17, 2019

Underwood to discuss machine learning at Sawyer Seminar

Professor Ted Underwood will present his research on machine learning at the University of Pittsburgh on September 19. His talk is part of the University's Sawyer Seminar, a year-long project funded by The Andrew W. Mellon Foundation that brings …

May 6, 2019

National Humanities Center Fellowship supports Underwood book project

With support from the National Humanities Center, Professor Ted Underwood is examining patterns of human perspective throughout two centuries of literary history. The Center is a residential institute for advanced study in art history, classics, lang…

May 2, 2019

iSchool names new MS program directors

The iSchool is pleased to announce the appointment of new directors for its master's degree programs. Associate Professor Maria Bonn will serve as program director for the MS in library and information science (MS/LIS), and Associate Professor Ca…

More news

Past Events

May 7, 2021

Open Science and its Trusted Products

Open science is a principle of openness that, when applied across the entire scientific research enterprise, holds significant promise to advance the frontiers of knowledge and help ensure a nation&rs…

April 16, 2021

Reasoning with multiple data streams: An expert informed logic-based approach to improve data and model fidelity in Geoscience

Unfortunately, data streams are often semantically and conceptually misaligned to reuse needs, and inadequate documentation of uncertainty can limit data reuse. The goal of this research is to (a) eli…

April 9, 2021

The Social Media Macroscope

The Social Media Macroscope ( is an open-source project created here at the U of I in which we give access to researchers, students, and general users without coding and …

All events