University of Illinois

Current Projects

Current research projects are listed alphabetically by project title. Use this drop-down menu to sort by researcher last name.

Collaborative Research: DAT: From grant to commercialization : an integrated database which can trace, assess, and measure the impact of scientific funding

Abstract

This project will develop a freely available database that links Medline papers and U.S. patents, through identification of individuals who authored both papers and patents and analysis of citations between papers and patents. These patent-paper-author links will then enable identification of similar organizations and in some cases, science/technology field and geography. Co-authorship networks for scientists will also be prepared, annotated and made available on the Dataverse Network System (DVN), analogously to what has been done for inventors in the patent record (NSF proposal 0830287). This integrated database will enable researchers to investigate how 1) grants enable papers, 2) papers influence patenting, and 3) scientific knowledge ultimately diffuses and influences the entire patent record. All data will be made available and linked to tools at the Harvard-MIT Data Center.

Principal Investigator

Vetle Torvik

Funding

$445,165, National Science Foundation

Data Conservancy

Abstract

The Data Conservancy is a $20 million initiative led by Sayeed Choudhury at the Johns Hopkins University Sheridan Libraries. The five-year award, one of the first two in the NSF’s DataNet program, is building infrastructure for the management of the ever-increasing amounts of digital research data. The CIRSS team will receive approximately $2.9 million for their contributions to multiple aspects of the project. The Data Practices team (Palmer, Cragin, MacMullen, & Chao) is investigating expectations and requirements across the scientific communities served by Data Conservancy, establishing criteria for deposition, sharing, and quality control of data sets. The focus is on the ‘long tail of small science,’ and how to support collecting and sharing of the highly variable types of data produced by individual scientists and small research groups. The Data Concepts team (Renear, Dubin, & Sacchi) is developing formal terminology and identity conditions for fundamental data concepts. This work will provide the foundation for standardizing how Data Conservancy datasets are identified, described, related, and organized. The CIRSS research activities and other Data Conservancy efforts are feeding directly into two professional training programs at GSLIS—the Data Curation specialization in the Master's of Library and Information Science and the Biological Information Specialists master’s in the campus-wide bioinformatics program, as well as professional development in data curation principles, processes, and technologies.

Principal Investigator

Carole Palmer

Funding

$2,900,000, National Science Foundation

Defining and Solving Key Challenges in Microblog Search

Abstract

Microblogging services like Twitter are becoming an important part of how many people manage information in their day to day activities. As microblog traffic increases (Twitter currently sees about 50 million tweets per day) information management and organization will become keen problems in this area. The project will define the core problems in microblog search and propose solutions to these challenges in the form of both theoretical models and prototype search systems.

Principal Investigator

Miles Efron

Funding

$45,563, Google

Digital Collections and Content

Abstract

As efforts to integrate and federate digital resources proceed apace, we are learning more about the problems that emerge at different levels of scale and granularity. Building on prior work of the Digital Collections and Content project (DCC), we will investigate and implement a systematic approach that confronts these problems and offers robust means for adding value and improving access to existing digital aggregations.

Website

http://imlsdcc.grainger.uiuc.edu/research.htm

Principal Investigator

Carole Palmer

Extending Data Curation to the Humanities: Curriculum Development and Recruiting

Abstract

Specific objectives are: 1. Develop and refine a humanities data curation curriculum. 2. Develop a network of internship sites at libraries, museums, digital archives and digital humanities centers. 3. Promote the role of LIS professionals in humanities data curation and expand the understanding of the role of digital data curation in humanities research. 4. Disseminate best practice reports and provide a model curriculum for other data curation programs. 5. Develop a model institute for delivering the curriculum as continuing professional development. 6. Deliver the curriculum to both new masters students and continuing professionals.

Co Principal Investigator

Allen Renear, Carole Palmer

Funding

$892,028, Institute of Museum and Library Services

Hidden History of U.S. Telecommunications

Abstract

An archivally grounded research project on telecom system development between the 1870s and the 1950s.

Principal Investigator

Dan Schiller

LAMP II

Abstract

Library Access Midwest Program, also known as LAMP (www.lisaccess.org) was launched in 2006 to address a severe regional shortage of library and information science (LIS) professionals from underrepresented groups. With support from IMLS, Midwestern LIS leaders have been successful in designing and piloting a model to promote LIS careers, with special emphasis on recruitment and retention of students from statistically and historically underrepresented populations. LAMP II will build on these initial achievements.

Principal Investigator

Rae-Anne Montague

Funding

$826,502, Institute of Museum and Library Services

Meeting the Challenge of Language Change in Text Retrieval with Machine Translation Techniques

Abstract

In order for older texts to be searchable, contemporary English needs to be translated into language from various historical timeframes. The project will develop software that will let people enter a query in contemporary English, and search over English texts throughout history—from Medieval times to the present day. The project will mostly involve training statistical models that assign probabilities of the translation to a word or phrase in a target English language. The project will also look at how to display results in order to provide the user with the most probable answer to the query.

Principal Investigator

Miles Efron

Funding

$49,429, Google

Mix IT Up! Youth Advocacy Librarianship

Abstract

Mix IT Up! Youth Advocacy Librarianship focuses on creating intentionally structured, youth­-centered, engaged learning opportunities related to information technologies. Mix IT Up! enhances youth services by developing a library and information science (LIS) specialization that dovetails with community informatics and youth service in order to focus on systematically training librarians as youth advocates. Mix IT Up! actively recruits youth advocacy fellows from traditionally underrepresented groups in LIS - American Indian, Latino/a, African American, and working­ class - by providing academic and financial support. Fellows are placed in long-term youth advocacy projects that partner with community organizations serving youth. Fellows also engage with selected community affiliates and organizational participants in the Youth Advocacy Alliance (YAA). The alliance will: gain the services of emerging information professionals, including those from underrepresented groups; have the opportunity to apply for small amounts of funding in support of particular projects and/or travel; and be able to provide the youth in their organization rich opportunities for community engagement.

Website

http://mixituplis.wordpress.com/

Principal Investigator

Rae-Anne Montague

Funding

$904,314, Institute of Museum and Library Services

Narratives and New Media: Guiding Young People's Stories

Abstract

Principal Investigator

Carol Tilley

Next Generation Digital Federations: Adding Value through Collection Evaluation, Metadata Relations and Strategic Scaling

Abstract

This project will investigate and implement a systematic approach to developing useful, meaningful, and usable digital collections. Building on the prior work of the IMLS Digital Collections and Content (DCC) project, the researchers will explore how to use the relationships between collection-level and item-level metadata in federated digital repositories to preserve content and make the content more useful for scholars and the public. The project will experiment with and test metasearch capabilities, and expand and improve the IMLS DCC with new IMLS-funded and other digital content and advanced search capabilities.

Principal Investigator

Carole Palmer

Funding

$975,903, Institute of Museum and Library Services

Open Annotation Collaboration II

Abstract

The overarching goals of the Open Annotation Collaboration (OAC) are to facilitate to emergence of a Web and resource-centric interoperable annotation environment that allows leveraging annotations across the boundaries of annotation clients, annotation servers, and content collections, to demonstrate the utility of this environment, and to see widespread adoption of this environment. To this end the OAC has made available the draft annotation data model and ontology developed during Phase I. OAC Phase II focuses on directly engaging humanities scholars and involving existing collections of digital content that have well-defined communities of scholars interested in annotating such content.

Principal Investigator

Tim Cole, Larry Jackson, Allen Renear, Carole Palmer

Funding

$362,00, The Andrew W. Mellon Foundation

Preparing Future Faculty: Enhancing the Doctoral Program

Abstract

This project will enhance the GSLIS doctoral program by building a stronger research community within the school for the study of information in society, including policy, economic, and historical dimensions. Project goals include enhancing the doctoral program curriculum; connecting the research community to the wider world of librarianship; and attracting and supporting thirteen diverse students, especially those from underrepresented groups, with a specific focus on recruiting doctoral students who will teach master's students capable of becoming future leaders in public, academic, and school libraries.

Principal Investigator

Linda C. Smith

Funding

$990,234, Institute of Museum and Library Services

Preserving Virtual Worlds II

Abstract

The original Preserving Virtual Worlds project, funded by the Library of Congress’s National Digital Information Infrastructure and Preservation Program (NDIIP), investigated what preservation issues arose with computer games and interactive fiction, and how existing metadata and packaging standards might be employed for the long-term preservation of these materials. PVW2 will focus on determining properties for a variety of educational games and game franchises in order to provide a set of best practices for preserving the materials through virtualization technologies and migration, as well as provide an analysis of how the preservation process is documented.

Principal Investigator

Jerome McDonough

Funding

$785,898, Institute of Museum and Library Services

Sharing Success: Training Educational Leaders for Youth Services Librarianship

Abstract

Eight scholarships will be offered over three years to outstanding and diverse students admitted to the Certificate of Advanced Studies (CAS) program. This program will provide continuing education by offering outstanding library practitioners the opportunity to continue their education in a topic related to youth services, and by providing institutional support for these students to develop continuing education workshops for others. The two central goals of this project are: 1) leaders in the youth services library profession will provide quality continuing education for their practitioner peers in school and public libraries, and 2) leaders in the youth services library profession will meaningfully contribute to best practices and research in this field.

Co Principal Investigator

Christine Jenkins, Carol Tilley

Funding

$364,925, Institute of Museum and Library Services

Software Environment for the Advancement of Scholarly Research (SEASR)

Abstract

SEASR fosters collaboration by empowering scholars to share data and research in virtual work environments. This eases scholars’ access to digital research materials, which currently are stored in a variety of incompatible formats.

Principal Investigator

John Unsworth

Funding

$359,860, The Andrew W. Mellon Foundation

Structural Analysis of Large Amounts of Music Information

Abstract

Structural analysis of music (formal analysis) is one of the most fundamental analyses performed by music researchers, usually preceding any other types of analysis because it provides the overall view of the piece. Its importance is reflected by the fact that the course on formal analysis is often one of the core music undergraduate music curricula with several major textbooks on the subject. The main goal of formal analysis is to find similar sections within a piece of music and labeling these section such as, ABA and ABCB'A or with further analysis these sections can be marked with predefined labels such as Intro, Verse, Verse, Bridge, Chorus, Verse, and Outro (popular music) or Introduction, Exposition, Development, Recapitulation, and Coda (sonata form). Thus, the formal analysis is useful in classifying different genres of music and it can be used to compare different styles of composition within a composer’s works or between composers. It can also be used to understand historical influences over time and location. By analyzing large sets of music, new discoveries can be made about these questions.

Principal Investigator

J. Stephen Downie

Funding

$99,476, National Science Foundation

Towards Evidence Based Discovery

Abstract

Vast quantities of electronic information provide a unique opportunity for scientists identify candidate solutions for grand challenges as scientists, policy makers, and students have never had access to more electronic information than they do today. The goal in this research is to develop new text mining methods that are consistent with the manual processes that experts currently used to resolve contradictory and redundant evidence. Both discovery and synthesis are difficult activities even for people, so a socio-technical strategy will be required to achieve this goal.

Key outcomes from this study will be:
- A longitudinal study of manual discovery and synthesis behaviors of a diverse network of faculty, policy makers, and students.
- Advances in natural language processing methods that automatically identify concepts and relationships, detect entailment and paraphrasing, and generate multi-document summaries.
- A collection of gold standards that reflect diverse and realistic information needs that will drive further research in natural language processing.
- Increased understanding of the degree to which text mining methods assist in discovery and synthesis activities through a series of qualitative and quantitative user studies.
- A set of "next generation" scientists who are well prepared to explore complex research questions that span disciplines.
- Increased awareness and support for the "human side of discovery" through courses, and workshops.

Website

http://ebd.lis.illinois.edu/

Principal Investigator

Catherine Blake

Funding

$449,317, National Science Foundation

Young People's Voices about Comics in Reading Guidance Literature, 1938-1955

Abstract

Principal Investigator

Carol Tilley