Staff in these locations can work with you and a subject specialist to identify potential resources.
Text and data mining (TDM) are research techniques that use computational tools to identify and extract relevant information or patterns from large data sets or from text-based digital content.
As the use of TDM for research gains popularity, a number of challenges are presented. There are legal, ethical and logistical issues that researchers must consider when selecting sources of text and/or data for analysis. This guide was developed to help Yale researchers identify resources in our collections that may be available to use for TDM projects. It also includes sources that are freely available online.
Text and data mining is highly customized work, with varying timelines from start to conclusion. To carry out a successful project, you will need both access to data and the skills to interact with that data. What these skills entail depends on the data and what you want to do with it.
When starting a project, you need to consider:
Appropriate use of licensed resources:
Most of the library's electronic resources are governed by license agreements that limit use to the Yale community or to individuals who are physically present at Yale University Library facilities.