Elsevier opens its papers to text-mining

Erick Peirson's picture

A recent article in Nature News raises some interesting questions about copyright in the age of "distant reading." It's worth reading to the end.

Academics: prepare your computers for text-mining. Publishing giant Elsevier says that it has now made it easy for scientists to extract facts and data computationally from its more than 11 million online research papers. Other publishers are likely to follow suit this year, lowering barriers to the computer-based research technique. But some scientists object that even as publishers roll out improved technical infrastructure and allow greater access, they are exerting tight legal controls over the way text-mining is done.

Update: Why this might not be as wonderful as it sounds: Content Mining: Why you and I should NOT sign up for Elsevier’s TDM service

Survey of Digital Humanities Centers in the US

Erick Peirson's picture

A cool find by Ken: the Council on Library and Information Resources published a report in 2008 called "A Survey of Digital Humanities Centers in the United States," by Diane M. Zorich. The full text is available online. This is a great resource for learning about the DH landscape!


In preparation for the 2008 Scholarly Communications Institute (SCI 6) focused on humanities research centers, the Council on Library and Information Resources (CLIR) commissioned a survey of digital humanities centers (DHCs). The immediate goals of the survey were to identify the extent of these centers and to explore their financing, organizational structure, products, services, and sustainability. The longer-term goal was to provide SCI 6 participants with a greater understanding of existing centers to inform their discussions about regional and national centers.

Digital humanities centers, Ms. Zorich writes, are entities "where new media and technologies are used for humanities-based research, teaching, and intellectual engagement and experimentation. The goals of the center are to further humanities scholarship, create new forms of knowledge, and explore technology's impact on humanities-based disciplines."


Week 3, Tuesday 28 January - Tutorial: Regular Expressions


This tutorial explains the basic of regular expressions. Regular expressions are patterns that a piece of software (called "regular expression engine") uses to find these pattern in specified texts. Regular expressions provide a powerful searching mechanism and a way for to automatically manipulate texts.


Subscribe to Introduction to Digital & Computational Methods in the Humanities (HPS) RSS