The news below is from the Internet Archive Blogs: http://blog.archive.org/2013/04/11/450000-early-journal-articles-now-available/
Internet Archive announces today the addition of over 450,000 journal articles from the JSTOR Early Journal Content collection. Early Journal Content is a selection of pre-1923 materials from more than 350 journals and includes articles in the arts and humanities, economics and politics, and mathematics and other sciences. This content was digitized by JSTOR and is freely available through jstor.org, and it can now also be accessed and downloaded via archive.org.
Heidi McGregor from JSTOR said, “We’re happy to work with the Internet Archive to broaden access to the JSTOR Early Journal Content even further, offering people the ability to use it alongside other Internet Archive held collections.”
All 2 terabytes of the Early Journal Collection are available for bulk harvesting from the Internet Archive. Web search engines have been indexing the full-text contents of these materials already and, so far, people and robots have downloaded the articles over 400,000 times even before it has been announced. A data bundle including OCR text and metadata is also available from JSTOR’s Data for Research service for free downloading.