Government documents from microfiche are coming to archive.org based on the combined efforts of the Internet Archive, Stanford University Libraries, and other library partners.
The resulting files will be available for free public access to enable new analysis and access techniques.
Microfiche cards, which contain miniaturized thumbnails of the publication’s pages, are starting to be digitized and matched to catalog records by the Internet Archive.
Once in a digital format and preserved on archive.org, these documents will be searchable and downloadable by anyone with an Internet connection, since U.S. government publications are in the public domain.
From web archives to television news to digitized books & periodicals, dozens of projects rely on the collections available at archive.org for computational & bibliographic research across a large digital corpus. This series will feature six sessions highlighting the innovative scholars that are using Internet Archive collections, services and APIs to support data-driven projects in the humanities and beyond.
Want to participate? Register below! Do you have a research project that uses materials from the Internet Archive? We’re offering a Lightning Talks session at the end of our series to give more people an opportunity to share your research with the world. Simply complete our online form to be considered.
Many thanks to the program advisory group:
Dan Cohen, Vice Provost for Information Collaboration and Dean, University Library and Professor of History, Northeastern University
Makiba Foster, Library Regional Manager for the African American Research Library and Cultural Center, Broward County Library
Mike Furlough, Executive Director, HathiTrust
Harriett Green, Associate University Librarian for Digital Scholarship and Technology Services, Washington University Libraries
Kahle, Founder and Digital Librarian of the Internet Archive. Member, National Academy of Engineering, American Academy of Arts and Sciences, and the Internet Hall of Fame
When I started the Internet Archive 25 years ago, I focused our non-profit library on digital collections: preserving web pages, archiving television news, and digitizing books. The Internet Archive was seen as innovative and unusual.
Now all libraries are increasingly electronic, and necessarily so. To fight disinformation, to serve readers during the pandemic, and to be relevant to 21st-century learners, libraries must become digital.
But just as the Web increased people’s access to information exponentially, an opposite trend has evolved. Global media corporations—emboldened by the expansive copyright laws they helped craft and the emerging technology that reaches right into our reading devices—are exerting absolute control over digital information.
These two conflicting forces—towards unfettered availability and completely walled access to information—have defined the last 25 years of the Internet. How we handle this ongoing clash will define our civic discourse in the next 25 years.
If we fail to forge the right path, publishers’ business models could eliminate one of the great tools for democratizing society: our independent libraries.
The following is a guest post by Nicholas Taylor, Information Technology Specialist for the Repository Development Group at the Library of Congress.
Prompted by questions from Library of Congress staff on how to more effectively use web archives to answer research questions, I recently gave a presentation on “Using Wayback Machine for Research” (PDF).
I thought that readers of The Signal might be interested in this topic as well. This post covers the outline of the presentation.
The Wayback Machine that many people are familiar with is the Internet Archive Wayback Machine. The Internet Archive is an NDIIPP partner and a Founding Member of the International Internet Preservation Consortium.
Their mission includes creating an archive of the entire public web; the Wayback Machine is the interface for accessing it. While the Internet Archive has been primarily responsible for the development of Wayback Machine, it is an open source project.
Internet Archive also devised the name “Wayback Machine;” it is a reference to The Rocky & Bullwinkle Show’s homophonous “WABAC” Machine, a time machine itself named in the convention of mid-century mainframe computers (e.g., ENIAC, UNIVAC, MANIAC, etc.). The contemporary Wayback Machine thus appropriately evokes both the idea of traveling back in time and powerful computing technology (necessary for web archiving).