Research Guides: Web Archiving: Web Archiving at VT

Virginia Tech Web Archive

The Virginia Tech Web Archive is located at https://www.archive-it.org/collections/5315.

The Virginia Tech Web Archive (VTWA) is crawled using Archive-It, created by the Internet Archive.
The VTWA was created in February 2015.
Crawls are managed by the Digital Preservation Coordinator.
Crawls are performed biannually and as needed.
The scope covers the Virginia Tech domain (vt.edu).
The scope of the crawl searches for hyperlinks four levels from the original seed.
The web archive is publicly available.
Requests for adding a seed to the Web Archive can be directed to the Digital Preservation Coordinator.
WARC files are also ingested into VTechWorks as needed, such as for Creative Technologies theses.

There are some websites that the University Libraries at Virginia Tech (VTUL) will only capture once for research, grant requirements, or other purposes that do not require long-term capture or for resources that are now static. The Farm Girl at Large collection (https://archive-it.org/collections/6147) is one of these examples maintained by VTUL that is public but is not active.

Archive-It

The VTWA utilizes Archive-It, a subscription web archiving service developed by the Internet Archive. The VTWA is available to the public and can be browsed and searched by keyword. Page text of the captured website can also be searched. Clicking into a seed URL will lead you to many capture versions that can be explored like a typical web page.

We are currently working to create additional metadata per the Web Archiving Metadata schema developed by OCLC. The goal of is to increase searchability and discoverability. More information will be recorded here as updates are implemented.

Web Archiving

Virginia Tech Web Archive

The Virginia Tech Web Archive is located at https://www.archive-it.org/collections/5315.

Archive-It

LIBRARIES

FOLLOW US

CONTACT