draft Draft Draft! Draft!!! DRAFT!!! D R A F T ! ! !
This is a policy manual outlining the history and proceedures of the Electronic Theses and Dissertations (ETD) Project at the University of Notre Dame. The manual is divided into the following sections:
The goal of the Project is to systematically collect and disseminate electronic theses and dissertations created by graduate students of the University, and it is a direct implementation of the ETD Project initially developed by Virginia Tech. While paper versions of theses and dissertations are still an option for collection, dissemination, and archival purposes, electronic versions of theses and dissertations provide the means for wider accessibility and dissemination. Using a system such as this one, it is possible to implement search mechanisms against the full texts of each item in the collection. Additionally, it is possible to create a nationwide (and worldwide) network of theses and dissertations exposing the content of this very important primaray literature. For more information about these national/worldwide efforts see the home page of the Networked Digital Library Of Theses And Dissertations:
http://www.ndltd.org/
The enhanced accessibility of electronic theses and dissertations comes a cost, namely in the form of policy decisions, additional workflows, computer technology, as well as data archiving and migration proceedures. This manual is the beginings of a document addressing these issues.
In computer terms, the ETD Project is a database application with a Web front-end. The software was written by Virginia Tech using MySQL as the database back-end and Perl as the common gateway interface (CGI) between the end-user and the data -- the documents and their meta data.
The ETD software does not require very much computer horsepower. Just about any Unix/Linux computer will do. Presently our implementation resides on a computer with a 1 GHz Intel-based microprocessor, 512 MB of RAM, 18 GB of hard disk space, a tape drive, all running under RedHat Linux version 7.1. It is beyond the scope of this document to describe how to install these aspects of the system's infrastructure, but the following paragraphs describe how the Virginia Tech software was installed.
The ETD Project used here is an implementation of the software written by Virginia Tech, and it is described at the following URL:http://scholar.lib.vt.edu/ETD-db/developer/
Instructions for installing the system are outlined at the URL above, but the instructions make a number of assumptions about how the system is to be installed and some of the installation proceedures did not seem to work correctly. Consequently, there are number of thing different in our implementation:
After installation and customization, the various aspects of the system are available at the following URLs:
In the future it might be better to create an aliased host named etd.library.nd.edu or etd.nd.edu in order to make the process of remembering the host's name easier.
(Insert here the sorts of things the graduate school will do. It should probably enumerate who is going to do the work, how the work gets done, and what the format of electronic theses and dissertations will take.)
The roles and responsibilities of the University Libraries can be divided into three parts:
Each of these parts are described below.
The Libraries will maintain the computer infrastructure. This means housing the computer hardware, installing the system's software, configuring the scripts, backing up the data, and providing technical support when things break. The hardware and software maintenance will be done by the Digital Access and Information Architecture Department where regular backups are already being done. Problems with the functionality should be sent to the Department as a whole (daiad@dewey.library.nd.edu), or directed to Eric Lease Morgan (631-8604).
On a regular basis a designated individual in the Cataloging Department will check for the existence of new electronic theses and dissertations to be cataloged. When any are identified, the appropriate personnel will be informed of the titles needing cataloging. The item will then be cataloged following current local and national practices for Notre Dame print theses and electronic resources, including the addition of the item's URL in order to provide direct access from the catalog record to the electronic version of the thesis. Once cataloging is complete, the individual responsible for the record will mark it as cataloged in the ETD system in order to add it to the publicly accessible database.
Preserving and migrating the system's data has got to be one of the most important and challenging aspects of this project because it embodies the greatest number of unknown elements; preserving the data making up the system must go beyond mere system backups.
The Digital Access and Information Architecture Department will regularly copy the system's data. This process will include regularly saving to tape or CDROM the underlying database's content into standard query language (SQL) files -- a plain text files format used by all relational database applications. This data contains the meta data describing each item in the collection. This data includes things like title, abstract, keywords, author, and advisors. It also includs the URN -- Universal Resource Number -- used to link the meta data to the directory where the ETD are saved.
Similarly, the entire collection of ETD will be saved to tape or CDROM for preservation purposes. In the early stages of the Project small number of ETDs will be saved to tape or CDROM. As the Project matures and more ETD become available, a greater number of ETDs will be saved to individual tapes or preservation purposes. Each tape or CDROM will include copies of the ETDs, the metadata from underlying database and a plain text file at the root of each tape or CDROM describing the item's contents.
For redundancy purposes, we advocate the University continue to print theses and dissertations. The files of the ETD Project are Portable Document Format (PDF) files, an industry standard articulated by a company named Adobe. File formats change very quickly, and we hope PDF will be a supported format in the future and/or will easily be able to be converted into other formats when they present themselves. The key to being able to do this convertion is having the data now. That is why efforts are being made to archive this imporant literature.
Date created: 2002-06-10
Date updated: 2002-06-10
URL: http://www.library.nd.edu/daiad/etd-policy/
All libraries:
Architecture | Art
Image | Business Information Center
| Chemistry & Physics
| Engineering | Hesburgh
(Main)
Kellogg/Kroc Information Center |
Life Sciences | Mathematics
| Rare
Books & Special Collections | Radiation
Lab | Kresge Law