Ideas….

a blog for me to record thoughts and ideas

Browsing Posts tagged TEI

First Generation of Civil War Letters Project

Hamilton encoded administrative documents and letters from the civil war. They used TEI and Dublin Core metadata with their documents (for historical reasons they used Dublin Core).
They then created a PHP interface. The Dublin Core description was used as the “abstract” for the record and the text appears when you view the “full record”. The pages themselves were transformed using XSLT into HTML and then a CSS was used to style the page.

They created drop down boxes for person’s names, organizations, place names, and geographic features.

They also included images of the page. A link was encoded into the TEI to link to the images of the actual documents.  The TEI elements that you might want to use to do this would be a page break; below is an example of code that would do this:

<pb facs=”./page/image/here.jpg” n=”43″>

Second Generation of Civil War Letters Project

These letters were encoded and then loaded into a database called eXist-db. Since the db is already indexed it makes it easier to find data.

The Dublin Core records were replaced by ContentDM. So the records are in ContentDM, his search interface is in PHP, again with drop down boxes pulled from the eXist-db. The dropdown boxes then links to canned searches in ContentDM. The “search” then taxes you to records in ContentDM

If you want to view the full text of the letters, a link in the ContentDM record takes you outside of ContentDM to a webpage. The page it links to is TEI transformed by XSLT into HTML and stylized using CSS. There are links in the TEI that point toward the digital image of the original text.

Below is a link to the current digital collection:

http://elib.hamilton.edu/hc/hcbrowse.php?id=col_spe-civ

TEI Conference: Basics of TEI

No comments

Powerpoints for this session can be found on Brown’s Website.

TEI allows for…..

  • long term storage. its in xml so it should last
  • allows you to analyze information
  • if you want to share your data it needs to be meaningful for others. if you encode in TEI others can figure it out.

TEI is…

  • a standards organization
  • maintains, develops, disseminates the language
  • humanities documents of all kinds (internationally)
  • membership consortium that pays annual fee to support the TEI and activities surrounding TEI
  • social! it allows people to get together to discuss humanities
  • a set of guidelines that covers the grammar and vocabulary of the markup language

A system that you can take and apply in the ways you need to apply your data.

Different types of disciplines use different standards, ie. digital libraries might need different information than linguistics. However they are all valid in the TEI language. Some projects using TEI are listed on Wikipedia. There is a logistical way to normalize the different ways people use TEI. The MONK Project has created conversion processes to make the different types of TEI to work with their TEI.

Another thing to think about is the Resource Description Framework (RDF) developed by the W3C. Information on RDF can be found on the W3C Website.