Coming August 11, 2009

De-Babelizing (Digital) Archives

Here at the Hoover Institution Archives,  I work closely with culturally rich materials and serve both a local and international community of researchers.  I would love to engage in dialogue about the representation of linguistically diverse historical content in the digital environment.   What tools have been developed and adopted by the global digital archives community?  Which institutions produce bilingual finding aids?  I’d like to explore the use of intelligent character recognition systems in the transcription of digital manuscript and the search/display of CJK, Cyrillic and Khmer text.   (For example, Taiwan’s National Digital Archives Program & the Digital Archive Architecture Lab have the 缺字系統 or “Missing Unencoded Chinese Characters API.”)

Other prospective al dente discussion topics that I’d like to toss around:

  • Mobile devices and handheld technologies in archives (i.e., iPhone apps, QR codes, etc.)
  • Using DRUPAL as a digital asset management system and content discovery tool
  • Hyper localization as grassroots outreach to community-based archives
  • Timeline tools

Please forgive my fragmented thoughts — I seem to have more questions than answers.  Look forward to meeting fellow campers!

Comments RSS TrackBack 2 comments

Janet Carleton

in August 7th, 2009 @ 12:39

These all sound great to me. Mobile devices and timeline tools, especially. Also my institution has a large SE Asia collection that could benefit from the CJK work.


in August 11th, 2009 @ 01:00

I’d love to talk some more about your experience managing multilingual content and the challenges you’ve faced. I’m very interested in crowdsourcing translation and transcription across multilingual content, and it sounds like you’ve got a wealth of experience here. So many interesting people and topics!