Go to Laboratory Home Go to Laboratory Home PageGo to Laboratory PhoneGo to Laboratory Search
Abstract

We describe an automated script identification system for typeset document images. Templates for each script are created by clustering textual symbols from a training set. Symbols from new images are compared to the templates to find the best script. Our current system processes thirteen scripts with minimal preprocessing and high accuracy.

J. Hochberg, P. Kelly, T. Thomas, and L. Kerns. Automatic Script Identification From Document Images Using Cluster-Based Templates. IEEE Transactions on Pattern Analysis and Machine Intelligence, Volume 19, Number 2, pp. 176-181, February 1997. Los Alamos National Laboratory Technical Report LA-UR-95-2598.   [   Abstract   ]