Stanford CS 224 - Automatic Name Transliteration via OCR and NLP

Yu Cao, Tao Wang

Integration

Optical Character Recognition (OCR)
• ICDAR 2011 dataset
• characters embedded in natural scenes
• histogram of oriented gradients (HOG) features
• 8×8 window sliding with a step of 2
• linear-kernel SVM
• 52 classes, i.e. uppercase and lowercase letters
• overall character-level accuracy: 74%

Bayesian Correction
• character-level bigram language model
• character-level accuracy improved to 75.3%

Named Entity Recognition (NER)
• essentially two types of labels, "PERSON" and "NONPERSON"
• MUC-7 corpora
• maximum entropy Markov model
• set of features: "CUR_WORD", "PREV_LABEL", "MID_INITIAL", "IN_DICT", "IN_NAME_DATABASE", "NEXT_WORD"
• F1 score of 77.5% (precision 76.9%, recall 78.1%)

Transliteration
• character-level translation model
• training data: 4,256 English-Chinese name pairs obtained online
• trigram Chinese language model
• alignment models: IBM Models 1, 3, and 4
• human evaluation
• 120 English names obtained by NER for testing
• acceptance score: 100 ± 2 out of 120

[Figure: character-level alignment of "F r a n c i s c o" with its Chinese transliteration, beginning 弗]
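
The Bayesian correction step can be illustrated with a small sketch. The poster does not give its formulation, so the code below assumes a standard Viterbi decode that combines per-character OCR scores with a character-level bigram model. The function name `viterbi_correct`, the start symbol "^", the probability floor, and all numbers in the toy tables are hypothetical and chosen only for illustration.

```python
import math


def viterbi_correct(emissions, bigram, floor=1e-6):
    """Return the most probable character string under OCR scores and a bigram model.

    emissions: list of dicts, one per character position, mapping a candidate
               character to its OCR score (assumed here to act as P(image | char)).
    bigram:    dict mapping (previous_char, char) to P(char | previous_char);
               "^" is a hypothetical start-of-word symbol.
    floor:     smoothing value for unseen pairs, so log() is always defined.
    """
    def lp(p):
        return math.log(max(p, floor))

    # best[c] holds (log score, path) of the best sequence ending in character c.
    best = {
        c: (lp(p) + lp(bigram.get(("^", c), 0.0)), [c])
        for c, p in emissions[0].items()
    }
    for scores in emissions[1:]:
        new_best = {}
        for c, p in scores.items():
            # Pick the predecessor that maximises path score plus transition score.
            prev_score, prev_path = max(
                ((s + lp(bigram.get((prev, c), 0.0)), path)
                 for prev, (s, path) in best.items()),
                key=lambda t: t[0],
            )
            new_best[c] = (prev_score + lp(p), prev_path + [c])
        best = new_best
    _, path = max(best.values(), key=lambda t: t[0])
    return "".join(path)


# Toy example (made-up numbers): the classifier slightly prefers "b" over "h"
# at the second position, but the bigram model strongly prefers "th" over "tb".
emissions = [
    {"t": 0.9, "f": 0.1},
    {"b": 0.55, "h": 0.45},
    {"e": 0.8, "c": 0.2},
]
bigram = {
    ("^", "t"): 0.5, ("^", "f"): 0.5,
    ("t", "h"): 0.6, ("t", "b"): 0.01,
    ("f", "h"): 0.01, ("f", "b"): 0.01,
    ("h", "e"): 0.5, ("b", "e"): 0.3,
    ("h", "c"): 0.01, ("b", "c"): 0.01,
}

greedy = "".join(max(s, key=s.get) for s in emissions)  # per-character argmax
corrected = viterbi_correct(emissions, bigram)
print(greedy, corrected)
```

In this toy case the per-character argmax reads "tbe" while the bigram-corrected decode reads "the". A language model of this kind can only fix errors where the correct character is still among the classifier's plausible candidates, which may be one reason the reported gain (74% to 75.3%) is modest.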

