aTCl – Automated Text Categorization/Classification Language

Programming Languages & Translators (COMS W4115)
Department of Computer Science
Columbia University
Fall 2006

Automatic Text Categorization/Classification Language (aTCl)
Final Report

Jawwad Sultan
[email protected]
December 19, 2006

Distribution

Copy Number   Name, Title                                                        Location
1             Prof. Stephen A. [email protected] Professor for COMS W4115

Document Control

Change Record

Date         Author          Version   Comments
09/26/2006   Jawwad Sultan   0.1       Language White Paper
10/19/2006   Jawwad Sultan   0.2       aTCl sample program added in the tutorial section. Language Reference Manual added. Reuters 21578 corpus details added in the appendix section.
12/19/2006   Jawwad Sultan   1.0       aTCl final report published. Sections like aTCl architecture, tutorial, test plan added to the report.

Table of Contents

Distribution
Document Control
    Change Record
Table of Contents
1 Introduction
    1.1 Purpose
    1.2 Scope
    1.3 Definitions, Acronyms and Abbreviations
    1.4 References
2 Text Categorization
    2.1 Introduction
        2.1.1 Machine Learning approach
    2.2 Text Classification Life Cycle
        2.2.1 Parser
        2.2.2 Document Pre-Processing
        2.2.3 Document Indexing
        2.2.4 Dimensionality Reduction
3 aTCl Language
    3.1 Features of aTCl
        3.1.1 Simple Types
        3.1.2 Complex Types
        3.1.3 Control Statements
        3.1.4 Separators
    3.2 Properties of aTCl
        3.2.1 Simple
        3.2.2 Portable
        3.2.3 Robust
4 aTCl Tutorial
    4.1 Sample aTCl program
5 aTCl Language Reference Manual
    5.1 Lexical Conventions
        5.1.1 Comments
        5.1.2 Identifiers
        5.1.3 Keywords
        5.1.4 Literals
            Integer
            Decimal
            String