User:Kku/Books/Textmining

An information-theoretic view

 * Introduction
 * Language
 * Natural language
 * Historical linguistics
 * Linguistics
 * Language model
 * Morphology (linguistics)
 * Meaning (linguistics)
 * Parsing
 * Markov model
 * Text mining
 * Text categorization
 * Terminology extraction
 * Knowledge extraction
 * Information extraction
 * Relationship extraction
 * Concept mining
 * Concept search
 * Information retrieval
 * Ranking (information retrieval)
 * Document retrieval
 * Document clustering
 * Machine learning
 * Hidden Markov model
 * Machine translation
 * Compound term processing
 * Document classification
 * Alphabet (computer science)
 * Bag-of-words model
 * Combinatorics on words
 * Co-occurrence
 * Predictive classification
 * Text linguistics
 * Speech recognition
 * Sentiment analysis
 * Natural language processing
 * Named-entity recognition
 * Latent Dirichlet allocation


 * Sources
 * Text corpus
 * Corpus linguistics
 * Lexicon
 * Thesaurus


 * Grammar
 * Grammar
 * Formal grammar
 * Context-free grammar
 * Stochastic context-free grammar
 * Synchronous context-free grammar
 * Chomsky hierarchy


 * Parts of Speech
 * Formal language
 * Part of speech
 * Part-of-speech tagging
 * Phrase chunking
 * Shallow parsing
 * Phoneme
 * Morpheme
 * Null morpheme
 * Lexeme
 * Adjective
 * Verb
 * Clause
 * Predicate (grammar)
 * Sentence (linguistics)
 * Sentence diagram
 * Noun phrase
 * Subject (grammar)
 * Conjunction (grammar)
 * Object (grammar)
 * Phrase structure grammar
 * Dependency grammar
 * Argument (linguistics)
 * Finite verb
 * Agreement (linguistics)
 * Grammatical gender
 * Preposition and postposition
 * Adverb
 * Determiner
 * Article (grammar)
 * Grammatical case
 * Grammatical number
 * Pronoun
 * Copula (linguistics)
 * Affix
 * Suffix
 * Marker (linguistics)
 * Grammatical person
 * Inflection
 * Grammatical conjugation
 * Syntax
 * Pragmatics
 * Semantics
 * Noun
 * Dependent clause
 * Classifier (linguistics)
 * Noun class
 * Anaphora
 * Cataphora
 * Endophora
 * Pro-form
 * Quantification
 * Indefinite pronoun
 * Possessive determiner
 * Demonstrative
 * Predicative expression
 * Subject complement
 * Phrase
 * Antecedent (grammar)
 * Determiner phrase
 * Word


 * Statistical text processing
 * Topic model
 * Stochastic grammar
 * Collocation
 * Collostructional analysis
 * Dynamic topic model
 * F1 score
 * Factored language model
 * Glottochronology
 * Lexicostatistics
 * Lexical analysis
 * N-gram
 * Trigram tagger
 * Dissociated press
 * K-mer
 * W-shingling
 * Vector space model
 * Jaccard index
 * Katz's back-off model
 * Markovian discrimination
 * Noisy channel model
 * Noisy text analytics
 * Semantic analysis
 * Coherence (linguistics)
 * Cohesion (linguistics)
 * Probabilistic latent semantic analysis
 * Sinkov statistic
 * Statistical machine translation
 * Statistical parsing
 * Tf–idf
 * Mean reciprocal rank
 * Latent semantic indexing
 * Variable rules analysis
 * Computational linguistics


 * Software tools
 * Natural Language Toolkit
 * General Architecture for Text Engineering
 * OpenNLP


 * Text similarity
 * Text clustering
 * Semantic similarity
 * Cosine similarity
 * String metric
 * Approximate string matching
 * MinHash
 * Bloom filter
 * Edit distance
 * Levenshtein distance
 * String searching algorithm
 * News analytics
 * Sequential pattern mining
 * Name resolution
 * Stop words
 * Semantic translation
 * Faceted search
 * Generative grammar
 * Biomedical text mining
 * Medical literature retrieval
 * Coreference
 * Nearest referent
 * Automatic summarization
 * Multi-document summarization
 * Semantic role labeling
 * Lexical semantics
 * Thematic relation
 * Morphosyntactic alignment
 * Differential object marking
 * Chomsky normal form
 * Question answering