|
|
|
||
Last update: Pátková Vlasta (08.06.2018)
|
|
||
Last update: Pátková Vlasta (08.06.2018)
Students will know:
|
|
||
Last update: Pátková Vlasta (08.06.2018)
R: Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval. Second edition, Addison-Wesley, 2011. R: Weiss, S.M. et all: Text Mining? Predictive Methods for Analyzing Unstructured Information. Springer, 2005. |
|
||
Last update: Pátková Vlasta (08.06.2018)
Lecturer materials |
|
||
Last update: Pátková Vlasta (08.06.2018)
Introduction to information retrieval, uncertainty, relevance, text document normalization, Zipf's law Text documents indexing, querying and searching - metrics, vector model - dimensionality reduction, latent semantic indexing Document and keyword clustering, distance, similarity metrics, centroid, clustering algorithms Document classification, Bayesian classification, k nearest neighbors, decision trees, metoda support vector machines The aims and capabilities of text mining, linguistic methods in text mining, tokenization, part-of-speech tagging, named entity recognition, parsing, coreferences Text mining in information retrieval: document content extraction, automatic document summarization, automatic question answering |
|
||
Last update: Pátková Vlasta (08.06.2018)
oral exam |