Dalibor Fiala, Ph.D.

Roman Tesař

Phone: +420 377632479
E-mail: roman.tesar@gmail.com
WWW: http://www.sweb.cz/romant1/CV.pdf

Roman graduated at the University of West Bohemia in 2003, specialized in software engineering. Currently, he is a PhD student focused on internet document filtering, web mining, text classification and generally information retrieval. He also examines the possible utilization and inluence of word n-grams on the areas mentioned above.

Publications:

Sort by:

Year |

Title |

Citations

Extracting Information from Web Content and Structure
Authors:	Dalibor Fiala, Roman Tesař, Karel Ježek, François Rousselot
Source:	Proc. 9th Int. Conf. on Information Systems Implementation and Modelling ISIM’06, Přerov, Czech Republic, pp. 133-140, 2006. (ISBN 80-86840-19-0)
Download:	Full text [213 kB]
	citations: ...

A comparison of two algorithms for discovering repeated word sequences
Authors:	Roman Tesař, Dalibor Fiala, François Rousselot, Karel Ježek
Source:	The 6th International Conference on Data Mining, Text Mining and their Business Applications (Data Mining 2005), Skiathos, Greece, pp. 121-131. (ISBN 1-84564-017-9)
Download:	Full text [245 kB]
	citations: ...
View record in Web of Science®

Projects:

Document Classification
Authors:	Jiří Hynek, Karel Ježek, Michal Toman, Roman Tesař, Zdeněk Češka, Petr Grolmus
Desc.:	Use of inductive machine learning methods in classification of short text documents.

Extracting Information from Web Content and Structure
Authors:	Dalibor Fiala, Roman Tesař, Karel Ježek
Desc.:	This project deals with classification of Web documents and determination of authoritative Web sites. It was supported in part by the Ministry of Education of the Czech Republic under grant FRVS 1347/2005/G1.

Internet Content Filtering
Authors:	Roman Tesař, Karel Ježek
Desc.:	This project includes Web sites processing, analyzing, classification by means of their content and searching for other Web sites with similar content.

Dalibor Fiala, Ph.D.

University of West Bohemia

Roman Tesař

Publications:

Extracting Information from Web Content and Structure

A comparison of two algorithms for discovering repeated word sequences

Projects:

Document Classification

Extracting Information from Web Content and Structure

Internet Content Filtering