Popis tématu

Oborový projekt v programu, specializaci Ostatní / Nespecifikováno.

Pretrained DL model for multilingual PreProcessing

The Deep Learning model should be able to be pre-trained on a sufficient dataset for low-resource languages such as Czech, Polish, or others. The main aim is to make the model self-sufficient to perform preprocessing operations on any given data. These operations might include stopword removal, stemming, POS tagging, etc. The overall programming stack will be based on Python, supposedly PyTorch.

Téma vypsal: Noman Tahir (UN 326)

Vypsáno pro akademický rok 2025/2026 dne: 2025-04-28

Rezervace tématu

Toto téma je zatím volné. Pokud o téma máte vážný zájem, vyplňte prosím následující formulář, kterým si téma zamluvíte (všechny položky jsou povinné).

Detail tématu

Popis tématu

Pretrained DL model for multilingual PreProcessing

Rezervace tématu