Word EmbeddingsNatural language models cannot operate directly on words as strings.
Subword TokenizationA language model cannot process raw text directly. Text must first be converted into a sequence of token IDs. The procedure that performs this conversion is called tokenization.
Text ClassificationText classification assigns one or more labels to a piece of text.
Named Entity RecognitionNamed entity recognition, usually abbreviated NER, identifies spans of text that refer to named or typed entities.
Machine TranslationMachine translation converts text from one language into another. Given a source sentence in one language, the model generates a semantically equivalent sentence in a target language.
Question AnsweringQuestion answering, often abbreviated QA, is the task of producing an answer to a question.
Conversational SystemsA conversational system processes dialogue between users and machines.
Language ModelingLanguage modeling is the task of predicting text sequences. A language model assigns probabilities to sequences of tokens and learns the statistical structure of language.