RecconEmotionEntailmentTokenizer

class RecconEmotionEntailmentTokenizer(vocab_file: str, merges_file: str, do_lower_case: bool = False, **kwargs)

Constructs a Reccon Emotion Entailment tokenizer, derived from the RoBERTa tokenizer, using byte-level Byte-Pair-Encoding.
Parameters:
- vocab_file (str) – Path to the vocabulary file.
- merges_file (str) – Path to the merges file.
- do_lower_case (bool, defaults to False) – Whether or not to lowercase the input when tokenizing.
Example:

    from sg_nlp import RecconEmotionEntailmentTokenizer

    tokenizer = RecconEmotionEntailmentTokenizer.from_pretrained("roberta-base")
    text = "surprise <SEP> Me ? You're the one who pulled out in front of me ! <SEP> Why don't you watch where you're going ? <SEP> Why don't you watch where you're going ? Me ? You're the one who pulled out in front of me !"
    inputs = tokenizer(text, return_tensors="pt")
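
The input string in the example above consists of four segments joined by " <SEP> ". The sketch below shows one way to assemble such a string in plain Python. The helper name `build_entailment_input` and its parameter names are hypothetical and not part of the library. Reading the four segments as emotion, target utterance, candidate cause utterance, and conversation history is an assumption inferred from the example text, not something this page states.

```python
# Separator exactly as it appears in the example string (with surrounding spaces).
SEP = " <SEP> "


def build_entailment_input(emotion, target_utterance, candidate_utterance, history_utterances):
    """Hypothetical helper: join four segments into one tokenizer input string.

    The meaning assigned to each segment is an assumption based on the
    documented example; only the " <SEP> " layout is taken from the source.
    """
    # Assumed: the history segment is the prior utterances joined by single spaces.
    history = " ".join(history_utterances)
    return SEP.join([emotion, target_utterance, candidate_utterance, history])


text = build_entailment_input(
    "surprise",
    "Me ? You're the one who pulled out in front of me !",
    "Why don't you watch where you're going ?",
    [
        "Why don't you watch where you're going ?",
        "Me ? You're the one who pulled out in front of me !",
    ],
)
print(text)
```

The resulting `text` is identical to the string used in the example and can be passed to the tokenizer in the same way.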