utilsΒΆ

Functions

clean_text

This function cleans the text in the following ways: 1.

download_tokenizer_files_from_azure

Download all required files for tokenizer from Azure storage.

download_url_file

Helpder method to download url file.

encode_dataset

get_attention_masks

load_datasets

load_preprocessed_dataset

load_transform_dataset

data_path

pad_batched_sequences

pad_sequence

pad_structure