TensorRT  7.2.1.6
NVIDIA TensorRT
Looking for a C++ dev who knows TensorRT?
I'm looking for work. Hire me!
All Classes Namespaces Functions Variables Typedefs Enumerations Enumerator Friends Pages
text.cleaners Namespace Reference

Functions

def expand_abbreviations (text)
 
def expand_numbers (text)
 
def lowercase (text)
 
def collapse_whitespace (text)
 
def convert_to_ascii (text)
 
def basic_cleaners (text)
 
def transliteration_cleaners (text)
 
def english_cleaners (text)
 

Variables

 _whitespace_re = re.compile(r'\s+')
 
list _abbreviations
 

Function Documentation

◆ expand_abbreviations()

def text.cleaners.expand_abbreviations (   text)
Here is the caller graph for this function:

◆ expand_numbers()

def text.cleaners.expand_numbers (   text)
Here is the caller graph for this function:

◆ lowercase()

def text.cleaners.lowercase (   text)
Here is the caller graph for this function:

◆ collapse_whitespace()

def text.cleaners.collapse_whitespace (   text)
Here is the caller graph for this function:

◆ convert_to_ascii()

def text.cleaners.convert_to_ascii (   text)
Here is the caller graph for this function:

◆ basic_cleaners()

def text.cleaners.basic_cleaners (   text)
Basic pipeline that lowercases and collapses whitespace without transliteration.
Here is the call graph for this function:

◆ transliteration_cleaners()

def text.cleaners.transliteration_cleaners (   text)
Pipeline for non-English text that transliterates to ASCII.
Here is the call graph for this function:

◆ english_cleaners()

def text.cleaners.english_cleaners (   text)
Pipeline for English text, including number and abbreviation expansion.
Here is the call graph for this function:

Variable Documentation

◆ _whitespace_re

text.cleaners._whitespace_re = re.compile(r'\s+')
private

◆ _abbreviations

list text.cleaners._abbreviations
private
Initial value:
1 = [(re.compile('\\b%s\\.' % x[0], re.IGNORECASE), x[1]) for x in [
2  ('mrs', 'misess'),
3  ('mr', 'mister'),
4  ('dr', 'doctor'),
5  ('st', 'saint'),
6  ('co', 'company'),
7  ('jr', 'junior'),
8  ('maj', 'major'),
9  ('gen', 'general'),
10  ('drs', 'doctors'),
11  ('rev', 'reverend'),
12  ('lt', 'lieutenant'),
13  ('hon', 'honorable'),
14  ('sgt', 'sergeant'),
15  ('capt', 'captain'),
16  ('esq', 'esquire'),
17  ('ltd', 'limited'),
18  ('col', 'colonel'),
19  ('ft', 'fort'),
20 ]]