Creates ngrams from ragged string data.
tf.raw_ops.StringNGrams(
    data, data_splits, separator, ngram_widths, left_pad, right_pad, pad_width,
    preserve_short_sequences, name=None
)
This op accepts a ragged tensor with 1 ragged dimension containing only strings and outputs a ragged tensor with 1 ragged dimension containing ngrams of that string, joined along the innermost axis.
Args | |
|---|---|
data
 | 
A Tensor of type string.
The values tensor of the ragged string tensor to make ngrams out of. Must be a
1D string tensor.
 | 
data_splits
 | 
A Tensor. Must be one of the following types: int32, int64.
The splits tensor of the ragged string tensor to make ngrams out of.
 | 
separator
 | 
A string.
The string to append between elements of the token. Use "" for no separator.
 | 
ngram_widths
 | 
A list of ints. The sizes of the ngrams to create.
 | 
left_pad
 | 
A string.
The string to use to pad the left side of the ngram sequence. Only used if
pad_width != 0.
 | 
right_pad
 | 
A string.
The string to use to pad the right side of the ngram sequence. Only used if
pad_width != 0.
 | 
pad_width
 | 
An int.
The number of padding elements to add to each side of each
sequence. Note that padding will never be greater than 'ngram_widths'-1
regardless of this value. If pad_width=-1, then add max(ngram_widths)-1
elements.
 | 
preserve_short_sequences
 | 
A bool.
 | 
name
 | 
A name for the operation (optional). | 
Returns | |
|---|---|
A tuple of Tensor objects (ngrams, ngrams_splits).
 | 
|
ngrams
 | 
A Tensor of type string.
 | 
ngrams_splits
 | 
A Tensor. Has the same type as data_splits.
 |