Wraps `proto.SplitInfo` with an additional property.
tfds.core.SplitInfo(
name: str,
shard_lengths: List[int],
num_bytes: int,
filename_template: Optional[naming.ShardedFileTemplate] = None,
statistics: statistics_pb2.DatasetFeatureStatistics = dataclasses.field(default_factory=statistics_pb2.DatasetFeatureStatistics)
)
Attributes | |
---|---|
`name` | Name of the split (e.g. `train`, `test`, ...). |
`shard_lengths` | List of length `num_shards` containing the number of examples in each shard. |
`filename_template` | The template used to create sharded filenames. |
`num_examples` | Total number of examples (`sum(shard_lengths)`). |
`num_shards` | Number of files (`len(shard_lengths)`). |
`num_bytes` | Size of the files (in bytes). |
`statistics` | Additional statistics of the split. |
`file_instructions` | Returns the list of `dict(filename, take, skip)`. This allows for creating your own `tf.data.Dataset` using the low-level TFDS values. When `skip=0` and `take=-1`, the full shard is read, so `ds.skip` and `ds.take` can be omitted. |
`filenames` | Returns the list of filenames. |
`filepaths` | All the paths for all the files that are part of this split. |
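The relationship between `shard_lengths`, the derived counts, and per-file `skip`/`take` values can be illustrated with a minimal sketch. `SplitSketch`, its `instructions` method, and the filename pattern are hypothetical stand-ins written for this example; they are not the TFDS implementation, and the sketch always emits an explicit `take` count.

```python
import dataclasses
from typing import Dict, List


@dataclasses.dataclass(frozen=True)
class SplitSketch:
    """Hypothetical stand-in for SplitInfo; not the TFDS class."""

    name: str
    shard_lengths: List[int]
    num_bytes: int = 0

    @property
    def num_examples(self) -> int:
        # Total number of examples: sum(shard_lengths).
        return sum(self.shard_lengths)

    @property
    def num_shards(self) -> int:
        # Number of files: len(shard_lengths).
        return len(self.shard_lengths)

    def instructions(self, start: int, stop: int) -> List[Dict[str, object]]:
        """Maps the example range [start, stop) onto per-shard skip/take."""
        result = []
        offset = 0  # Global index of the first example in the current shard.
        for index, length in enumerate(self.shard_lengths):
            shard_start, shard_stop = offset, offset + length
            offset = shard_stop
            lo = max(start, shard_start)
            hi = min(stop, shard_stop)
            if lo >= hi:
                continue  # This shard does not overlap the requested range.
            result.append({
                # Assumed filename pattern, for illustration only.
                "filename": f"{self.name}-{index:05d}-of-{self.num_shards:05d}",
                "skip": lo - shard_start,  # Examples to skip inside the shard.
                "take": hi - lo,           # Examples to read after skipping.
            })
        return result


split = SplitSketch(name="train", shard_lengths=[4, 4, 2])
print(split.num_examples, split.num_shards)
for instruction in split.instructions(start=2, stop=8):
    print(instruction)
```

In this sketch, the range `[2, 8)` touches only the first two shards: two examples are skipped in the first file, and the second file is read from its beginning.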
Methods
from_proto
@classmethod
from_proto(
proto: proto_lib.SplitInfo,
filename_template: naming.ShardedFileTemplate
) -> 'SplitInfo'
Returns a SplitInfo class instance from a SplitInfo proto.
replace
replace(
**kwargs
) -> 'SplitInfo'
Returns a copy of the `SplitInfo` with updated attributes.
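The copy-with-updates behavior can be sketched with a plain dataclass. This is an illustration under assumptions: `SplitSketch` is a hypothetical stand-in, and delegating to `dataclasses.replace` is one plausible way to get this behavior, not necessarily how TFDS implements it.

```python
import dataclasses
from typing import List


@dataclasses.dataclass(frozen=True)
class SplitSketch:
    """Hypothetical stand-in for SplitInfo; not the TFDS class."""

    name: str
    shard_lengths: List[int]
    num_bytes: int

    def replace(self, **kwargs) -> "SplitSketch":
        # Build a new instance with the given fields overridden.
        # The original instance is left unchanged.
        return dataclasses.replace(self, **kwargs)


original = SplitSketch(name="train", shard_lengths=[4, 4, 2], num_bytes=1024)
updated = original.replace(num_bytes=2048)
print(original.num_bytes, updated.num_bytes)
```

Fields not passed to `replace` carry over from the original, so `updated.name` and `updated.shard_lengths` match those of `original`.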
to_proto
to_proto() -> proto_lib.SplitInfo
Class Variables | |
---|---|
`filename_template` | `None` |