Package ai.djl.basicdataset.nlp
Class TatoebaEnglishFrenchDataset
java.lang.Object
ai.djl.training.dataset.RandomAccessDataset
ai.djl.basicdataset.nlp.TextDataset
ai.djl.basicdataset.nlp.TatoebaEnglishFrenchDataset
All Implemented Interfaces:
ai.djl.training.dataset.Dataset
TatoebaEnglishFrenchDataset is an English-French machine translation dataset from the
Tatoeba Project (http://www.manythings.org/anki/).
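As a rough illustration of what a sentence-pair corpus of this kind looks like, the sketch below parses tab-separated lines into (English, French) pairs. The tab-separated layout, the SentencePairParser class, and its parse method are assumptions made for this example; they are not part of DJL and do not describe how this class reads its data.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical helper for illustration only; not part of DJL. */
class SentencePairParser {

    /**
     * Splits each line on tabs and keeps the first two columns as
     * {source, target}. Lines with fewer than two columns are skipped.
     * Assumes one sentence pair per line, tab-separated.
     */
    static List<String[]> parse(List<String> lines) {
        List<String[]> pairs = new ArrayList<>();
        for (String line : lines) {
            String[] columns = line.split("\t");
            if (columns.length < 2) {
                continue; // not a sentence pair
            }
            pairs.add(new String[] {columns[0].trim(), columns[1].trim()});
        }
        return pairs;
    }
}
```

Any columns after the second are ignored in this sketch, so extra per-line metadata does not affect the resulting pairs.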
Nested Class Summary
Nested classes/interfaces inherited from class ai.djl.basicdataset.nlp.TextDataset
TextDataset.Sample
Nested classes/interfaces inherited from class ai.djl.training.dataset.RandomAccessDataset
ai.djl.training.dataset.RandomAccessDataset.BaseBuilder<T extends ai.djl.training.dataset.RandomAccessDataset.BaseBuilder<T>>
Nested classes/interfaces inherited from interface ai.djl.training.dataset.Dataset
ai.djl.training.dataset.Dataset.Usage
Field Summary
Fields inherited from class ai.djl.basicdataset.nlp.TextDataset
manager, mrl, prepared, samples, sourceTextData, targetTextData, usage
Fields inherited from class ai.djl.training.dataset.RandomAccessDataset
dataBatchifier, device, labelBatchifier, limit, pipeline, prefetchNumber, sampler, targetPipeline
Constructor Summary
Constructors
protected TatoebaEnglishFrenchDataset - Creates a new instance of TatoebaEnglishFrenchDataset.
Method Summary
protected long availableSize()
builder() - Creates a new builder to build a TatoebaEnglishFrenchDataset.
ai.djl.training.dataset.Record get(ai.djl.ndarray.NDManager manager, long index)
void prepare(ai.djl.util.Progress progress)
Methods inherited from class ai.djl.basicdataset.nlp.TextDataset
getProcessedText, getRawText, getSamples, getTextEmbedding, getVocabulary, preprocess
Methods inherited from class ai.djl.training.dataset.RandomAccessDataset
getData, getData, getData, getData, newSubDataset, newSubDataset, randomSplit, size, subDataset, subDataset, subDataset, subDataset, toArray
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Methods inherited from interface ai.djl.training.dataset.Dataset
matchingTranslatorOptions, prepare
Constructor Details
TatoebaEnglishFrenchDataset
Creates a new instance of TatoebaEnglishFrenchDataset.
Parameters:
builder - the builder object to build from
Method Details
builder
Creates a new builder to build a TatoebaEnglishFrenchDataset.
Returns:
a new builder
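The builder-plus-protected-constructor arrangement described above can be sketched in plain Java as follows. PairDataset, its Builder, and the optLimit/optUsage options are hypothetical stand-ins chosen for illustration; consult the builder returned by TatoebaEnglishFrenchDataset.builder() for the options it actually exposes.

```java
/** Hypothetical stand-in for a dataset created through a builder; not the DJL class. */
class PairDataset {
    private final int limit;
    private final String usage;

    /** Protected: callers are expected to go through builder() instead. */
    protected PairDataset(Builder builder) {
        this.limit = builder.limit;
        this.usage = builder.usage;
    }

    /** Creates a new builder with default settings. */
    static Builder builder() {
        return new Builder();
    }

    int getLimit() {
        return limit;
    }

    String getUsage() {
        return usage;
    }

    /** Collects optional settings, then hands itself to the constructor. */
    static final class Builder {
        private int limit = Integer.MAX_VALUE; // default: no limit
        private String usage = "TRAIN";        // default split (assumed for this sketch)

        Builder optLimit(int limit) {
            this.limit = limit;
            return this;
        }

        Builder optUsage(String usage) {
            this.usage = usage;
            return this;
        }

        PairDataset build() {
            return new PairDataset(this);
        }
    }
}
```

Keeping the constructor protected means every instance is configured through the builder, so defaults live in one place and subclasses can still chain to the constructor.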
prepare
public void prepare(ai.djl.util.Progress progress) throws IOException, ai.djl.modality.nlp.embedding.EmbeddingException
Throws:
IOException
ai.djl.modality.nlp.embedding.EmbeddingException
get
public ai.djl.training.dataset.Record get(ai.djl.ndarray.NDManager manager, long index)
Specified by:
get in class ai.djl.training.dataset.RandomAccessDataset
availableSize
protected long availableSize()
Specified by:
availableSize in class ai.djl.training.dataset.RandomAccessDataset
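The relationship between get and availableSize can be illustrated with a small, self-contained sketch: availableSize reports how many records can be read, and get returns the record at a given index. ParallelCorpus and SentencePair are hypothetical names, and plain strings stand in for the Record and NDManager types used by the real methods.

```java
import java.util.List;

/** Hypothetical random-access corpus for illustration; not the DJL implementation. */
class ParallelCorpus {

    /** One aligned source/target sentence pair. */
    static final class SentencePair {
        final String source;
        final String target;

        SentencePair(String source, String target) {
            this.source = source;
            this.target = target;
        }
    }

    private final List<String> sources;
    private final List<String> targets;

    ParallelCorpus(List<String> sources, List<String> targets) {
        if (sources.size() != targets.size()) {
            throw new IllegalArgumentException("source and target lists must be aligned");
        }
        this.sources = sources;
        this.targets = targets;
    }

    /** Number of records that can be read from this corpus. */
    protected long availableSize() {
        return sources.size();
    }

    /** Returns the pair at the given index, rejecting out-of-range indices. */
    public SentencePair get(long index) {
        if (index < 0 || index >= availableSize()) {
            throw new IndexOutOfBoundsException("index: " + index);
        }
        int i = (int) index;
        return new SentencePair(sources.get(i), targets.get(i));
    }
}
```

Valid indices in this sketch run from 0 to availableSize() - 1; anything outside that range is rejected rather than wrapped around.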