Class TatoebaEnglishFrenchDataset

java.lang.Object
ai.djl.training.dataset.RandomAccessDataset
ai.djl.basicdataset.nlp.TextDataset
ai.djl.basicdataset.nlp.TatoebaEnglishFrenchDataset
All Implemented Interfaces:
ai.djl.training.dataset.Dataset

public class TatoebaEnglishFrenchDataset extends TextDataset
TatoebaEnglishFrenchDataset is an English-French machine translation dataset from the Tatoeba Project (http://www.manythings.org/anki/).
  • Constructor Details

    • TatoebaEnglishFrenchDataset

      protected TatoebaEnglishFrenchDataset(TatoebaEnglishFrenchDataset.Builder builder)
      Creates a new instance of TatoebaEnglishFrenchDataset.
      Parameters:
      builder - the builder object to build from
  • Method Details

    • builder

      public static TatoebaEnglishFrenchDataset.Builder builder()
      Creates a new builder to build a TatoebaEnglishFrenchDataset.
      Returns:
      a new builder
    • prepare

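      public void prepare(ai.djl.util.Progress progress) throws IOException, ai.djl.modality.nlp.embedding.EmbeddingException
      Prepares the dataset for use, reporting progress to the given tracker.
      Parameters:
      progress - the progress tracker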
      Throws:
      IOException
      ai.djl.modality.nlp.embedding.EmbeddingException
    • get

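      public ai.djl.training.dataset.Record get(ai.djl.ndarray.NDManager manager, long index)
      Gets the Record for the given index from the dataset.
      Parameters:
      manager - the manager used to create the arrays
      index - the index of the requested data item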
      Specified by:
      get in class ai.djl.training.dataset.RandomAccessDataset
    • availableSize

      protected long availableSize()
      Returns the number of records available to be read in this dataset.
      Specified by:
      availableSize in class ai.djl.training.dataset.RandomAccessDataset
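
Example (illustrative sketch, not DJL code):
The members documented above follow a common shape: a static builder() factory, a protected constructor that takes the builder, and index-based access bounded by the number of available records. The sketch below shows that shape using only the JDK. ToyTranslationDataset, addPair, optLimit and size are hypothetical stand-ins invented for this illustration; they are not claimed to match the DJL classes or their signatures. With DJL on the classpath, the real chain would instead start at TatoebaEnglishFrenchDataset.builder() and call prepare(progress) before get(manager, index).

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for a translation dataset; NOT the DJL class.
class ToyTranslationDataset {
    private final List<String[]> pairs;
    private final long limit;

    // Protected constructor: instances are created through the builder.
    protected ToyTranslationDataset(Builder builder) {
        this.pairs = new ArrayList<>(builder.pairs);
        this.limit = builder.limit;
    }

    // Static factory that starts the fluent configuration chain.
    static Builder builder() {
        return new Builder();
    }

    // Number of records a caller may read (assumed: capped by the limit).
    long size() {
        return Math.min(limit, pairs.size());
    }

    // Random access by index; valid indexes are 0 .. size() - 1.
    String[] get(long index) {
        if (index < 0 || index >= size()) {
            throw new IndexOutOfBoundsException("index: " + index);
        }
        return pairs.get((int) index);
    }

    // Hypothetical builder; method names are assumptions for this sketch.
    static final class Builder {
        private final List<String[]> pairs = new ArrayList<>();
        private long limit = Long.MAX_VALUE;

        Builder addPair(String english, String french) {
            pairs.add(new String[] {english, french});
            return this;
        }

        Builder optLimit(long limit) {
            this.limit = limit;
            return this;
        }

        ToyTranslationDataset build() {
            return new ToyTranslationDataset(this);
        }
    }
}

class BuilderSketch {
    public static void main(String[] args) {
        ToyTranslationDataset dataset = ToyTranslationDataset.builder()
                .addPair("Hello.", "Bonjour.")
                .addPair("Thank you.", "Merci.")
                .addPair("Good night.", "Bonne nuit.")
                .optLimit(2)
                .build();

        String[] first = dataset.get(0);
        // prints "2 records; first: Hello. -> Bonjour."
        System.out.println(dataset.size() + " records; first: " + first[0] + " -> " + first[1]);
    }
}
```

Keeping the constructor protected means callers cannot skip the builder, while subclasses can still reuse the construction logic.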