Files
pylingual/model_training/statement/README.md
T
2025-03-07 16:44:23 -06:00

449 B

seq2seq

  • train_tokenizer_auto.py:

    • trains the manual tokenizer
  • tokenize_seq2seq.py:

    • tokenize the dataset for the seq2seq model
  • train_seq2seq.py:

    • finetuning the pretrained model
    • will create a sequence-to-sequence translation model
  • StatementConfiguration.py

    • defines the JSON format for statement translation training

manual1

Contains JSONs mapping bytecode instructions and their configurations to use in training.