This commit is contained in:
caandt
2025-03-13 16:56:36 -05:00
parent b2439eee3e
commit 046e80cdd1
27 changed files with 0 additions and 0 deletions
+18
View File
@@ -0,0 +1,18 @@
# seq2seq
- train_tokenizer_auto.py:
- trains the manual tokenizer
- tokenize_seq2seq.py:
- tokenize the dataset for the seq2seq model
- train_seq2seq.py:
- finetuning the pretrained model
- will create a sequence-to-sequence translation model
- StatementConfiguration.py
- defines the JSON format for statement translation training
# manual1
Contains JSONs mapping bytecode instructions and their configurations to use in training.