Training and Tokenizing data on kaggle. ( connected remote session of kaggle to Vscode for time tracking )
There were some problems for example:
- Whole deduped dataset was loaded at once which caused the GPU to be out of memory and i recevied Cuda error so i am planning to train with partitions
- Kaggle has a limit of 12 hours and after the session ends the whole data is wiped out so i had to figure out to save it temporarily and build the training on checkpoints rather doing it at once. ( used Hugginface and kaggle persistance).
- I also had exams during this time
Log in to leave a comment