Discord AI Chatter banner

Discord AI Chatter

3 devlogs
13h 42m 49s

1GB, 22M param model trained (from scratch) on custom data! Has limited RAG capabilities, helping improve memory (theoretically).

This project uses AI

GitHub Copilot helped with code completion and a lot of explaining and debugging.

Demo Repository

Loading README...

Wasn't Me
  • added a feature where the llm generates 2 responses and has the user pick their favourite response
  • currently data is being stored but may potentially be used to train a “teacher” model which will allow for unsupervised learning

Probably gonna end the project here because of size restrictions. I might write a custom NN library from scratch and run it on a beefier device next time!

Attachment
0
Wasn't Me

Changes/Updates:

  • Updated LLM architecture (expanded capacity)
  • found and parsed new training data
  • found new testing data
  • increased training speed (active cooler on the pi!)

Notes:

  • had to retrain due to an architecture change
  • it might be a bit schizophrenic (sentient perhaps)
Attachment
0
Wasn't Me

Created the basic transformer architecture, added some basic Discord commands allowing me to interact and monitor with the bot. Currently training the bot on a Pi locally. (It is not very good at writing yet)

Attachment
0