wandering trader

mattseq worked on wandering trader

about 2 months ago

1h 34m logged

not sure why this shows so much time honestly. probably just me training the models and testing with different stocks. anyway, i tried shipping but got rejected bc the terminal output was little misleading and the random forest model took a stupidly long time. the reviewer also wanted gradient boosting EXEs but after DMing them and explaining that it was such a small part of my project which i barely spent any time on, they said it was fine. I still tried, but the xgboost python library was giving me a lot of trouble so i gave up. anyway, i made the RF model smaller so it takes around 5 mins. also somehow, the commit automatically listed github copilot as a co-author. i literally only used it as autocomplete for the single print line i added. cant seem to get rid of it so i just disabled copilot. still shameful just to see that on github tho

0

Log in to leave a comment

mattseq shipped wandering trader

about 2 months ago

Shipped this project!

Hours: 32.72

Cookies: 🍪 1040

Multiplier: 26.49 cookies/hr

I built a machine learning model for stock prediction! It uses an LSTM model and trains with data from Yahoo Finance. It’s also tested with past (but unseen) data as if it were actually deployed. I’m still not quite happy with it, because it can still be quite inconsistent but it’s noticeably improved. I plan to work on it again in the future and try to include sentiment analysis and insider trading data, which could give it enough data to be truly worth using. I also tried several different models like random forest and gradient boosting, which are also included in the repo.

mattseq worked on wandering trader

3 months ago

2h 26m logged

forgot to write this devlog for a few weeks or smth

Prep for Release

add README and add images/graphs to them
remove nn.py
cleanup LSTM + RF code
softcode timeframes and ask for user input
prevent any data leakage through early stopping or scheduler
build executables with pyinstaller and put on GitHub Release: https://github.com/mattseq/wandering-trader/releases/tag/v1.0.0

I did all of this before it was announced that FT was extended, so I’m probably going to improve on the model instead of shipping it as is.

0

Log in to leave a comment

mattseq worked on wandering trader

3 months ago

3h 5m logged

i was pretty much at a wall. nothing i changed seemed to work. i tried using claude to generate a few different programs using binary classification or probability based ones, multi-horizon predictions, etc. none of them worked very well and i had no idea what claude was doing. i went back to my model and just changed some of the hyperparameters like the interval, sequence length, and hidden size. i also added a “smart” strategy which isnt really smart, it just buys only if the model signal is the same as the naive signal. might mess around with combining the model signal with some of these other signals later. but right now i need to get this shippable. here’s graphs of TSLA and WDC

0

Log in to leave a comment

mattseq worked on wandering trader

3 months ago

0h 51m logged

ok after a ton of messing around, i cant find a way to make it consistently better than just buying and holding. my theory is that it would work way better with daytrading rather than something this long-term. the shorter the timeframe the more the price is ruled by patterns rather than sentiment. the other path is parsing sentiment data from alpha vantage and giving that to the model, which i think could totally boost its performance. unfortunately alpha vantage intraday data is not free and the sentiment data is rate limited to 25 requests per day. either i find another source or i end the project here.

0

Log in to leave a comment

mattseq worked on wandering trader

4 months ago

4h 49m logged

ok so i kept working on the lstm. i really dont want to give up or start over. i added a cool attention mechanism after the lstm output. not sure how much it helped but i also graphed how much each sequence timestep is valued using that which was cool to see. i also printed some more stuff like sharpe and directional accuracy. the model didnt seem to be picking up anything valuable but it was doing really well on both training and test sets so i made the model smaller to reduce overfitting and made a directional mse loss (not sure if that already exists with pytorch) so that it punishes it more for predicting the wrong signal (which is whats most important). its doing much better, at least for IBM and BA stocks but kind of breaks down for others.

0

Log in to leave a comment

mattseq worked on wandering trader

4 months ago

5h 9m logged

i dont even know what im doing anymore. i am completely and utterly lost. just about every lstm model demo i’ve seen online doesnt actually try to trade stocks based on their data and its clear that they might also struggle with the variance problem (not predicting prices that are varied and instead predicting prices close to the previous one). when i try predicting returns rather than prices or try predicting buy/sell signals, the model signals stagnate into just one buy/sell signal. random forest might actually be more promising

0

Log in to leave a comment

mattseq worked on wandering trader

4 months ago

2h 4m logged

basically just messed around with smarter strategies and comparing strategies. spent a while thinking of and trying out some smarter strategies paired with the model. so far, i just have a really simple one that doesnt buy unless the predicted return is higher than a certain percent (not just 0). i also decided to plot the model’s prediction compared to the last closing price it was given to see if the model was just parroting the input. it seems like its actually predicting properly bc the difference fluctuates quite a bit and isnt extremely close to 0. lastly, i tried to recursively predict the stock prices by giving the model only the first 60 days and letting it feeds its own next day prediction into itself. i was hoping it might fluctuate at least a little, but it just kinda flops in a gentle slope. here are some of the graphs (not all are from the same cycle). looking at the main one with the cumulative returns, you can actually notice some interesting patterns from the model-based strategy compared to buy and hold

1

0

Log in to leave a comment

mattseq worked on wandering trader

4 months ago

3h 13m logged

GOT LSTM TO WORK! took a look at a few kaggle notebooks. basically i just made the model bigger, increased the sequence length, had it predict prices rather than just binary classification, and used MinMaxScaler. i also simplified features so its really just the close price.

0

Log in to leave a comment

mattseq worked on wandering trader

4 months ago

2h 44m logged

tried an LSTM model. not sure how, but the youtube algorithm figured out i’d be interested in this video about someone making an LSTM stock prediction model: https://www.youtube.com/watch?v=V2l7cZxUpQs. they’re using tensorflow which might actually be easier but theres not that much of a difference. unfortunately their code is not fully opensource, you have to be a github sponsor to gain access to it. i couldnt really get my lstm model to do very well, it kind of just accepts defeat and learns to just output the average of the outputs. here you can see that my targets are just 0 and 1 for up and down and the model just slowly starts to average out to predicting around a 0.5. i think im going to go back to random forest for a little while, there were some things i wanted to try out.

0

Log in to leave a comment

mattseq worked on wandering trader

4 months ago

2h 12m logged

another 2 hrs down the train. i tried a ton of different configurations including relative features, expanding vs rolling windows for training and just made sure that there was no leakage of testing data into the training data. i also added vix as a volatility index but it doesnt look like it helped much. i also realized that i should prob be graphing the actual stock prices not just cumulative returns, so i can see where the model is failing. i also found an video of someone using LSTM for stock prediction, which i want to try, but that might take a lot more work. im probably going to either do that or add sentiment scores and other data to my feature set next.

1

0

Log in to leave a comment

mattseq worked on wandering trader

4 months ago

0h 40m logged

tried gradient boosting models but they didnt seem that much better, maybe i’ll test them more thoroughly later (xgboost is much faster i think). i went back to random forest and i noticed something interesting. although the cumulative model returns are clearly below the buy and hold returns, it begins to more or less mirror them and actually starts gaining back the distance it lost. i changed the training window to expand instead of roll so that it gets as much training data as possible and then ran it on about 20 yrs. it started off rocky but then clearly began beating the buy and hold returns and compounding. im not sure if this is due to some sort of data leaking again from me, bc i double checked everything and it seems plausible with that much data.

0

Log in to leave a comment

mattseq worked on wandering trader

4 months ago

5h 25m logged

sorry that this devlog is 5 hrs in, i wasnt really sure how far i’d get with this project. at first, i started by using neural networks which im most familiar with but they didnt get great accuracy. then i learned about random forest models (and other decision tree based models) and started using that. i was getting insanely high returns which at first was exciting but then i quickly realized that i was probably doing something wrong. as it turns out, i was accidentally leaking the targets to my model by how i was training it. after fixing that, random forest wasnt much better than buy and hold returns. so now im researching and testing gradient boosting models. this is the latest performance of random forest on IBM stocks from 2005 to 2025

0

Log in to leave a comment

2 Followers

Shipped this project!

Prep for Release