oshot - Screenshot tool for extracting text on the fly

54 devlogs
126h 21m 40s

Screenshot tool to extract text on the fly (OCR and Bar/QR codes)

This project uses AI

Used AI for complex math and fixing some issue on the fly (pun intended).
Always tested before shipping

Demo Repository

toni

Shipped this project!

Hours: 122.57
Cookies: 🍪 2812
Multiplier: 29.75 cookies/hr

Note: please read the demo release notes

I’m so happy to ship oshot v0.4!
This is the biggest update the project has seen so far. Even though the interface remained similar, a lot of things have changed in the backend.
It was a real challenge rewriting the codebase, from its naming conventions to its structure.

For instance, the tool now supports macOS, which makes it cross-platform!

One thing that had to change was removing translation entirely, because it was driving me crazy and it sucked so bad that I’m ashamed of even introducing it in the first place.

Beyond that, annotations and QR/bar code image scans have been introduced, along with a lot of other bug fixes. The OCR is not perfect: it works best on documents and may sometimes misread text, but that largely depends on how you select the region to scan and which language model you are currently using.
I’ll work on improving it from time to time, and patch releases may follow in the future, but you can also help me out by either submitting an issue report or DMing me on Slack (@toni).

Thank you so much for the patience, and thank you again for trying out my software. <3

Note: please read the release notes

toni

Made the text annotation input color the same as the color used when placing it. This is useful when typing over certain background colors, for example entirely white ones where you can’t see what you typed.

toni

This devlog lists every bug fix I have made since the v0.4.0-rc1 release.

Changelog

toni

In this devlog I wanted to cover the changes I have made, which are mostly fixes for multi-monitor setup captures, new PRs being created and merged (thanks for the interest!), and other fixes for the upcoming v0.4.0 release.

Stay tuned!
The image below was taken when there was an issue with multi-monitor setup capture; that’s now been fixed!

Changelog

Attachment
toni

Translation has been purged from oshot.
It was a PITA trying to make it work and keep it stable without API keys and subscriptions. The only solution would be to use LibreTranslate, but I doubt anyone would host a kinda heavy server just to make a screenshot tool’s translation work.

But as William Afton says: I always come back

Changelog

toni

macOS support has arrived

Oh my God. What a journey.
Having to refactor from GLFW to Metal just for macOS was a great PITA. I’m glad the results have paid off though.

It is on a separate branch and will be merged to main after some issues have been fixed. I’ll post the video demo soon too.

Shoutout to @Hridya for testing my program on his Mac and allowing me to post a devlog using his screenshot as a preview.

btw I think this took MORE than just 5h sob-pray

Attachment
msw

Tagged your project as well cooked!

🔥 msw marked your project as well cooked! As a prize for your nicely cooked project, look out for a bonus prize in the mail :)

toni

dev stuff: added better result returns in functions. Instead of unknown "failed" errors, let’s actually be a bit more precise (a bit buggy ik, fixing it when I want. edit: fixed)
also added the --debug flag to release builds too, for reporting issues
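
To illustrate what "more precise" result returns can look like, here is a minimal C++17 sketch. It is not oshot's actual code: `CaptureErrorKind`, `CaptureError`, `Result` and `count_pixels` are hypothetical names made up for this example, and the real error categories may well differ.

```cpp
#include <string>
#include <utility>
#include <variant>

// Hypothetical error categories (assumption: the real ones are not shown in this devlog).
enum class CaptureErrorKind { NoDisplay, PermissionDenied, BackendUnavailable };

struct CaptureError {
    CaptureErrorKind kind;
    std::string      detail;  // human-readable context, e.g. for --debug output
};

// Minimal Result<T>: holds either a value of type T or a CaptureError.
template <typename T>
class Result {
public:
    Result(T value) : m_data(std::move(value)) {}
    Result(CaptureError error) : m_data(std::move(error)) {}

    bool ok() const { return std::holds_alternative<T>(m_data); }
    const T& value() const { return std::get<T>(m_data); }
    const CaptureError& error() const { return std::get<CaptureError>(m_data); }

private:
    std::variant<T, CaptureError> m_data;
};

// Hypothetical helper: fails with a specific error instead of a generic "failed".
Result<int> count_pixels(int width, int height) {
    if (width <= 0 || height <= 0)
        return CaptureError{CaptureErrorKind::NoDisplay, "empty capture region"};
    return width * height;
}
```

A caller would check `ok()` first and then read either `value()` or `error()`, so the failure reason travels with the return value and can be printed when debugging.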

toni

Decided to do a bunch of code cleanup and reformatting, alongside adding portal screenshot capture as a fallback on Linux. Overall, these changes are just fixes, such as the refactor of DXGI capture on Windows, and code style reformatting. Once the payouts are developed, I’ll add some features and release with a new ship
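
The fallback idea can be sketched roughly like this in C++17: try each capture backend in order and return the first screenshot that succeeds. This is only an assumption about the general shape, not oshot's real implementation; `Screenshot`, `CaptureBackend` and `capture_with_fallback` are hypothetical names, and no real portal or DXGI calls are made here.

```cpp
#include <functional>
#include <optional>
#include <string>
#include <vector>

// Hypothetical screenshot record; a real one would carry pixel data.
struct Screenshot {
    int         width;
    int         height;
    std::string backend;  // which backend produced the capture
};

// A backend is a name plus a capture function that may fail (empty optional).
struct CaptureBackend {
    std::string                                name;
    std::function<std::optional<Screenshot>()> capture;
};

// Try the backends in priority order and return the first successful capture.
// Returns std::nullopt when every backend fails (or the list is empty).
std::optional<Screenshot> capture_with_fallback(const std::vector<CaptureBackend>& backends) {
    for (const auto& backend : backends) {
        if (auto shot = backend.capture())
            return shot;
    }
    return std::nullopt;
}
```

With this shape, the portal capture would simply be registered after the native backend in the list, so it only runs when the native one returns nothing.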

toni

Shipped this project!

Hours: 32.38
Cookies: 🍪 561
Multiplier: 17.32 cookies/hr

I made this project to extract text on the fly without relying on online websites that take ages and get the text wrong.
It’s useful in video games, for extracting and translating text you can’t select, etc.

Please open an issue if you have any feedback, bugs, etc.

Thanks for trying my software :)

toni

still learning glfw+opengl3+imgui

Attachment
1

Comments

toni 3 months ago

In the end it’s just copying the example code for the main function, and then doing the rest with only ImGui (unless we’re talking about textures)