GPT-4V + TTS = AI Sports narrator 🪄⚽️
Passed every frame of a football video to gpt-4-vision-preview, and with some simple prompting asked to generate a narration
No edits, this is as it came out from the model (aka can be SO MUCH BETTER)
Building hardware again, starting with a robot dog
Full 3D-printed, open-source design, with an ML controller inspired by Disney's bipedal animated robot
Documenting the entire build on stream
not a lot of long-form hardware engineering content out there
been liking this series from @geepytee
it’s cool to see how a pro builds things, from start to finish
I see a lot founders trim their absurdly ambitious ideas when pitching to investors
This is a mistake
You want to be Sam Altman telling a room full of VCs with a straight face that he will take billions of their money, build AGI, and then ask it how to generate a return
Video is 1131 frames long, but only passed every 10th frame all of them together in a dictionary (I think this only counts as 1 request but a ton of tokens?). Not cheap (~$30)
Here's the full code: github.com/ggoonnzzaallo/…
Based on OAI's cookbook