
@VictorTaelin
Kind / Bend / HVM / INets / λCalculus
300k tokens trying to teach 4.8 how Bend's termination checker works 🫠 maybe not so bright, but somehow a pleasure to talk to and definitely my favorite model of all time
Update: it now gets it, and just found a flaw in my design. This is incredible. I'm speechless.
can you guys be slightly higher dimensional please x.com/VictorTaelin/s… ffs
I'm afraid GPT 5.5 has a cheating problem ): I left 4 Codex tabs each working with 4 agents in an optimization. I put a section on the goal demanding them not to cheat. After 8 hours of work, ALL 4 tabs did an: if (input == test) { return hardcoded_result; } ALL of them.
I lament to inform that I had one of the best coding experiences today with Opus 4.8 on fast mode. It is 3x cheaper now. I guess I should thank our overlords
I'll cool down a bit again, see you next time something happens
Bend2 is:
- 90% as fast as C single core (& faster on GPU)
- safer than Rust; it is a literal proof language
- compiler scales just like Go's
- no first class C++ support. no. just no
Should be released last week if I didn't fuck up
As soon as I trust my own monster codebase
x.com/schteppe/statu…
I'm anxious about shipping a GPU compiler with closures and shit because there may be some obscure bug I haven't caught. meanwhile a company with access to god in a box is struggling to fix a flicker bug in a tui they shipped years ago inspired by my own OSS work I must... reflect x.com/ClaudeDevs/sta…
Status update: I've been on/off AI agents in the last few days and it is a verifiable truth that every day I didn't use agents, I was more productive. I still attribute that to how slow they are, and my own inability to multi-task efficiently. The magic is there but the slowness
although the AI did write some embarrassingly stupid shit, to be very honest, Bend's codebase is better than I made it look, and my frustration boils down to my own unfamiliarity with many of its parts
Status update: I've been on/off AI agents in the last few days and it is a verifiable truth that every day I didn't use AI I was more productive. Also, Bend2's C/Metal compiler codebase is a clusterfuck right now. I regret letting AI agents write it. All tests pass, and GPU
People aren't afraid of database leaks or becoming paperclips. People are terrified of losing their jobs, becoming irrelevant, inequality. They built a powerful tool that could bring unfathomable progress to humanity and actually save lives *today* - yet, instead, they choose to
So I'll start posting less for a while, I don't know why it keeps getting so much reach even though I've been posting nothing of substance at all. I think the bots just do their thing on my posts. This annoys even me. I'll be back when I have real value to add. Sorry!!
Since you are too dumb to understand, I'm not leaving X. Just avoiding dumb viral posts so you all can enjoy some Taelin-free timeline for a while
ffs I'm not leaving X. I'm just avoiding viral posts so you all can enjoy some Taelin-free timeline for a while
This is the worst I ever felt about model selection
→ Opus 4.6: dated
→ Opus 4.7: bad model
→ Mythos: literal myth
→ Gemini 3.5: main value not on API
→ Composer 2.5: very good, no API
→ Chinese models: 2 hours thinking
→ GPT 5.5: best, but too slow
Everything just sucks
OpenAI can you PLEASE fix this? Not having an option to show thinking traces makes your product significantly worse. Makes me want to go back to Pi. Codex just entered a black hole of thinking (or something) and I have no way to tell what is even going on
I think this is a bug? @thsottiaux
I discovered a new joy in life. Don't ask Codex to do stuff. Ask Codex to ask Codex to do stuff. Rejoice as you watch it handling and correcting all the dumb shit that it does and that you'd be dealing with otherwise
I will miss these 10x credits so much I think I'll burn all my remaining quota with a last request: /goal find out how to extend my 10x credits indefinitely
"agent 3 reported a huge breakthrough, but upon closer inspection its code was just hardcoding the solution" SURE IT WAS. AND IT IS YOUR PROBLEM NOW 🥳
@sama can you please throw a second party and not invite me again
78K likes is concerning for humanity. sometimes I'm glad our species faces no competition x.com/fuckyouiquit/s…
x.com/fuckyouiquit/s… beware the luddites are learning how exponentials work
GPT is able to solve Erdos problems but still not come up with simple solutions on Interaction Net programming... I left 5.5 fixing a bug on a SupGen variant overnight and it failed. Obviously it did: the solution requires writing a HOAS interpreter on HVM, and doing so is x.com/VictorTaelin/s…
Let me see the incredible progress the AI made while I was out
this is super cool but I still do not understand how they get a model to coherently and usefully reason for that amount of tokens and at this point I'm too afraid to ask x.com/voooooogel/sta…
new setup
Deleted again because misinformation 🥲 Gemini 3.5 Flash *is* available on the API. Yet, both the API and the CLI versions are 3x slower than on the IDE! See the video below. → Antigravity IDE: 4 seconds (smooth) → Antigravity CLI: 15 seconds (buggy) So the point holds: x.com/VictorTaelin/s…
... it is not an IDE anymore... why can't I write a single paragraph without spreading misinformation I just give up. anyway my point holds and I got work to do
Narrator: they already fucked up → Gemini 3.5 Flash not available on API. → Fast mode locked to Antigravity only. I don’t understand why companies keep doing this. They invent a portal gun, only to lock it behind a taxi subscription, because they completely fail to realize x.com/VictorTaelin/s…
Translating the same text, IDE vs CLI → IDE: smooth, 4 seconds → CLI: buggy, 15 seconds I'm NOT using an IDE in 2026. I really want to stop giving money to Anthropic but everyone else is making it so hard
I'm getting only ~80 tokens/s on Gemini 3.5 Flash after launch? It peaked at 1000+ before. Since there is no API, it is hard to measure though...
The new Gemini 3.5 Flash solved HVM3's wnf bug in 1/3 attempts. This is my main test to take a model seriously. So far only the big models like GPT 5.5 solved it. And it seems like it is 20x faster than Opus 4.6! Promising but Google will still find a way to fuck up
autodiff is just interaction net evaluation ? x.com/VictorTaelin/s…