I do the same, and have excellent results. Today Gemini 3.1 Pro high diagnosed and solved, in one shot, 3 complex issues that Opus Max had been stumbling on for a few hours. This was even after I started new chats and tried debugging with Ultracode instead, with Claude.
As much as people on HN like to dunk on Gemini, I’ve always found it to be better than Claude at understanding a code base.
I sometimes think about the time I was on a road trip with a friend, and what was generally a 3-hour trip to the mountain was 5 hours with my friend driving.
Midway through the trip I got suspicious of the duration, since traffic was fine. He was adamant he knew where he was going.
I pulled up Google Maps and, sure enough: 3 hours.
Turns out, his mapping app wasn’t aware of an offramp, so instead it wanted us to drive an extra hour, then do a u-turn and drive back to take an offramp.
No, it's more that those apps need to be able to make all of the tool calls Siri AI can make, which would allow third-party developers to collect data they shouldn't have access to.
App developers can already access the on-device foundational models through an API, but I don't think many developers want to do that because there are better models.
The only thing you saw was phoning home; that’s what’s gonna be interesting when Apple releases their version. Is it phoning home 100% of the time, or can you turn off the Internet and have it perform in the same way? There will be plenty of YouTubers who will give it that test, a test, I might add, that they haven’t done up until this point for anything that Google has put out.
The transfer of wealth is also diminishing, as older generations didn’t save as much, are unable to retire, or are spending through most of their retirement savings.
Well, obviously it’s cherry-picked. It’s an example of something that challenges my intuition. Most things align with my intuition because I’m in my late 30s and have seen enough of the world to have a fairly good idea of the rough numbers. Here’s another one: the London Underground is older than the telephone.
There’s a light board game called Timeline where you have stuff like this and there are so many surprises. Temporal stuff is hard to reason about and the game catches that. But with large numbers one loses intuition easily: NYC’s subway vs. all domestic and international US air travel is closer in total passengers than one would think. The median American did not fly last year.
Stuff like this. It’s just Gladwell-fodder but numerically fun.
I misinterpreted your intent here, and that’s on me. Thank you for explaining; you clearly picked the sample as a comparison of fact, not as narrative. Apologies.
Mythos was announced a few months ago and has actually been demoed at many companies, all of which have reported on its abilities, supporting the claims made by Anthropic. How is this in any way similar to the FSD situation?
You're conflating the protracted promises for full self-driving with the current rollout of autonomous driving features in Teslas, a feature people are using today. I've driven with multiple people who use their Tesla's self-driving and report on its quality/accuracy; this isn't some overpromised future feature.
And I mean, it's not like Anthropic is a zero-product company that is only offering gated access to their only product; Opus 4.7/4.8 are very good and are driving billions in revenue. Anyone can use them and see how good they are, and it is clear that they are very good models at many things. It is no huge leap to imagine that a model that is 10x bigger is also better at many of the tasks that Opus is good at.
They are gating the release because of cybersecurity/misuse concerns, which makes sense because:
1. Existing models are already being used to find exploits and hack into systems
2. We don't know the effects of releasing a tool which can autonomously exploit systems, especially in a world driven by a "security through obscurity" philosophy. It makes sense to give a heads-up to patch up software that affects billions of users before releasing it.
Imagining that this delayed rollout is all a big marketing scheme, that they have gotten dozens of multinational companies to play along, and that Anthropic is somehow now just patently being dishonest about something while they have every incentive not to be dishonest (especially when they are neck and neck with OpenAI and their relative success depends on verified claims about model abilities), is pure conspiratorial thinking, driven more by a motivated cynicism about AI companies than by a reasoned examination of the claims being made.
Funny that your takeaway from protecting science was “where are you going, we have all the money so just dance with the devil and pray your research doesn’t conflict with or debunk any political views”.
Shame. Moderation on HN has been really heavy-handed recently, I’ve noticed.
Don’t get me wrong, my friends are all making killer money…but they are also all out of therapy sessions, many are back on ADHD and SSRI meds, and the company seems to be full of egos and heavy-handed middle management. I get the joy of meeting up with friends and their coworkers, so I get to hear…a lot.
It’s very much the journey; life is a journey. Much like learning is a life practice, so is happiness. What makes you happy today will not be the same 20 years from now. The sooner you can find peace in simply being human, the more fulfilling your life will become. It’s cliche to say, but I find it honest and true.
Tech culture preys on making you feel inferior if you aren't loaded with RSUs and equity, but so many of these people hoarding wealth are miserable in real life and will never admit it, because it’s the identity they’ve built up. They buy Porsches or homes just to feel something, or to have something to show people.
Money at some point has diminishing returns, and once you have enough to be whatever your version of secure is, you should stop before you can’t even enjoy happiness. Until then, enjoy the process and do not compare yourself to others, as that’s the theft of your own happiness.
Actually, I want to make lots of money to help my friends.
I've already committed to pay for a friend's kid to attend college (within reason though; I'm thinking about 15k, as that's a good head start).
Another I occasionally help with nominal amounts. In exchange he's shown me around the world. He's basically a genius who is fluent in like 4 languages. I always respect others who can do what I can't.
That's the dream anyway. Sell your soul, then take care of your friends in an attempt to buy it back.