
Love the story and the article. The only nit I have with it:

> “His answers are… understandable, and maybe in some ways more digestible than we would get from an expert,” he said.

This does not reflect his actual responses? The interviewer keys off his most emphatic sounding words to keep the conversation flowing, but his answers are generally inscrutable.

He did a great job given the cards he was dealt though.


I was curious to get a sense for the overall "success rate" at a glance, so I uploaded the author's data as a spreadsheet and color-coded the conversations based on length (short=red, medium=yellow, long=green) with the help of Claude:

https://docs.google.com/spreadsheets/d/1VqMF0xWzJMXWNndeY4P1...

It's particularly nice if you zoom out so you can see all the rows at once.

I hope the author doesn't mind - if you do please tell me and I will take it down!


> but part of me wonders if a lot of their success is attributable to the place just being well run in general

That was my sense reading the article - that the author would be running a successful engineering org using any language really.


I’m not sure how that was your takeaway..?

> We retired the “Nerdy” personality in March after launching GPT‑5.4. In training, we removed the goblin-affine reward signal and filtered training data containing creature-words, making goblins less likely to over-appear or show up in inappropriate contexts. Unfortunately, GPT‑5.5 started training before we found the root cause of the goblins.

The prompt is just a short term hotfix/hack because they couldn’t get the proper fix in in time.


Then maybe stop training and make a real fix?

If you need to put baby guardrails on your model because the training is effed up, maybe you should rethink how you make these models and how much control you really have on it.


There’s a difference between a relationship with a person and an organization. I think the difference is large enough that the analogy doesn’t really hold.

Exactly. Only humans deserve at least one chance to grow and improve. Orgs are heartless legal entities that deserve no loyalty whatsoever; they are all one acquisition away from turning on you (as a customer or an employee).

Organizations are made up of humans too... but the bigger they get, the less you notice that. Back in the day when GitHub was still a small company with one (very good) product, I can understand having a feeling of loyalty towards them. Now that they are part of M$ and more beholden to M$'s KPIs than to their users, sticking with them only out of nostalgia is probably ill-advised.

I think the example is still valid; orgs will not change if they still get what they want from you.

For businesses, staying signals their KPIs that everything is great and there's no need for change.

Just some feedback - I would love to see some screenshots in your GitHub READMEs!*

*I saw the second project has a partial screenshot, but not a full one.


Done, for what it's worth. Krbtray doesn't have much in the way of a GUI to show though!


Good point, I'll do that.


> “What’s beginning to emerge is that the problem was maybe easier than expected, and it was like there was some kind of mental block.”

Even if AI never progresses past this point, it still seems like a huge win for math research to “clear the deck” of these.


The current state of AI is incredible and useful, and it doesn't need to reach AGI to be revolutionary. For example, I uploaded a conversation between a few people and not only asked it to translate the text but also to do a psychological analysis of turn-taking and other conversational cues. Just a decade ago, the speech-to-text Dragon NaturallySpeaking[1] was not reliable even with one speaker and no background noise.

[1] https://en.wikipedia.org/wiki/Dragon_NaturallySpeaking


I love Kernighan’s Law:

> "Debugging is twice as hard as writing the code in the first place. Therefore, if you write the code as cleverly as possible, you are, by definition, not smart enough to debug it"


Hm, I find this one very dubious. When you make an effort to write correct code, you think hard. Debugging is more like watching the execution of what you already thought through and asking, "OK, where did I go wrong this time? Show me in the process." It is usually much easier to see why something is wrong, or at least at which step it breaks.


I had so much fun making videos with my mom when it came out. During the first two weeks, we made over 100 cameo videos together - we were constantly running up against the upload limit. It unleashed tons of genuine creativity, joy, and laughter from us.

After those first two weeks though, we just… didn’t use it again. The novelty wore off and there wasn’t anything really to bring us back. That was the real downfall of Sora.


The problem is that due to the ease these can be made there is also really no reason to make this social. “Why would I look at somebody else’s creations when I can do mine.”


I can see some use for this case - "look Morty, I turned myself into a pickle!" - but just like image / meme generators, this is 10-30 seconds of engagement within a friend circle at best (some might go viral, but that won't bring in much money for OpenAI in this case).

There will be (or is, I'm behind the times / not on the main social networks) an undercurrent or long tail of AI generated videos, the question is whether those get enough engagement for the creators to pay for the creation tool.


I'm not an artist or creative person in any sense. My persona is closer to a settings menu than a colorful canvas.

The AI art I have seen creatives produce is far beyond anything I have been able to come up with. We're not at the point yet where you can just prompt "Make me a video that is visually stunning and captivating" and get something cool.


> My persona is closer to a settings menu than a colorful canvas

ah, but what a persona that would be if you were a Kai's Power Tools settings menu!


> The AI art I have seen creatives produce is far beyond anything I have been able to come up with

.. such as? What's the "Mona Lisa of AI art"? Is there, like, a gallery? Awards?


Unfortunately I don't have a solid reference point or checklist for the defining qualities of "good art". And frankly I don't take those who do very seriously. To me art is all about the personal vibes you get from it. So I enjoy Zach London (gossip goblin), Bennet Weisbren, and voidstomper/gloomstomper if you want something to measure with your "real true art" checklist.


Didn't we used to think the same of Photos?


They're different impulses. Some want to consume. Others want to create.

TikTok and social media are a strange mix of both, with people posting response videos to everything.

Personally, I've stopped subscribing to Spotify, YT Music, etc., because the slop from Suno is good enough to replace mainstream music or whatever lofi playlist. It's free, it's good enough, and it doesn't get grating after a few days the way a favorite song does.

The video slop can well replace TikTok and Reels. Make educational content about your hometown. Explain how to throw an uppercut.

But I guess the desire to create something that others would consume is also different from the desire to simply create.


Sweet Jesus. You realise this is the mental equivalent of stuffing your stomach full of junkfood and soda every day?


This is a mainstream break up song: https://youtu.be/ekzHIouo8Q4

This is a vocaloid break up song: https://youtu.be/9pQR4a5sisE

The first isn't bad by any means. There's a million break up songs and that's one of the best sad ones. Most are just... angry? Blaming? Empowering? They work fine. They sell records. Many have a billion views.

But the second one, even with the clunky translation, strikes somewhere deeper. It's written by someone who had enough time ruminating on a break up. The ending hits a little harder, because break up songs are about endings.

Both are sincere, but the first feels more formulaic. I'm inclined to think the first one is the soda.

I feel Suno leans towards this group of songwriters and poets who have something to say. Sora doesn't.


Vocaloids are hardly similar to fully AI-generated songs. Vocaloids are still human controlled.


And also that VOCALOID uses "traditional" signal processing techniques as opposed to generative deep learning techniques.


As opposed to the kardashians and real house wives and Chappell Roan?


No, the whole horseshit belongs together of course. Just that the AI slop is the logical culmination of the dumbed down pop-culture of the last 15ish years or so.


That doesn't sound meaningfully different from what people are already doing on Instagram and TikTok all day.


Absolutely correct, and my comment is by no means aimed strictly at the AI slop.


For a lot of people music is a focus aid, not the object of contemplation.


Some want to consume... content that they don't think they could do in one minute themselves. They want to consume content made by other humans, even if it's still brain-eating algorithmic fodder, but still. Sora proved it quite clearly. These clips had ZERO value.


> Personally, I've stopped subscribing to Spotify, YT music, etc because the slop from Suno is good enough to replace mainstream music or whatever lofi playlist.

The musician in me just shed a tear


Pink Beatles, in a purple Zeppelin comes to mind


Had to create an account just to let you know that someone out there got the reference.


That comment for sure made me sad


I occasionally use Suno to re-imagine songs in different keys, tempos, and genres, and sample them. Most of the output from Suno is slop, but occasionally has a few good bits you can sample, chop up, re-pitch, and create something totally new from, which also has the added benefit of being unrecognizable to rights algorithms and lawyers from major labels.

It's a neat tool for genuine creators, and a crutch for people interested in slop.


Modern music has done this to itself. When the human product is already pure corporate slop, it's not hard for AI to compete.

Hopefully AI outcompeting humans at slop sparks a renaissance of humans creating truly beautiful human artwork. And if it doesn't, then was anything of value truly lost?


> Modern music has done this to itself

I get my modern music from Bandcamp. If you can't find good stuff to listen to, that's a 'you' problem.


How much of your super-awesome bandcamp music is topping charts, selling millions, packing mega stadiums, and is penetrating the zeitgeist so deeply that people around the world are addicted to it?

Maybe, just maybe, I'm not talking about "my" music tastes, but offering commentary on the state of music at a global scale. Weird that this point was so hard to follow!


So true. AI music gens like Suno can't do Paul Shapera works even remotely, but can recreate a lot of pop or EDM music very faithfully. There's just no distance to close, it's already mainstreamly bad.


> Modern music has done this to itself. When the human product is already pure corporate slop, it's not hard for AI to compete.

What are you talking about? There’s lots of modern music that’s not corporate slop and that’s absolutely great. Never in history was access to great music as easy as it is now.


I'm talking about modern music. Just because a couple of dweebs on hackernews have "totally amazing underground music" doesn't mean the overall zeitgeist agrees. Regardless of your esoteric music tastes, music by sales and music by charting tells a very different story. And that story is one of replaceable slop.


So find music you like that isn't modern corporate slop. My music right now consists mainly of indie stuff I've found on youtube and daft punk. No plagiarism machine needed, just human-made music


"No plagiarism machine needed, just human-made music"

From wikipedia: Many Daft Punk songs feature vocals processed with effects and vocoders including Auto-Tune, a Roland SVC-350 and the Digitech Vocalist. Bangalter said: "A lot of people complain about musicians using Auto-Tune. It reminds me of the late '70s when musicians in France tried to ban the synthesiser. They said it was taking jobs away from musicians. What they didn't see was that you could use those tools in a new way instead of just for replacing the instruments that came before. People are often afraid of things that sound new."


Did Daft Punk put in a lot of effort to remix existing sounds to make their own music? Yes. Did they type "pls make french house electronic music number 1 chart" into a text box? No. Did they also credit original authors? Yes. I've not gone through their whole library, but for example, Edwin Birdsong has songwriting credit for harder, better, faster, stronger


There's this fallacy with AI generation that people think that all you have to do is type "i lik musik pls remake favrite song but better" and you get amazing results.

This is patently untrue.

It's like how if a junior engineer and a principal engineer use claude opus 4.6 they get radically different results. The junior doesn't have the taste or knowledge to know good from bad so the AI oversteers and slop is made. The principal has finely tuned sense of taste and deep knowledge, so they aggressively steer the AI at every step. This is also true in other AI domains.

To be absolutely clear: you can't make good AI music. Try all you want. Try the prompt you just wrote. Show and tell. It's not something you're going to be able to do.


> The video slop can well replace TikTok and Reels. Make educational content about your hometown. Explain how to throw an uppercut.

There is a fundamental issue of trust here. Facebook has me tagged as history nerd so I get to see those slop videos. They are fun, but always superficial and often plainly wrong. So unless the slop comes from a known, trustworthy source, the educational element is simply not there.

For throwing an uppercut it's even more important, if you follow wrong slop instructions you can end up breaking your wrist or fingers.


> the slop from Suno is good enough to replace mainstream music

I wonder what OP categorises as 'mainstream'. As a classical musician this breaks my heart.


Many of the things on a top #100 list for the last few decades. That includes plenty of "indies" as well as pop.

There are exceptions though. FUKOUNA GIRL by STOMACH BOOK, for example. AI can't come close to replicating something like this. Not the cover art, not the off-key voices, not the relatable part of the lyrics. I don't believe this is a top #100 song, though it certainly is popular.


It is chaotic crap.


you could not waterboard an admission of bad taste like this out of me


How do you get Suno songs for free? You listen to others or make your own?


Almost nobody listens to others' songs on Suno, that's the entire point.

You wouldn't care to order the food as I personally like it -- might be too spicy (or too bland) for your taste.

Suno songs are overtuned for personal preference in the same way.


I get that, but you have to pay to create your own.

And on the second part, I somewhat disagree. I mean, yes everyone has a personal preference, but if you bucket all those personal preferences they all fit nicely together (In many buckets).


Fairly narrow buckets, sure.

I think the point of Suno is to make you not search for your specific thing though, and instead produce your own. Searching for niche music has always been a thing. If our goal is to listen for free, we don't care about Suno (or any other way to make music) one bit, it's just another DAW for those making music.

And AI music in general sure has its fans, check out Only Fire for example.


They have a discover section for songs made public.


I'm with you here, this resonates so much. I'm so fed up with endless subway tunnels; they all look and sound utterly the same, and boring.

So I quit riding the overpriced subway altogether and now consume AI-generated subway imagery and soundscapes for free. They are just good enough to feed my passion for boring tunnels.

Some ego-bloated edgelords had the nerve to tell me that there are, like, other modes of transportation, but I honestly find their high-horse elitism despicable. Damn morons.


Sounds like when we first had smartphones with orientation sensors and we could drink a beer from the phone, so cool... for 2 weeks.


But now you can vibe the same app 1000 times for root beer, coca cola, ginger ale, even a milkshake, and nobody will ever have to have a new idea again!


I wouldn't be surprised if the beer apps cost less to develop than one AI-generated video.


Was there a Send Me to Heaven for Sora?


That is for loved things


This is consistent with a lot of AI apps. I fell in love with Gamma and haven’t used it in forever. Same with NotebookLM.


I somewhat consistently use notebookLM for podcasts of academic papers I'm reading in my PhD. You have to go read it yourself afterwards but it makes better use of time in the gym or doing dishes/groceries.


> You have to go read it yourself afterwards

^ this is important.

Otherwise you may very well be missing anything really surprising or novel.

See for example https://www.programmablemutter.com/p/after-software-eats-the... , an experience report of NotebookLM where

> It was remarkable to see how many errors could be stuffed into 5 minutes of vacuous conversation. What was even more striking was that the errors systematically pointed in a particular direction. In every instance, the model took an argument that was at least notionally surprising, and yanked it hard in the direction of banality.


On one hand 2024 in AI time was a decade ago.

On the other, Google might not have done much to upgrade the podcast feature since then.


This regression towards the mean is still very much a feature of the newer models, in my experience. I don't see how a model that predicts the most likely word based on previous context + corpus data could possibly not have some bias towards non-novelty / banality.


It’s gotten somewhat better over time though clearly not their top priority.


I found the podcast's bantering and breathless enthusiasm distracting. I guess there was a way to make it more no-nonsense? I found I lost content when I tuned it for brevity.


I just use elevenreader for this. I copy in essays or whatever text I want to listen to and it works decently well. It's far from perfect, but certainly good enough.

Sometimes I'll take deep research output and listen to it too that way.


I tell them “no idle conversation or verbal tics” in the instructions.


I found notebookLM to consistently make up about 20% of its summary. Entertaining but unreliable.


I used it mostly to learn about history. There isn't much damage if it gets a 1600s or 1700s detail wrong. My high school teachers got much of it wrong too.


I've found notebookLM summaries to be too high-level and oversimplified to be useful. Hopefully in a few years they can go deeper.


You can also use NotebookLM notebooks as a source for the Gemini app and ask it to do more in-depth summaries with custom prompting.

This somewhat makes the whole NotebookLM less useful, but still.


I also like doing that for topics that I am tangentially interested in. One minor thing that I find annoying is that the narrators switch roles in the middle of the conversation. They start with the female voice explaining a concept to the male voice and suddenly they switch, by which point I have identified myself with the voice being explained to.


> You have to go read it yourself afterwards

Or before! Either is mandatory to actually learn the content.


Just listen to actual audio books... literally doing double the work for no benefit... why?


There aren't a lot of highly technical audiobooks or ones that give the same specificity that would be the same as an academic paper


Okay but the user is describing listening to papers, then having to read the papers because listening to them isn't efficient. So why bother listening to it in the first place if you're going to read it?


Not yet but it seems like they're getting to the point of AI narration finally being good enough to make any text an 'audiobook'.

Having said that I absolutely hate the audio format, I only used it when I had to drive or when I swam lanes. But these days I do neither.


No, reading verbatim from a technical paper is way too dense. You need a lot of filler words to slow it down and repetition to make it stick when read aloud.


Hmm fair enough but text manipulation is exactly something where LLMs do shine. Writing and modifying text is what they were meant for.

Ps I don't mean the word 'manipulation' in a negative context.


Writing a book takes like 2-3 years on average. Papers are published everyday. Having a cute two-person "conversational chat" w/ audio works for a lot of people vs. just reading a paper. "No benefit" to you perhaps. Don't generalize the lived experience.


Okay but this person is literally saying that listening with LLM tools isn't helping their understanding and they have to still read the paper... why listen at this point? Why listen using a tool that literally causes you to do more work?

We all have the same amount of time on this Earth, saying how great a tool is that is causing you to do more work is just... weird?

I'd personally never do this, I value my time.


It can synthesize and summarize many topics.

For example, I can give it 8 papers on best practices in online marketing, it will turn it into a 20 minute podcast.

There are errors, but also with real podcasters.


Yeah it's not just the hardware depreciating, it's the social impact of what the model can do


NotebookLM is great for learning I feel


It's not just software: I use my Vision Pro (now in year 3) less than once a month, and each time the painful, awkward set-up and the difficult interface sour me on the device all over again, until a new blockbuster movie like "Project Hail Mary" appears that, watched on the VP in 4K on a virtual 40-foot screen, blows my mind.



The interesting difference here is that other hedonic activities do bring people back even after the first time they build up a tolerance and get bored. But many of these AI "creative" apps seem like a one-and-done thing. Once the novelty wears off there isn't anything more deeply rewarding to bring people back.


It’s because they are slop, which is only funny through novelty. Stephen Hawking at a skate park is funny for a bit, but as soon as the novelty wears off it’s just slop.


It's not really that people wouldn't come back - it's that they were losing money on each customer.

Those 100 videos probably cost $100+ for them to create. Did you pay them $100+? (not a criticism, just a re-framing)


When it launched we all talked about the serving/inference costs being massive. In hindsight if they had a paywall, it might not have self-imploded so fast, might have stayed aspirational, and they might have a profitable business today. Interesting case study.


I think it's the same reason chess tournaments where two AIs play each other are not as popular as ones where two humans play. Maybe it's because humans generally compare themselves to other humans, and that's part of how they assign value.


This tracks my usage exactly. It was like Mad Libs - in that moment it was THE MOST FUN but after a while it became just a novelty bordering on... creepy. Now I feel kind of guilty for having exposed so many friends to what looks like a data gathering scheme.


It's the same with e.g. faceapp, fun for a minute but then... then what?

And this is the challenge that these tools have - they have to have a free tier to get people to explore it, but unless they can make it a habit, those people will never upgrade to a paid subscription.

I have no figures, but if I'm being optimistic, these freemium subscription services have 10% conversion rate at best; can that 10% pay for the other 90%? For a lot of services that's a yes, but not for these video generators which are incredibly compute intensive.

I'm sure there's a market for it, but it's not this freemium consumer oriented model, not without huge amounts of investments. Maybe in 5-10 years, assuming either compute becomes 10-100x cheaper / more available, or they come up with generators that run cheaper.


A lot of AI hype is parlor tricks


Sounds like me with listening to AI covers. After a couple of weeks I couldn't care less. But I was so stoked in it at the start


The Cameo feature is really excellent. The likeness of both the person and the voice is exceptional. I really enjoyed making some funny Cameo videos with my friends. I don't know of another simple way to insert your own avatar with your own voice into a video, and I'm pretty deep in this space.


Yes, Sam Altman was talking on his blog about how he lost you and how it led to Sora's downfall :D But honestly... I believe this too. There's just no value.


Yep. Impressive toys, but not useful day to day.

There's some market for b2b I'm sure, but as a consumer facing product it's tough to see how it could ever come close to paying for itself.


Reminds me of when photo filters and initial stickers and mirror filters came out on MacBook in like 2007. It was super fun for a couple days then the novelty wore off.


Humans are very good at pattern recognition. Even if you generate different stuff, you still see a pattern in the cutting, the color grading, the cadence of movements, the camera lens used, everything; your mind will tag it as slop.

Essentially you are watching the same videos over and over, subconsciously.


This is something that people working on procedurally generated games have already noticed. No Man's Sky has billions of planets, each with "unique" plant and animal species, but you can easily sort them into a few dozen templates with minor variations.

Procgen has a niche, but it never became ubiquitous, because for most people exploring a nice hand-made intentional environment is better.


You say that, but when you look at most “content” on social media it is the same video over and over again. How many JRE podcasts are basically the same crap as last time? How many influencer “life” videos are the same thing over again? Even the stuff I like is formulaic to the point that AI could almost write the scripts.

I think people attach to other people more than to “AI”. When there isn't a narrative “person” behind the content, it is way less interesting.


Wow that's a really good point. The style of the videos did become quite repetitive.


You know who the novelty didn’t wear off for? My in-laws, who for some ungodly reason are superusers on TikTok. Once the audio-enabled, realistic videos of babies and children hit the feed, it was a virtual 9/11 moment. The group chat is spammed by 90% believable videos of babies arguing, dogs doing smart shit and it’s all slop.

I am hoping against hope that this will stem the tide because the slop-generators are too lazy or too poor to run other models locally or search them online.


"...and when everyone's super, no one will be"

I think this is starting to play out.

When I personally see a blog post which didn't need an image, but still does have an AI-slop image banner, I mentally check out. I might have Claude summarize it, or (more likely) just skip it altogether.


I honestly forgot about Sora until this post, and yeah same behavior played with it for a bit, then moved on with my life.


[flagged]


probably one of the few human commenters remaining here


Cue a flood of crass jokes as the bots attempt to prove their humanity


Such a stupid joke but it gave me a laugh.


noice


[flagged]


(FYI, this is an LLM bot, check their comment history and note the repetitive structure with every comment they've ever posted all within the last hour)

  > This is the right question but hard to answer in practice ...

  > The brownfield vs greenfield split is the real answer to ... 

  > The babysitting point is the one people keep glossing over ...


I dunno, it was the same for me and creative writing with AI.

First it looked like it was crazy inventive, good at writing snappy dialouge, and in general a very good font of ideas.

Then the same concepts, turns of phrase, story ideas kept reappearing, and I kinda soured on the concept.

I haven't done it in a while, but that kind of usage really shows the weakness of LLMs - if you keep messing with its generations, editing what it made, then as the context length keeps increasing, it's more and more likely to go into dumb mode, where it feels like talking to GPT-3, constantly getting confused, contradicting itself, etc.


I've never seen an AI video that made me feel anything other than bland dread. What were you generating that was so entertaining? Had you ever actually developed creative skills before?


Please don't cross into personal attack. Your comment would be fine without the swipe at the end.

https://news.ycombinator.com/newsguidelines.html


I think you’re fumbling on an important distinction.

Sometimes people want to paint, sometimes people want a painting.

To have a wonderful time with their mom… I bet they had absolutely zero interest in the act and process of making silly videos.


Totally. This wasn't a situation where a stranger was slopping another stranger, it was a mother and son doing something fun together.


I get your point but it goes too far in the opposite direction. We should now discuss absolutely nothing in relation to Sora and genAI videos? That seems overly charitable to the platform.


Here, let me try this approach:

Read the main comment out loud to yourself while imagining it’s someone sitting at a table at a pub.

Now imagine someone turning to this person in the pub, and speaking the subsequent comment, word for word.

No seriously, try it out.


Agreed. I did try this out! So the reply to the original comment is dumb. I actually dismissed it for being flippant.

Your reply is more interesting. Hence my (albeit maybe snarky) chiming in. So the original comment does end at a very specific app/sora related conclusion. "Sora didn't keep us coming back."

If I may amend your scenario: imagine this bar is actually in the center of SF or across the street from Open-AI or whatever. We're on HN discussing a post on X about Sora.

The appeal to humanity is not wrong. My point is more let's keep the connection with that humanity in relation to AI, to Sora, to what's going on in this forum.


Come on now...'We're curing cancer, right?!'

You didn't at least puff a little ack through your nostrils for that one?


> Current Common Lisp implementations can usually support both image-oriented and source-oriented development. Image-oriented environments (for example, Squeak Smalltalk) have as their interchange format an image file or memory dump containing all the objects present in the system, which can be later restarted on the same or distinct hardware. By contrast, a source-oriented environment uses individual, human-readable files for recording information for reconstructing the project under development; these files are processed by the environment to convert their contents into material which can be executed.

Am I reading this right that people can (and do??) use images as a complete replacement for source code files?


All the magic of Smalltalk is in the development tools that work by means of introspection into the running image, writing source code in text files causes you to lose all that. Add to that the fact that Smalltalk when written as source files is quite verbose.

Smalltalk does have standard text source file format, but that format is best described as human-readable, not human-writable. The format is essentially a sequence of text blocks that represent operations done to the image in order to modify it to a particular state interspersed with "data" (mostly method source code, but the format can store arbitrary stuff as the data blocks).

One exception to this is GNU Smalltalk which is meant to be used with source files and to that end uses its own more sane source file syntax.


Fascinating. Thanks for the explanation.


> Am I reading this right that people can (and do??) use images as a complete replacement for source code files?

Images are not replacements of source code files. Images are used in addition to source code files. Source code is checked in. Images are created and shipped. The image lets you debug things live if you've got to. You can introspect, live debug, live patch and do all the shenanigans. But if you're making fixes, you'd make the changes in source code, check it in, build a new image and ship that.


in smalltalk you make the changes in the image while it is running. the modern process is that you then export the changes into a version control system. originally you only had the image itself. apparently squeak has objects inside that go back to 1977: https://lists.squeakfoundation.org/archives/list/squeak-dev@...


Does "originally" mean before release from the offices and corridors of Xerox Palo Alto Research Center.

Perhaps further back: before change sets, before fileOut, before sources and change log ? There's a lot of history.

I wonder if the Digitalk Smalltalk implementation "has objects inside that go back to 1977".


by "originally" i meant before the use of version control systems became common and expected. i don't know the actual history here, but i just found this thread that looks promising and seems to contain some interesting details: https://news.ycombinator.com/item?id=15206339 (it also discusses lisp, which brings this subthread back in line with the original topic :-)



that's very interesting, thank you, i should have realized that even early on there had to be a way to share code between images. (and i don't know why i missed that comment before responding myself)

but doesn't building a new system image involve taking an old/existing image, adding/merging all the changes, and then releasing a new image and sources file from that?

in other words, the image is not recreated from scratch every time and it is more than just a cache.

what is described there is the process of source management in the absence of a proper revision control system. obviously when multiple people work on the same project, somewhere the changes need to be tracked and merged.

but that doesn't change the fact that the changes first happen in an image, and that you could save that image and write out a new sources file.


Sorry I failed to notice your reply.

> image is not recreated from scratch every time and it is more than just a cache

Yes, some vm & image & sources & changes can be taken as the base implementation for development purposes -- a persistent cache.

The state of whatever IDE tools were in use will be saved -- is that what makes you say "more than just a cache"? If I sleep a windows desktop is that more than just a cache?

> changes first happen in an image

What if I write a plain-text source code file using Notepad, and use Smalltalk file handling and byte code compilation and command-line argument handling (packaged in the image) to write the result of a computation to stdout (and quit the image without saving)?


> If I sleep a windows desktop is that more than just a cache?

yes, so basically what i meant here is that a cache just stores data, but it doesn't store the whole application.

this is significant in that i can shut down an application (say my webbrowser), then i can upgrade it to a new version, restart, and the application will reinitialize itself and load data from the cache, but now i have a new version of the application.

whereas if i put my laptop to sleep, or better yet, hibernate, then the whole state of the laptop is frozen in place, and i can't do anything to it until i run it again. same is true for smalltalk images.

> What if I write a plain-text source code file using Notepad, and use Smalltalk file handling and byte code compilation and command-line argument handling (packaged in the image) to write the result of a computation to stdout (and quit the image without saving)?

you could be doing that, but then you would be using the image as your IDE and runtime environment, not building the actual application in your image. so you wouldn't be using what i have been taught is the traditional way of doing smalltalk development.

i am not trying to be pedantic here. it does not matter either way. i just find the smalltalk image approach interesting because it forces you to think about software development in a different way.

this matters to me because i am working with a web development platform (written in pike) that uses a similar approach, albeit more by accident than by design. the developers of the platform added support for programmable objects that are stored in the platform's database. these objects can change the behavior of the platform itself, like plugins, but because they are stored in the database they can be changed at runtime, like a smalltalk image. and all the same implications of doing that apply here too. the database becomes more than a cache, and in theory the whole platform could be rewritten such that almost all of its code is stored in the database and only a small bootstrapping system needs to remain outside. this is possible simply because pike can load and update code at runtime, and code changes can be applied without restarting, just like smalltalk.

the downside of the image approach is that it makes upgrading the base image harder, because there is no clear distinction between the base image and any user added changes. i kind of have to take extra steps to pick out my changes and apply them to a new image.

it would be interesting if that process could be improved. it probably would require some compartmentalization just like an OS where i have the base OS, my home directory and the system configuration. i can take a disk image, upgrade the OS and the rest still works. it would be nice if upgrading pharo for example would work the same way.

btw: thanks for the email. i have to ask, how did you manage to reply to a comment more than a month old? normally the reply function is disabled on comments that are 14 days old.


> traditional way of doing smalltalk development

Is that in-conflict with a reproducible build process or can we have both.

> makes upgrading the base image harder, because there is no clear distinction between the base image and any user added changes.

We've been keeping "user added changes" in external files? (Plus the changes file.)

Port "user added changes" from the source code archive to each vendor release.

(btw: I ask nicely and don't abuse this small kindness.)


> Is that in-conflict with a reproducible build process or can we have both

in part the question is what makes a build reproducible. what actually needs to be reproduced? the point of a reproducible build is that a version of source code always produces the same binary.

how do you make reproducible builds in smalltalk? reproducible builds depend on what goes into the build process. so they depend on the compiler and build tools. in smalltalk those are all in the image, and the question is then what happens when i load code into an image. am i getting that right? i am not so familiar with the details here, but i would guess that it depends on how smalltalk compiles the code and how the import process deals with timestamps and source paths, etc.

however if i work on my source code within the image and i share the code by making a copy of the image then the image is the source and the binary and there is nothing to reproduce. your copy and my copy of the image are going to be identical until one of us makes changes to the image.

i'd be curious to learn more here. outside the smalltalk world, my editing tools do not affect the reproducibility of the builds of my code. in smalltalk, just getting a new version of the code browser would change the build, wouldn't it? how do you track that or keep that separate?

> Port "user added changes" from the source code archive to each vendor release

right, but that's the "wrong" way around from the perspective of a desktop. i don't need to port my code to new versions of VS Code or vim or whichever tools i use to develop. only smalltalk forces me to do that. so i don't mean a distinction in the file structure but a distinction in the code dependency.

(i would never have considered to ask for being allowed to make a late reply, especially in this case. it is unlikely that anyone else is going to see our conversation. we could have just continued over email. but hopefully we can dig out some worthwhile details that not only me but anyone searching can learn from)


> just getting a new version of the code browser would change the build

Outside the smalltalk world could getting a new version of the code browser "library" affect the reproducibility of the builds of your code.

(indeed: vanity)


it sounds like you are asking a question, but there is no "?" at the end, so i am slightly confused. if it is a question, my answer is no, because the tools i use to read or edit the code are completely distinct from the build tools.

what affects the reproducibility is the compiler, and that's the issue with smalltalk. you can't upgrade the IDE without upgrading the compiler. say if i use pharo, and switch to a new version of pharo then i get a new version of smalltalk, and i can't compile my own code in the image with the old version any more.

based on that i don't understand how you even enable or test reproducibility in smalltalk. i'd like to learn more about that.

(re: vanity. for a while now i have wondered how i would go about creating the longest-running discussion thread on hackernews, and how long i would be able to keep it going. i think we are off to a good start here. just remember to peek in here once in a while to see if there is a response. and if there isn't, feel free to poke me by email. (unless we decide that there is nothing to add))


> … you can't upgrade the IDE without upgrading the compiler.

Do you mean because the IDE and compiler are packaged together, so when you get image' it may contain changes to both?

> … i can't compile my own code in the image with the old version any more.

Do you mean because the image' package has compiler' not the old version?

(Probably an abuse of kindness.)


yes to both.

(not as long as we keep to the topic and respond before the time to reply expires)


> … can't upgrade the IDE without upgrading the compiler.

When packaged together IDE & compiler can't be upgraded separately (without doing the work to in-effect make separate packages).

> … can't compile my own code in the image with the old version

So we could try to compile own-code' in image' with compiler', and we could try to compile own-code' in image" with compiler", but we want to try to compile own-code' in image" with compiler' ?

Not without doing the work to in-effect make separate packages. (Say we could port compiler' to image" but would that mean trying to compile compiler' with compiler".)

And now we're back to what does "traditional way of doing smalltalk development" mean because supposedly "Team/V could forward and backwards migrate versions of Smalltalk “modules” within a running virtual image."

> the point of a reproducible build is that a version of source code always produces the same binary.

Scope: does "a version of source code" just mean own-code or does it mean sources+changes.


i have just been reading https://news.ycombinator.com/item?id=48081245 about reproducible builds in debian, and that reminded me of this discussion. truth is i still struggle to understand how reproducible builds with smalltalk images even work.

in debian, a build is reproducible if the checksum of the resulting package of my build matches that of your build. how do you do that in a smalltalk image? you take the checksum of what? which objects or rather entities do you compare, and how do you compare them, to verify that a build is reproducible? it can't be the whole image, because that is guaranteed to be different somewhere, and thus the image checksums won't match. so how does that really work?


there is another thought that i just realized. what do reproducible builds mean for runtime-compiled languages like python, ruby, perl, even javascript? the point of a reproducible build is to verify that the binary i use, or the package i use, is based on the same source that i have. python et al don't have binaries, so my reproducible build check is a diff of the source tree. if in smalltalk the image is my source, then i care about knowing that the image has not been modified. that's impossible, because the image changes just by using it, unless i never save the changes. what's left is maybe exporting the source and running a diff on that.
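as a sketch of that last idea: instead of diffing, compute a deterministic digest over an exported source tree (python; `tree_digest` is a made-up helper, not anything smalltalk provides):

```python
import hashlib
import pathlib

def tree_digest(root):
    """Deterministic digest of a source tree: files hashed in sorted path order."""
    root = pathlib.Path(root)
    h = hashlib.sha256()
    for p in sorted(root.rglob("*")):
        if p.is_file():
            h.update(p.relative_to(root).as_posix().encode())  # path is part of the identity
            h.update(p.read_bytes())
    return h.hexdigest()
```

two images that file out identical source trees then produce the same digest, regardless of any per-image state.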

> … the point of a reproducible build is to verify that the binary i use, or that the package i use is based on the same source that i have.

I have been talking about how to reproduce a particular build, not how to verify.

From the same initial state, perform the same sequence of actions, implies arrive at the same final state. Reproducible.

"Retaining your old change logs gives you a record of all the changes you have made to the system. This will prove invaluable when you receive a new release of Smalltalk/V. … In short, back up the image and change log together, and you shouldn't have any problems."

page 285 Smalltalk/V 286 Tutorial and Programming Handbook 1988


> if in smalltalk the image is my source

As-before it doesn't need to be.

https://news.ycombinator.com/item?id=47354288


My imagination has failed. I don't understand which issues you are concerned with. I think we're done.

i didn't see that comment, sorry. my primary purpose here is to learn more about smalltalk and the image based development model. you brought up a few interesting things that i have questions about. reproducible builds is one of them. and by question i don't mean to verify the validity of claims but to ask about them to learn more.

The image is not stand-alone: there should also be a sources file and a changes file (and of course a virtual machine).

"When you use a browser to access a method, the system has to retrieve the source code for that method. Initially all the source code is found in the file we refer to as the sources file. … As you are evaluating expressions or making changes to class descriptions, your actions are logged onto an external file that we refer to as the changes file. If you change a method, the new source code is stored on the changes file, not back into the sources file. Thus the sources file is treated as shared and immutable; a private changes file must exist for each user."

1984 "Smalltalk-80 The Interactive Programming Environment" page 458

    ~
The image is a cache. For a reproducible process, version and archive source-code.

1984 "Smalltalk-80 The Interactive Programming Environment" page 500

"At the outset of a project involving two or more programmers: Do assign a member of the team to be the version manager. … The responsibilities of the version manager consist of collecting and cataloging code files submitted by all members of the team, periodically building a new system image incorporating all submitted code files, and releasing the image for use by the team. The version manager stores the current release and all code files for that release in a central place, allowing team members read access, and disallowing write access for anyone except the version manager."


I've never heard of anybody doing it, but in theory it could work.

SBCL (and maybe others) uses a "core image" to bootstrap at startup. It's not unheard of for people to build a custom core image with the packages they use a lot from the REPL. It's become less common as computers have gotten faster, and most people use systems like Quicklisp or Roswell to automatically get updates and load from source. Of course, the SBCL core image is generated from the compiler source code when building it, and the dependencies are loaded and compiled from source initially too, so there are still going to be source code files around.

You could, in theory, start with the compiled SBCL image, exclusively type code into the REPL, save the image and exit, and then restart with the new image and continue adding code via the REPL. I really doubt anybody uses that workflow exclusively, though. At the very least, most people will eventually save the code they entered in the REPL into a source file once they've debugged it and got it working.
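For the curious, a minimal sketch of that workflow with SBCL (`greet` is a made-up example function; the core filename is arbitrary):

```lisp
;; At the REPL: define things interactively...
(defun greet (name)
  (format nil "Hello, ~a!" name))

;; ...then snapshot the entire running image to disk and exit:
(sb-ext:save-lisp-and-die "my-image.core")

;; Later, resume from the snapshot; GREET is already defined:
;;   $ sbcl --core my-image.core
;;   * (greet "world")   ; => "Hello, world!"
```

`save-lisp-and-die` serializes the whole heap, so everything defined in that session — functions, data, loaded systems — survives the restart.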


Ironically JIT caches are nothing other than core images.

Several JVM implementations have them, as does their ART cousin, and .NET too; apparently node.js is getting one as well.


I imagine some systems may start out as tinkering with definitions in the REPL in the live system, and then as the system grows, its best definition is found in the current state of the REPL rather than in any more formal specification — including source code.

At some point maybe the system state will be captured into source code for longer term maintenance, but I can totally see the source code being secondary to the current state of the system during exploration.

After all, that's how I tend to treat SQL databases early on. The schema evolves in the live server, and only later do I dump it into a schema creation script and start using migrations to change it.
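A rough sketch of that workflow with sqlite (hypothetical table names; `iterdump` serializes the live database back into SQL):

```python
import sqlite3

# Evolve the schema "live", as on a dev server: create it, alter it later.
db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT)")
db.execute("ALTER TABLE users ADD COLUMN email TEXT")  # an afterthought

# Only once it has settled, dump the current state as a creation script.
schema = "\n".join(db.iterdump())
print(schema)
```

The dump reflects the final, evolved schema — the live state was the source of truth until the moment it was captured.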


> After all, that's how I tend to treat SQL databases early on.

Ah, that’s a very helpful analogy/parallel that didn’t occur to me. Thank you!


Besides all the other answers: this concept exists in modern IDEs like Eclipse, anything JetBrains, Netbeans, Visual Studio,...

Even though they appear to be file-based, the plugin APIs make use of a virtual filesystem that allows managing the code as if it were image-based, in the spirit of Smalltalk, Lisp and other systems like Cedar, Mesa, Oberon,....

Also something that many don't think about, databases with stored procedures.


Maybe you understood image as in photo-image instead of image as in memory-image (like disk-image); a glorified memory dump, more-or-less.


I understood it as the latter.

