More

j-bos · 2026-04-23T20:23:57 1776975837

Thank you kindly.

j-bos · 2026-04-23T17:59:39 1776967179

But why are they only using one sensor?

samrus · 2026-04-23T18:00:52 1776967252

Because we used to be a high trust society where degenrate gamblers wouldnt mess with scientific equipment to rip each other off

j-bos · 2026-04-23T19:31:02 1776972662

I hear what you're saying man, but honestly it's a sensor. Sensors can fail all the time even without deliberate tampering. It doesn't seem to really make sense to have a single one in a single location. Besides for clarity, my question was more why was the market referenced on a single sensor rather than on multiple sensors?

pavl- · 2026-04-23T18:30:20 1776969020

lol, the OP is such a classic HN take. "Why doesn't society simply absorb a negative externality created by gambling and add high cost redundancy layers that are otherwise useless" - Not every problem is technical.

MattGaiser · 2026-04-23T18:41:45 1776969705

To be more generous, it is the action that can be taken unilaterally.

Trying to eliminate gambling has vexed many an emperor and cleric.

slackfan · 2026-04-23T18:28:52 1776968932

Counterpoint, why are we measuring temperature on what is basically a giant radiator?

dymk · 2026-04-23T18:29:46 1776968986

Because that wasn't what caused the problem? The reason it didn't work was some asshole intentionally tampered with the equipment, not because of where it was.

slackfan · 2026-04-23T22:11:06 1776982266

Counterpoint - that thermometer is used for important temperature & climate data.

gorjusborg · 2026-04-23T18:39:40 1776969580

Counter-counterpoint, why are people wagering on an airport thermometer measurement?

wcfields · 2026-04-23T19:22:24 1776972144

It's a catch-22: I already gambled the coin I was going to use for the heads-or-tails bet.

jrm4 · 2026-04-23T20:34:41 1776976481

I'm sorry, what is this "high trust" society of which you speak?

belorn · 2026-04-23T18:20:04 1776968404

I am a bit surprised by that for a scientific application where you want high accuracy reading. Temperature sensors has a error margin, and I think they can also drift a bit.

In a bit of expensive equipment I own it happens to have 4 temperature and pressure sensors, located in two different locations on the unit. All of them generally disagree with each other on the exact temperature by around 0.1 C, which is fine for my use of it.

Analemma_ · 2026-04-23T18:03:19 1776967399

Why didn't you put two locks on your door? Clearly you deserved to get burgled if you only used one.

Building Byzantine fault tolerance into absolutely everything is expensive, and makes everything we do and buy more expensive. It would be better and cheaper to rely on social trust, if that's possible-- and it was possible before these gambling sites. Prediction markets are burning social trust as fuel to make profit, they should be heavily taxed, the way polluters are taxed, as destroyers of the commons.

jgeada · 2026-04-23T18:11:46 1776967906

Prediction markets and all the market manipulation are the symptom, not the cause. Our society used to have real consequences for breaching public trust, but with our mere decades old "money is speech" legal system, there have not been any consequences for moneyed interest in quite a while. And as long as there are no consequences, they keep trying more and more egregious violations of public trust to establish where the new red line (if any) actually are.

pocksuppet · 2026-04-23T18:47:12 1776970032

Money is speech - but speech isn't speech. What's the latest thing a US citizen said in the US that got them arrested? They said in a private WhatsApp group that Benjamin Netanyahu should come and bomb their school to get them out of an exam. Benjamin Netanyahu was not in the group chat, but they got arrested anyway.

sixothree · 2026-04-23T18:47:58 1776970078

It's not like there aren't literally hundreds of sensors across the city.

dessimus · 2026-04-23T18:28:48 1776968928

>Why didn't you put two locks on your door? Clearly you deserved to get burgled if you only used one.

I can reasonably test that the bolt is in a locked position every time I close the door and turn handle on the lock. But a single remote sensor could have malfunctioned or simply be out of calibration a few degrees.

If the actual temperature at the airport is important to any set of users enough so that the difference between it being 18 or 22 deg C is relevant, one should expect that there be at least 3 sensors (much like clocks) and assuming variances between the 3 sensors are within tolerance an average of the 3 temperatures is taken.

drtz · 2026-04-23T18:56:49 1776970609

> If the actual temperature at the airport is important to any set of users enough so that the difference between it being 18 or 22 deg C is relevant, one should expect that there be at least 3 sensors

Three sensors doesn't solve the problem. Manipulating becomes marginally more difficult with three sensors, but it's still very possible, and with enough monetary incentive it's still even likely that it happens again.

So why not 5 sensors? How about 10?

And what about more consequential issues like the toppling of governments or military blockades where true redundancy is impossible and people are actually harmed?

Is there not a point where you start to blame the incentives that are being chased?

dzhiurgis · 2026-04-24T08:09:15 1777018155

Probably would make sense for polymarket, but otherwise you get things like “mothers on average have 1.9 babies”

j-bos · 2026-04-22T19:42:54 1776886974

"based entirely on feels"

Now there's a word I haven't heard in a long, long time.

j-bos · 2026-04-20T20:15:37 1776716137

Seems like a great challenge for all these systems, see fromtier labs serving quants when under hesvy load.

j-bos · 2026-04-20T16:38:48 1776703128

This. I see much cheap naysaying without referenece to the vuln hashes. If it is smoke and mirrors, then the naysayers should loudly shout down the specific hashes and when they get revealed, or don't, then they will have done a great service to dissuading fake claims to world changing tech.

j-bos · 2026-04-20T01:51:41 1776649901

Agreed. Sprawling system prompts like that are building for the least common denominator, nerfing for anyone or anytime going further.

stingraycharles · 2026-04-20T02:06:00 1776650760

You do realize that similar biases are also present in the training data?

j-bos · 2026-04-20T13:18:36 1776691116

I do, inevitable, but ime the prompts force certain behaviors at similar strength (instruction following). So it's one thing that the model is biased towards any particular direction by its latent space, it's another that it is biased by an immodifiable prompt which can only be contradicted for the benefit of the lcd at the expense of the more involved operator.

xpct · 2026-04-20T04:10:38 1776658238

Sure, but now we have to remodel whatever bias we want for our use case with every new release because the system prompt changes, whereas the underlying data does not.

stingraycharles · 2026-04-20T06:10:35 1776665435

Underlying data changes all the time, as do training methodologies / preferences.

You do realize that these LLMs are trained with a metric ton of synthetic examples? You describe the kind of examples / behavior you want, let it generate thousands of examples of this behavior (positive and negative), and you feed that to the training process.

So changing this type of data is cheap to change, and often not even stored (one LLM is generating examples while the other is training in real-time).

Here's a decent collection of papers on the topic: https://github.com/pengr/LLM-Synthetic-Data

xpct · 2026-04-20T12:03:16 1776686596

Well, I'd say it's a reasonable expectation for the model to behave similarly across releases. Am I wrong to assume that?

I imagine the system prompt can correct some training artifacts and drive abnormal behavior to the mean in the dimensions that Anthropic deems fit. So it's either that they are responding to their brittle training process, or that they chose this direction deliberately for a different reason.

j-bos · 2026-04-19T01:03:36 1776560616

Yeah for anyone seriously using these models I highly reccomend reading the Mythos system card, esp the sections on analyzing it's internal non verbalized states. Save a lot of head wall banging.

j-bos · 2026-04-17T17:39:14 1776447554

As sibling comments mentioned deceptive comparison as well. How about comparing in percentage of Gross Energy Output. https://www.sciencedirect.com/science/article/abs/pii/S09218...

j-bos · 2026-04-15T18:40:08 1776278408

What's your setup/usecase? Enhanced intellisense?

j-bos · 2026-04-12T21:09:54 1776028194

I guess they don't like Monika.

kulahan · 2026-04-13T03:04:06 1776049446

Just Monika?