More

kvam · 2026-05-25T20:40:32 1779741632

As a Norwegian this sounds like a mistake. Who will use this LLM? Where? For what? The underlying data could be made more easily searchable and digestible for agents in general if the goal is better knowledge of Norwegian culture.

vidarh · 2026-05-25T21:47:33 1779745653

I agree in principle.

That said, they are quite limited in what they are allowed to share of in-copyright works, and nb.no is a fantastic resource as it is (though you'll need a Norwegian IP address for too much of it - it's one of th main reasons I maintain a VPN) - if they are allowed to make it accessible there, it'd be great.

But they also have vast amounts of out-of-copyright data that I hope they'd make more easily accessible...

dalemhurley · 2026-05-25T20:46:27 1779741987

Hard disagree. This is the first step not the last and proves to other countries that this can be done.

xadhominemx · 2026-05-26T03:24:48 1779765888

This model is going to start miles behind the frontier and the gap will only grow.

weregiraffe · 2026-05-26T04:34:50 1779770090

Why would the gap grow? There is no more training data to acquire, frontier model are training on the entire internet. Everything from now on is just fine-tuning.

kvam · 2026-05-26T06:30:02 1779777002

Your statement assumes training data is the only thing that matters for the big players, while not considering it limiting for the small Norwegian model. That’s a fallacy.

weregiraffe · 2026-05-26T07:05:41 1779779141

Nowhere in the article does it say the Norwegian LLM will train _only_ on Norwegian data.

xadhominemx · 2026-05-28T01:03:02 1779930182

Orders of magnitude less compute for pre- and post-training than the the frontier labs.

spwa4 · 2026-05-25T20:45:33 1779741933

Exactly, if there's one thing transformers are good at it's translation. One I've found particularly nice: any question ChatGPT can answer in English it can answer in French. I'm assuming Norwegian too. So there's no point.

sgt · 2026-05-25T20:49:19 1779742159

There's quite a bit more to culture and language than just being able to have transformers come up with believable language and/or dialect.

sisve · 2026-05-25T20:56:03 1779742563

The point is that norway willl have its own LLM. And will not have dependencies to another state or private company. The goal is not to be the best model. But to have a model that include more Norwegian data then other LLM and that it's not screwed against other sources.

kvam · 2026-05-26T06:32:30 1779777150

But what does that give you? If the model is far less capable? What will it do for you with that Norwegian data, that a better model could not do with better search or context?

sisve · 2026-05-26T10:48:06 1779792486

A model has many dimensions. You can't have them on one scale from good to bad. The model will most likely be poor at coding. But will give better answers about Norwegian cultur. I assume the tone of voice will be (by default) much closer to how Norwegians talk and write then what we current see from model from the US. They seem to be a bit to much.. Norwegian people are a bit more down to earth

fastasucan · 2026-05-27T20:25:14 1779913514

Other better models cant use the training data this model have...

dalemhurley · 2026-05-25T21:46:14 1779745574

Yes transformers are great at translation as that is their purpose.

LLMs are not great at preserving cultural uniqueness and diversity. Take how “delve” has reentered the lexicon because the human assessors for pre training dialect of English uses “delve” a lot.

There is a lot of benefits to training specifically for a unique culture with unique norms to preserve the culture as we increasingly rely on LLMs.

https://www.scientificamerican.com/article/chatgpt-is-changi...

fastasucan · 2026-05-27T20:24:07 1779913447

There is a lot more to it than literal translations. Even if an american can talk Norwegian it doesn't mean that they get the cultural context right.

"Oh yeah after you drive 2 hours home from work your wife and kid will greet you with some delicious pie" doesnt work so well even if its in Norwegian.

dzhiurgis · 2026-05-25T21:53:33 1779746013

Model can speak Lithuanian too, but with a Russian accent which is a big taboo for us.

spwa4 · 2026-05-26T12:47:12 1779799632

I wonder, can you ask it not to do that first, and check if that still happens?

otabdeveloper4 · 2026-05-25T20:51:41 1779742301

They're only good at it because they were trained on massive amounts of English and French data.

vidarh · 2026-05-25T21:54:00 1779746040

Not really true.

Both Claude and ChatGPT can translate into minor dialects of Norwegian they will have seen very few works in because very few printed works exist in them.

E.g. I've tested both my local spoken dialect, which is rarely written, and a sociolect used by a 1970's Maoist group consiting of a few hundred people, where most of the printed material consists of novels from a couple of ex-members that became authors.

In the latter case, it claimed to not know, but was able to get a good match from just a description.

I also just had it ape Norwegian orthography from the 1910's by having it look up the rules and translate a text it had first translated from English to modern Norwegian, and it did just fine.

They will have seem some work in these dialects, but mostly it transfer really well to know related languages (English, Dutch, German, Swedish, Danish, roughly form a continuum from least in common to most in common with modern Norwegian; they all share vocabulary and significant parts of grammar with Norwegian), and then a relatively limited exposure to Norwegian itself is sufficient to do fairly well.

They're also really good at "style transfer" of text in the form of tweaking orthography, word order, and minor grammar changes from descriptions and examples.

(incidentally, the latter is one way of getting an LLM to sound a lot less like an LLM)

otabdeveloper4 · 2026-05-26T05:23:31 1779773011

This is all true, but I assumed the original posters were talking about cultural knowledge, not linguistic correspondences.

To do translation well you still need cultural knowledge. (E.g. the particular modes of specific kinds of legalese, or slang and the nuances of social class, etc)

vintermann · 2026-05-26T05:49:46 1779774586

I think it's not that this knowledge isn't present in the model somewhere, but probably more that it gets killed by instruction tuning for US corporate values.

vintermann · 2026-05-26T05:46:42 1779774402

Definitively not the case in distilled models.

kvam · 2026-05-05T15:32:20 1777995140

Riff is Databutton. They rebranded.

kvam · 2025-12-16T06:52:55 1765867975

Some people have mentioned that this is a U.S incorporated company (Delaware). Recommend reading Moneyland by Oliver Bullough if you want to know more about the U.S role as the new shell company haven.

The island states have been dethroned.

kvam · 2025-12-05T09:10:41 1764925841

Infrastructure-as-Vibe?

kvam · 2025-11-19T20:47:09 1763585229

What are the alternatives for aspiring tinkerers now?

My wife (cybernetics engineer) and I are buying a 3D printer and planned getting an Arduino as an entry point. What should we do instead? What are the best communities and resources?

whynotmaybe · 2025-11-19T21:03:43 1763586223

ESP32.

I'm using ESP32 with platformio which has a dedicated community https://community.platformio.org/tag/espressif32

I've used devkit from M5stack, waveshare and adafruit.

(M5Stack has a full line of products for tinkering with many sensors & controllers)

You can also find many cheaper no-brand devkit anywhere but quality & docs can be unreliable.

bschwindHN · 2025-11-20T06:30:51 1763620251

* RP2040 / RP2350 - If you don't need connectivity, this is a great chip for flexible IO. Good software support, easy to use, well documented.

* ESP32 - Good community support, bluetooth and wifi connectivity, some powerful variants as well for driving screens and other things.

* STM32 - Widely used, and an absolute boatload of chip variants for different tasks, from small little GPIO twiddling cores, to beefy chips running DSPs and outputting high-res images to displays.

* nRF52840 (and other variants) - Good for bluetooth devices, should be lower power than ESP32.

My recommendation would be to buy something like a Xiao RP2040:

https://www.seeedstudio.com/XIAO-RP2040-v1-0-p-5026.html

They're cheap, have USB-C, and are super easy to use. Oh, and they have a reset button which for some reason, the official pico board does not. On top of that, the official pico board uses micro USB, so overall I would recommend NOT buying them, they're annoying to work with. The Xiao boards don't have a ton of IO pins, but they're at least good for learning and if you determine you need more IO you can move to a different dev board, or design your own PCB.

radeeyate · 2025-11-19T20:49:10 1763585350

I first got into Raspberry Pi Picos, but I've also been experimenting with Esp32's and some of the nRF chips. I mostly do CircuitPython on them but Arduino is a supported platform on those I believe.

swsieber · 2025-11-19T21:05:50 1763586350

I got a couple of RP2040 boards recently and I'm amazed at how easy it is to just get stuff done. Between the native usb support and the circuit python support it's been a breeze. I just got a couple of boards up and running uart in a daisy chain. It was intimidating, but the circuitpython docs made it relatively simple.

doph · 2025-11-19T21:16:46 1763587006

ESP32 - quite a range of dev boards and places like Seeed and Adafruit have a nice selection of accessories. Adafruit develops CircuitPython which is IMO the lowest barrier to entry for programming MCUs. Adafruit even has CircuitPython sketches on their site for how to interface with the components they sell.

Rust on ESP32 is still a bit early - the HAL crate is still pretty unstable, but the toolchain is quite nice and I'm able to be productive enough that I never reach for C or C++.

rramadass · 2025-11-20T05:39:53 1763617193

You are on the right track i.e. stay with an Arduino in the beginning. Note that "Arduino" is a family of boards with different MCUs but all providing a common API (mostly). So you choose the Board/MCU combination best suited for your system and can always move to something else later after you have gotten some experience. The reason is the Arduino Ecosystem. There are thousands of free tutorials, designs, libraries etc. all available for you to try out for your app and more often than not you can have your PoC/MVP by just plugging in some libraries and writing some glue code. You only have to learn the Arduino API and not any specific MCU's datasheets unless and until you are doing something more lower level. It is all way easier.

The Arduino Cloud offering (runs on AWS) makes integrating your Arduino-based system into an end-to-end SaaS app simple (just watch and follow some tutorials on Youtube). There is also the Arduino PRO series of hardware for you if and when you want industrial-grade hardware for demanding systems/environments.

If the Qualcomm c-suites have half a brain amongst themselves they will not kill the goose that lays the golden eggs.

chasd00 · 2025-11-19T22:27:24 1763591244

The feather series of boards from Adafruit + Curcuit/Micropython works really well if you just want to make stuff happen instead of tuning a toolchain and, like, setting up clocks with asm.

giobox · 2025-11-19T22:10:21 1763590221

Everyone I know who is into tinkering with microcontrollers moved onto ESP32 a long time ago now. I actually thought this headline was going to link to an article about ESP32's popularity. VSCode with the PlatformIO extension has been great for me when working with them:

https://platformio.org/

euroderf · 2025-11-20T11:57:49 1763639869

I'd like to use apps out there for model railroading - locomotive control and accessory automation, especially 3-rail. There is a LOT written for Arduino; I wonder if any other platforms come close. Someone mentioned some sort of Arduino emulation layer on top of ESP32.

skhameneh · 2025-11-19T22:06:54 1763590014

STM32 boards and PlatformIO.

ESP32 is quite popular (as seen by other suggestions) but I find the quality of Espressif, hardware/software/support, is widely varied.

FWIW PlatformIO works with Arduino and ESP32 (and will give you a better experience in so many ways)

jononor · 2025-11-20T08:30:59 1763627459

ESP32 or RP2 based boards with for example MicroPython/CircuitPython, or platform.io + VSCode. Though the good old Arduino IDE seems to be unaffected by this change though.

euroderf · 2025-11-24T18:27:16 1764008836

In this reply thread I see no mention of TinyGo or Gobot. Are they not so good in this space ?

kvam · 2025-11-20T13:40:12 1763646012

This is fantastic. Thank you all!

kvam · 2025-11-19T20:18:09 1763583489

Please blog and post about this. I need a how to.

ceejayoz · 2025-11-20T15:47:45 1763653665

If I complete the project! Haha.

kvam · 2025-11-19T07:10:13 1763536213

That might be precisely why OpenAI is pushing an over investment in infrastructure. When VCs are no longer willing to substitute compute, having more compute available than natural demand will drive the prices down.

kvam · 2025-11-07T18:19:15 1762539555

110k spiders, built from two separate species.

kvam · 2025-11-07T18:21:04 1762539664

More detailed article: https://7news.com.au/news/world/worlds-largest-spider-web-in...

kvam · 2025-10-29T18:53:07 1761763987

Agree! We saw this a lot. Launching with the Quest 3, we were often the first company to do X, Y, Z despite being months after new features had been released in the SDKs because they were poorly documented (and often even conflicting).

Diverging even slightly from the demo use case would quickly feel like Sisyphus; so close, but never succeeding in getting over the hill.

Good for marketing in certain cases (to be the first), but bad for the community of builders

kvam · 2025-10-04T08:34:21 1759566861

Why would cloudflare be the company to do this?

noir_lord · 2025-10-04T08:44:36 1759567476

Rent seeking probably.

Cloudflare sits in the middle of a vast amount of web traffic now, offering easy global payments and skimming off the top of that is going to be very profitable potentially.

I don't trust Cloudflare, the larger they get the bigger the abuse potential becomes.

deadbabe · 2025-10-04T16:36:11 1759595771

But of all the players, Cloudflare is the best one for this kind of thing.

rozenmd · 2025-10-04T08:48:09 1759567689

Curious: how do you feel about AWS, Azure, GCP?

willtemperley · 2025-10-04T09:45:30 1759571130

CF: No criminal convictions I know about. No reason to distrust yet. Most controversies seem to be about providing services to political organisations.

GCP: Earth Engine is quite good, but Google have multiple criminal convictions. As a repeat offender they should be avoided at all costs. They are just so exceptionally good at manipulating people, markets and academia it's genuinely terrifying.

Azure: Microsoft still don't take security seriously. They're just a bit bumbly, not really smart enough to be as terrifying as Google.

AWS: Pretty useful, annoying to use, distrust because I can't bear Amazon's use of dark patterns in consumer products.

noir_lord · 2025-10-04T08:51:46 1759567906

Same way I do about any large corporation, I don't trust them.

octo888 · 2025-10-04T08:50:27 1759567827

You work for Cloudflare, so is your comment more a "we're no different" statement than genuine curiosity about their opinion?

rozenmd · 2025-10-04T09:14:15 1759569255

Genuinely curious.

thrown-0825-1 · 2025-10-04T08:52:48 1759567968

You mean Amazon, Microsoft, and Alphabet?

acdha · 2025-10-04T16:18:35 1759594715

Their customers pay them to be the front door to their websites. Customers want a way to reduce the massive traffic from AI crawling, to block malicious traffic, and to be compensated for access.

That leaves Cloudflare well positioned to implement a pay-for-access check along with all of the existing bot services they offer. AI crawling goes from a threat to a win if I can serve OpenAI a demand to pay for each request. Bot abuse goes down if traffic isn’t free. Businesses like journalism become stable again if readers pay for content rather than relying on advertisers to subsidize it.

Cupprum · 2025-10-04T08:40:36 1759567236

At least its not facebook or microsoft.

But the better question would be, who should be the company (or entity) we should trust to do such a thing?

SXX · 2025-10-04T08:42:33 1759567353

MiTM is their business. Obviously mitming of financial transactions is the most profitable business of all.

thrown-0825-1 · 2025-10-04T18:10:27 1759601427

I dunno, running for president of the US and launching a crypto coin seems pretty profitable.

unglaublich · 2025-10-04T08:40:58 1759567258

Because they are establishing themselves as "the gatekeeper of the internet".

miohtama · 2025-10-04T15:59:12 1759593552

Because their customers want to monetize AI crawlers

jbverschoor · 2025-10-04T09:01:02 1759568462

Finally build an infrastructure for real micro transactions. First for AI agents to access paid content, then for consumer to access content.

thrown-0825-1 · 2025-10-04T08:49:56 1759567796

So they can directly monetize page views with their own token and fully embrace the role of internet gatekeeper / tax man.

Spivak · 2025-10-04T16:13:54 1759594434

But then why do a blockchain thing? Why not just make CloudflareBucks they centrally control? Surely that is much easier to monetize and cheaper to implement.

thrown-0825-1 · 2025-10-04T17:44:06 1759599846

Less regulation to get in the way of the blatant corruption.

spwa4 · 2025-10-04T09:51:44 1759571504

... which will then be immediately destroyed by law because it gives the actual tax man a single target, along with a money flow that comes from within the control of the tax man.

PLUS just imagine how many corrupt politicians will be tempted to force these payments to go through their company.

thrown-0825-1 · 2025-10-04T09:54:18 1759571658

rule of law no longer exists in the US and their large corporations are filling the gap.

h33t-l4x0r · 2025-10-04T08:51:56 1759567916

Their customers are hit the hardest by the shift away from google search to AI. They probably are the right company to try to help them monetize their content.

aiisthefiture · 2025-10-04T09:11:53 1759569113

Google search is used more now than ever…

ACCount37 · 2025-10-04T09:21:13 1759569673

What does the traffic from Google Search look like though?

I can't imagine all the low effort content farms that were providing things like dictionary definitions or ridiculously elongated ad-stuffed versions of kitchen recipes are doing too hot under the pressure from AI Overviews. And they can't be the only ones impacted.

mapmeld · 2025-10-04T08:48:21 1759567701

I agree this makes little sense for Cloudflare to jump on the crypto bandwagon now. Maybe they want to retain some talent by turning this into an official project.

Is the premise that it makes more sense for an AI agent to pay in prepurchased stablecoin tokens instead of direct access to a credit card?

rjh29 · 2025-10-04T08:50:15 1759567815

They are the moat between AI content crawlers and websites. They will probably start charging a fee and a stablecoin is a good way to do that globally.