More

goodroot · 2026-06-08T21:41:51 1780954911

Starting making hyprwhspr because no other stt library was quite there for performance and model availability.

After that I started writing opub.dev because even minimal success in recent oss showed me just how much has changed, and I’m worried about how expensive everything will get for maintainers.

So, now I’m trying to GIVE people compute so they can start building a helpful filter layer above their projects.

goodroot · 2026-05-26T22:28:05 1779834485

Yikes. Good thing you didn't wind up there.

The furthest I've gone in these jazz style culture interviews is asking people what they do outside of work for fun. This was for fully remote async positions. And it was important to know you had other stuff going on because the mental/personal health risk in failing at remote work is massive and life altering.

If, through wherever that discussion went, I wasn't 100% sure that you could stand on your own feet and wouldn't sink into the abyss, it was impossible to move forward. It was a tough line to walk sometimes because you don't want to pry personally. But that doesn't appear to be a universal opinion, it turns out.

nicbou · 2026-05-26T22:32:45 1779834765

That question would not be received well in many places. What candidates do in their private time is none of your business.

nomel · 2026-05-26T22:51:48 1779835908

Not sure why this is downvoted.

Even if I wanted to, these questions aren't allowed in the company I work for, along with feedback related to "team fit". This is dictated by execs, dictated by legal, because it has nothing to do with proving competence, and opens up for employment discrimination lawsuits since you're persuading them (you have to understand the power dynamic) to reveal potentially protected info. For example, if a man say "Oh, I go hiking with my boyfriend!", he could also say "They didn't hire me because I told them I was gay!". Or, even "I spend time with my kids." since familial status is a legally protected class where I am.

As a person who does interviews, I have exactly zero interest in what people do for fun. I just want competent people that are nice to work with (in a productivity sense), and I only have 45 minutes to prove that, knowing that nearly everyone fucking lies. I see it serving no purpose other than helping enforce some monoculture within the group, because, genuinely, why else would you ask about free time activities during an interview?

Related, the only time I've asked this was early on when I didn't know how to interview. The only time I've been asked this, and answered, was with people who had just started interviewing (small startups and new hiring managers).

ryandrake · 2026-05-27T03:08:03 1779851283

Great comment. It's really shocking how close to the legal line Silicon Valley tech companies get, and the extent to which many of them actually cross way over the line. A huge number of interviewers I've encountered are in extreme need of training so they don't so casually put their companies at legal risk. If I was Lawful Evil, I could probably make a career out of just suing companies for discriminatory hiring practices, due to the various landmines poorly trained interviewers routinely step into.

BigTech seems to be the best at it. They tend to have rigorous training, and often have a "safe question bank" that interviewers pull questions from, which are all vetted by lawyers and are known not to put the company at legal risk.

pjsmith404 · 2026-05-26T22:39:29 1779835169

I think that's the best you can do for culture fit, cause at the end of the day it's just "can they shoot the shit and are they pleasant to be around". You can't really know a person technically or socially until they've been in the job for at least a little bit though.

dnnddidiej · 2026-05-27T06:23:41 1779863021

Hmmm. Maybe overindexing on anecdata. Did that one guy go a bit crazy once?

I think you gotta trust adults to be adults.

goodroot · 2026-04-06T21:41:33 1775511693

Whisper is very good in many languages.

It's also in many flavours, from tiny to turbo, and so can fit many system profiles.

That's what makes it unique and hard to replace.

goodroot · 2026-04-06T20:08:38 1775506118

Nice one! For Linux folks, I developed https://github.com/goodroot/hyprwhspr.

On Linux, there's access to the latest Cohere Transcribe model and it works very, very well. Requires a GPU though. Larger local models generally shouldn't require a subordinate model for clean up.

Have you compared WhisperKit to faster-whisper or similar? You might be able to run turbov3 successfully and negate the need for cleanup.

Incidentally, waiting for Apple to blow this all up with native STT any day now. :)

VorpalWay · 2026-04-06T22:12:04 1775513524

How does it compare to the more well established https://github.com/cjpais/handy? Are there any stand out features (for either option)? What was the reason for writing your own rather than using or improving existing software?

goodroot · 2026-04-06T22:17:00 1775513820

Not sure I know what you mean by IR...

But in this case I built hyprwhspr for Linux (Arch at first).

The goal was (is) the absolute best performance, in both accuracy & speed.

Python, via CUDA, on a NVIDIA GPU, is where that exists.

For example:

The #1 model on the ASR (automatic speech recognition) hugging face board is Cohere Transcribe and it is not yet 2 weeks old.

The ecosystem choices allowed me to hook it up in a night.

Other hardware types also work great on Linux due to its adaptability.

In short, the local stt peak is Linux/Wayland.

VorpalWay · 2026-04-06T22:23:11 1775514191

IR was a typo, meant "it" (fixed it). I blame the phone keyboard plus insufficient proof reading on my part.

If this needs nvidia CPU acceleration for good performance it is not useful to me, I have Intel graphics and handy works fine.

goodroot · 2026-04-06T22:28:59 1775514539

It works well with anything. :)

That said: If handy works, no need whatsoever to change.

LuxBennu · 2026-04-06T20:48:34 1775508514

I've been running whisper large-v3 on an m2 max through a self-hosted endpoint and honestly the accuracy is good enough that i stopped bothering with cleanup models. The bigger annoyance for me was latency on longer chunks, like anything over 30 seconds starts feeling sluggish even with metal acceleration. Haven't tried whisperkit specifically but curious how it handles longer audio compared to the full model.

goodroot · 2026-04-06T21:06:45 1775509605

Ah yeah, longform is interesting.

Not sure how you're running it, via whichever "app thing", but...

On resource limited machines: "Continuous recording" mode outputs when silence is detected via a configurable threshold.

This outputs as you speak in more reasonable chunks; in aggregate "the same output" just chunked efficiently.

Maybe you can try hackin' that up?

LuxBennu · 2026-04-06T21:47:35 1775512055

Yeah that makes sense, chunking on silence would sidestep the latency issue pretty cleanly. I've been running it through a basic fastapi wrapper so it just takes whatever audio blob gets thrown at it, no chunking logic on the server side. Might be worth adding a vad pass before sending to whisper though, would cut down on processing dead air too.

znagengast · 2026-04-07T17:19:29 1775582369

Maintainer of WhisperKit here, confirming we do exactly that for longform. We search for the longest "low energy" silence in the second half of the audio window and set the chunking point to the middle of that silence. It uses a version of the webrtc vad algorithm, and significantly speeds up longform because we can run a large amount of concurrent inference requests through CoreML's async prediction api. Whisper is also pretty smart with silent portions since the encoder will tell it if there are any words at all in the chunk, and simply stop predicting tokens after the prefill step - although you could save the ~100ms encoder run entirely with a good vad model, which our recently opensourced pyannote CoreML pipeline can do.

LuxBennu · 2026-04-07T22:36:30 1775601390

Oh nice, the pyannote coreml port is interesting. Last time I looked at pyannote it was pytorch only so getting it to run efficiently on apple silicon was kind of a pain. Does the coreml version handle diarization or just activity detection?

ericd · 2026-04-07T04:12:58 1775535178

Nice, I've been using Hyprwhspr on Omarchy daily for a while now, it's been awesome, thanks very much.

goodroot · 2026-04-07T17:41:29 1775583689

Thanks ericd! Glad to hear.

hephaes7us · 2026-04-06T20:27:26 1775507246

Thanks for sharing! I was literally getting ready to build, essentially, this. Now it looks like I don't have to!

Have you ever considered using a foot-pedal for PTT?

Apple incidentally already has native STT, but for some reason they just don't use a decent model yet.

goodroot · 2026-04-06T20:39:13 1775507953

They do, and they even have that nice microphone F5 key for it, and an ideal OS level API making the input experience >perfect<.

Apparently they do have a better model, they just haven't exposed it in their own OS yet!

https://developer.apple.com/documentation/speech/bringing-ad...

Wonder what's the hold up...

For footpedal:

Yes, conceptually it’s just another evdev-trigger source, assuming the pedal exposes usable key/button events.

Otherwise we’d bridge it into the existing external control interface. Either way, hooks are there. :)

jiehong · 2026-04-06T21:18:27 1775510307

The only issue with Apple models is that they do not detect languages automatically, nor switch if you do between sentences.

Parakeet does both just fine.

chrisweekly · 2026-04-06T21:46:28 1775511988

sorry, PTT?

serf · 2026-04-06T21:49:51 1775512191

push-to-talk.

chrisweekly · 2026-04-07T02:49:43 1775530183

pmarreck · 2026-04-07T00:08:04 1775520484

looks like there's a nearly identically named one for Hyprland

Also, wish it was on nixpkgs, where at least it will be almost guaranteed to build forever =)

goodroot · 2026-03-16T16:00:09 1773676809

Mark Carney's book "Values" pitches a system such as this.

In better times, perhaps we have the collective will to try.

goodroot · 2026-03-16T00:30:35 1773621035

Source? Rationale?

This is - at best - ignorant hyperbole.

goodroot · 2026-01-14T01:33:57 1768354437

The QuestDB team are among the best doing it.

Love the people and their software.

Great blog Jaromir!

goodroot · on Dec 14, 2023

The same is true for database rankings (db-engines).

If entrants are not artificially inflating "organic" signals via fake content spam (Twitter/X), then the criteria themselves are losing their signal strength (StackOverflow/GitHub).

The diffusion makes it increasingly difficult to understand which channels are important and which correlate to strength in the market.

Unfortunately, these can be more than vanity metrics.

Some VCs or financial markets may use these as methods towards valuation.

goodroot · on Nov 21, 2023

Hey! Thanks for upvoting.

Happy to answer any questions about deduplication. One thing that's not included in the write-up is that we also address out-of-order indexing alongside deduplication.

CommanderHux · on Nov 21, 2023

The dataset link seems to be dead. Do you have a mirror?

goodroot · on Nov 21, 2023

Edit: Updated!

https://mega.nz/folder/A1BjnSYQ#NQe5qhYLVBqiRwhWRmcVtg

Article is updating too.

CommanderHux · on Nov 22, 2023

Thanks. Is Dedup supported on SQL COPY too?

nhourcard · on Nov 22, 2023

Not for CSV import via SQL COPY sadly

goodroot · on Nov 6, 2023

Gosh that's sad.

Zoomers are in this very forum - hi Zoomers.

Hang in there.

Whatever precipitating causes led to such suffering, know that we're _here_, _now_, together.

You aren't as alone as it might seem.

And hey, try to relax a little. We'll figure it out.

_oghd · on Nov 6, 2023

i think part of the problem is these kind of messages are alienating exactly because they appear on screen. the meat-space sentiments rarely match the "thoughts and prayers" type online speech-acts, or at least, they are basically never extended as readily.

alx_the_new_guy · on Nov 6, 2023

Nothing personal (I mean, seriously, nothing personal)

Little (probably hard) advice for if/when you're going to say something like that to a zoomer irl (based on personal experience from the receiving end):

The "you aren't as alone as it might seem" gets the "what you're saying is just factually incorrect and what you're trying to do is to bullshit me and maybe possibly yourself" thing going. I have never heard something like that from a person "in the weeds".

Same for "We'll figure it out". How much time have you personally spent "figuring it out" and how much time have you spent playing hot potato with the problem? How important is it compared to your own problems? I guess, not very, so there is no "us" figuring it out.

Basically, don't be a disingenuous dense motherfucker and don't bullshit other people and yourself. Not saying you personally are doing it, but there are definitely more people that do, than that don't.

goodroot · on Nov 6, 2023

For clarity, this response is personal.

Attitude is a potion or a poison.

Make the choice.

Want demons? You'll find them.

Want help? You'll find it.

Many, many people have spent time figuring it out.

Many, MANY people have went into professions or made life style choices to help.

The will to overcome your own narcissism and self pity are key to any healing.

gedy · on Nov 6, 2023

> Whatever precipitating causes led to such suffering, know that we're _here_, _now_, together.

The article comments on this though:

"All the things that have traditionally made life worth living — love, community, country, faith, work, and family — have been “debunked.”

This is absolutely true and no wonder young folks are feeling down. I think the counter-culture types starting 50+ years ago wanted to tear down the old, but forgot to put something constructive in its place. (Well the leftist/Marxist types tried, but then the USSR imploded)

goodroot · on Nov 6, 2023

It's the Internet: All the "debunkings" have also been "debunked".

From the article...

Monogamy is the corollary for the debunking of love.

That's very silly. Love is much more than marriage.

"Church" foibles is the corollary for the debunking of Faith.

That's also very silly. Faith is much more than church or religion.

In other times and other cultures, spiritual insight removed the roots of suffering.