Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

True, but that's actually because the implementation in products is worse (older) in the giants, compared to the state of the art. Why ? I don't know, but I suspect there's a lot of reasons:

1) they actually have an older implementation. Not many people have working linguistic voice recognition, as that's really, really hard, and you're just not going to make one of those without a 100-strong multidisciplinary team. There's no real business need to replace it (yet).

2) once you have a multidisciplinary team, the multidisciplinary aspect of it is the main reason for the size of the org. Moving to a pure-AI solution would mean 80% or so of the department would become useless and unable to contribute.

3) I bet the linguistic model looks a whole lot more comfortable to the executives than a pure AI algorithm. After all, a linguistic model will not make "hidden mistakes" (mistakes that the system makes but have never been programmed in). And let's not forget that the last few hidden mistake in a highly public model was confusing African Americans with, shall we say, animal byproducts. Needless to say, this was NOT good PR-wise.

(by the way you should try to have a realistic "let's do this with AI" talk with a senior manager, and you'll see what I mean. "Can you guarantee it won't make mistakes ?", "Nope. In fact I will pretty much guarantee mistakes. It's like a person. It might purposefully make mistakes in the sense that it causes a disaster in one area because it improves it's metrics". "Okay. Can you at least tell me why it made decisions ?", "No. Impossible. Also: please don't believe any AI researcher claiming otherwise". You're asking extremely risk-averse people to take a big leap)

4) Career-opposition. In the large orgs, the senior engineers have their senior position because they improved step 57 of algorithm 21 by 5%. Making proposals to replace everything after step 3 with an end-to-end model ... they will "politely sabotage" it. (e.g. demand guarantees that it won't make mistakes. Request papers proving that it outperforms, not just the function, but every individual step. Demand they illustrate that thousands of slight mistakes won't ever happen, ...)

(you know, like factory workers demanding robot features BECAUSE they figured out that they're ridiculously hard. They have no use for the factory. E.g. demands that a robot responds intelligently to a human walking by ... on a floor where no humans are allowed if any machinery is running. Or demand physical separation of robot action radii, when the software supports those robots working together and this is in fact used in said production line. Makes no sense whatsoever. What are they doing ? They think they're defending their jobs)

I've seen several startup products based on Deepmind's Wavenet, but Microsoft and Google's voice recognition have both said that theirs is based on a linguistic accoustic model. You know, the huge, very very complex, dozens of different components, each their own specialization.

So the opposite of what you'd think is actually what's happening. The organizations doing the cutting edge research are not ahead of the curve, they're behind, far behind, and falling more day by day.

Or to put it another way, you want to see innovation happen ? Find companies that would be dead and bankrupt without that innovation. Google and Microsoft, those are not it. Even facebook is better.



Can you cite some sources on the voice recognition part? I seem to remember that Google(https://research.googleblog.com/2015/08/the-neural-networks-...), and Baidu(http://research.baidu.com/deep-speech-3%EF%BC%9Aexploring-ne...) both use neural networks for speech transcription.

I don't really believe that the implementations out of the startups are meaningfully/significantly better than the implementations out of Google/Facebook. If they were seriously better they'd be acquired (as we've seen over and over again).




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: