Awesome ! Does this use https://mage-bench.com/ , or is it a separate project? I ran 4 local models in a tournament recently with mage-bench on an RTX 5090 ; Qwen 3.6 27B won narrowly over Gemma 4 .
No, I was not aware of that project when I made this.
I'll have to look into that project, but I also have an RTX 5090 and did a lot of testing with Qwen3.6 27B and Gemma 4 31B. I was not able to get it to play legal turns consistently. I had to keep expanding the system prompt and adding rules for edge cases. By the end, the prompt was over 10k tokens, and while it mostly make legal turns, it did not make good turns. And all the heuristics in the prompt degraded the performance and increased the cost for frontier models.
If you push a change, or you approve, you're responsible for the change and its effects later. Regardless of size. If change is too big, tell your teammates its too big to review and to refactor to bite-size with their great coding agents. Use AI models also for review of large changes, consider a checklist . Setup CI and integration tests (also can be AI assisted)
I think this is bad strategic precedent for the USA, to save few bucks now. I believe in the Jones Acts original purpose, to protect the US Merchant Marine , for that industry's necessity during wartime
My workplace uses Ada SPARK for high-integrity automotive software, ported from C or C++.
IIUC, the contracts and static proofs can replace some activities like isolated unit tests for C++ .
I think the water use arguments are relevant, particularly in regions of the world and US (CA) where potable water is scarce, but land and electricity are available .
>> This can be achieved through air cooling using water evaporation, which is an open-loop and more water-intensive method, or through server liquid cooling.
The reported case about water wells running dry had to do with issues in construction rather than anything about the data center's regular operation:
> But the reason their taps ran dry (which the article itself says) was entirely because of sediment buildup in groundwater from construction. It had nothing to do with the data center’s normal operations (it hadn’t begun operating yet, and doesn’t even draw from local groundwater). The residents were wronged by Meta here and deserve compensation, but this is not an example of a data center’s water demand harming a local population.
Fair but in this case works well because it aptly demonstrates the futility of trying to change a system "from the inside" away from its core designation.
but it does not seem visible to the naked eye when looking at jewelry... I guess with more lab made diamonds we will come to appreciate the geological beauty of it, and have bigger diamonds for the same price.
Article mentions the climate and growing conditions for citrus are specific so there were few locations where citrus was produced that were integrated to world trade; mild temperatures above freezing, irrigation, and rich soil.
The article mentions large lemon exports from Sicily all the way to New York; and California wasn't a united state or developed market producer of fruit at this time.
reply