
Last bit is key. They would’ve just set it to invisible.

Last bit is key. They would’ve just set it to invisible.

The text is public anyone can scrape it. They’ve already used all Reddit’s text in training gpt4.
Not sure what you mean, we are seeing results at an increasing pace if anything. A lot more complexity going into it than ‘increasing text/GPUs’ though.
https://arcprize.org/leaderboard
AlphaEvolve recently achieved what you are after.
We also applied AlphaEvolve to over 50 open problems in analysis , geometry , combinatorics and number theory , including the kissing number problem.
In 75% of cases, it rediscovered the best solution known so far.
In 20% of cases, it improved upon the previously best known solutions, thus yielding new discoveries
AlphaEvolve discovered a new scheduling heuristic for Google’s Borg cluster management system, recovering an average of 0.7% of global compute resources that were previously stranded due to resource fragmentation.
Google’s annual capital expenditures in the tens of billions, this efficiency translates to hundreds of millions of dollars saved annually
Been a few months since I used co-pilot, but they use a model that’s worse than GPT-4/4o which is a big step down from the reasoning models.
Try out Cline, aider, or one of the tools devs actually use with the latest models from Anthropic/Google/OpenAI.
https://aider.chat/docs/leaderboards/
Didn’t look through all the issues but there were things like
The agent was blocked by configuration issues from accessing the necessary dependencies to successfully build and test. Those are being fixed and we’ll continue experimenting.
Been out less than a week, let’s see how it’s doing in a year.
Yes, despite the irrational phobia amongst the Lemmings, AI is massively useful across a wide range of examples like you’ve just given as it reduces barriers to building something.
As a CS grad, the problem isn’t it replacing all programmers, at least not immediately. It’s that a senior software engineer can manage a bunch of AI agents, meaning there’s less demand for developers overall.
Same way tools like Wix, Facebook, etc came in and killed the need for a bunch of web developers that operated in the range for small businesses.

They both suck.
Long live Stremio.
Websites are probably a better example; as the complexity and bloat has increased faster than the tech.