Close Menu
TechTost
  • AI
  • Apps
  • Crypto
  • Fintech
  • Hardware
  • Media & Entertainment
  • Security
  • Startups
  • Transportation
  • Venture
  • Recommended Essentials
What's Hot

Sarvam becomes India’s newest AI unicorn with $234M funding round led by HCLTech

Cybersecurity vets protest ‘dangerous’ US government ban on Anthropic’s most powerful models

UK unveils sweeping social media ban on under-16s

Facebook X (Twitter) Instagram
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer
Facebook X (Twitter) Instagram
TechTost
Subscribe Now
  • AI

    Cybersecurity vets protest ‘dangerous’ US government ban on Anthropic’s most powerful models

    15 June 2026

    OpenAI is facing investigation by state attorneys general

    15 June 2026

    Meta is reportedly moving to loosen the $2bn Manus deal following Beijing’s demand

    14 June 2026

    As Anthropic blocks access to new models, India debates its AI future

    14 June 2026

    Anthropic’s security warnings may have failed – the government has pulled the plug on its most powerful AI

    13 June 2026
  • Apps

    UK unveils sweeping social media ban on under-16s

    15 June 2026

    Apple is bringing streaming-style subscription packages to the App Store

    15 June 2026

    Snapchat restricts users under 16 from sharing Spotlights with friends

    14 June 2026

    These are the countries that are moving to ban social media for children

    14 June 2026

    Coinbase’s new tool can help agents trade and pay for premium research

    13 June 2026
  • Crypto

    Startup Battlefield 200 applications close today

    27 May 2026

    5 days left: Save up to $410 on Disrupt 2026 passes

    25 May 2026

    As crypto cools, a16z crypto raises $2.2 billion in capital

    6 May 2026

    Coinbase to lay off 14% of staff as part of broader restructuring

    5 May 2026

    British cryptographer Adam Back denies NYT report that he is Bitcoin creator Satoshi Nakamoto

    9 April 2026
  • Fintech

    Ramp raises $750M at $44B valuation as investors thirst for fintechs with AI history

    5 June 2026

    Last 24 hours to save up to $410 on your Disrupt 2026 ticket

    29 May 2026

    2 days left: Lock in up to $410 in ticket savings for Disrupt 2026

    28 May 2026

    Robinhood now allows your AI agents to trade stocks

    28 May 2026

    Disrupt 2026 Early Bird ticket savings expire in 3 days

    27 May 2026
  • Hardware

    This slim speaker under the pillow helped me sleep without headphones

    14 June 2026

    Jeff Bezos’ Prometheus Raises $12 Billion to Build an ‘Artificial General Engineer’ for the Natural World

    12 June 2026

    WWDC 2026: What to expect, from Siri’s long-awaited revamp to Apple Intelligence and iOS 27

    9 June 2026

    What to expect from WWDC 2026: The long-awaited Siri refresh and Apple Intelligence updates

    7 June 2026

    What to expect from WWDC 2026: The long-awaited Siri refresh and Apple Intelligence updates

    5 June 2026
  • Media & Entertainment

    Deezer’s new tool can recognize AI music from Spotify, Apple Music and more

    11 June 2026

    Netflix expands revamped mobile app across Asia and doubles down on games for kids

    10 June 2026

    Plex adds new social features ahead of major price hike for its lifetime pass

    6 June 2026

    Startup Battlefield 200 applications officially close in 3 days

    5 June 2026

    Founders Fund Launches Series of Games Starring Sam Altman, Palmer Luckey and Other Tech Elites

    5 June 2026
  • Security

    The FBI built its own replica small town to simulate real-world cyberattacks

    13 June 2026

    US surveillance law to expire for first time after lawmakers rejected Trump’s controversial pick to lead spy agency

    13 June 2026

    Chinese cybercrime operation that used artificial intelligence to scam ‘hundreds of thousands of victims’ sued by Google

    12 June 2026

    ServiceNow is telling customers that a bug left some of their data exposed online

    12 June 2026

    Oracle warns of security flaw that hackers abused to breach 100+ companies

    11 June 2026
  • Startups

    Sarvam becomes India’s newest AI unicorn with $234M funding round led by HCLTech

    15 June 2026

    As AI companies scramble to go public, who else is along for the ride?

    14 June 2026

    Jedify Raises $24M To Help Companies Arm AI Agents With Their Business Context

    12 June 2026

    Military SPAC Quantum Space is trying to catch SpaceX’s IPO wave

    12 June 2026

    Microsoft is using Alt Carbon as a sign of India’s growing role in carbon removal

    11 June 2026
  • Transportation

    GM is joining the race to make batteries for AI data centers and the grid

    15 June 2026

    TechCrunch Mobility: SpaceX rockets pass Tesla

    14 June 2026

    Waymo says it has created a better benchmark for comparing robotics to humans

    14 June 2026

    SpaceX IPO closes up 19% and delivers world’s first trillionaire

    13 June 2026

    SpaceX IPO: Live updates on everything you need to know

    13 June 2026
  • Venture

    Orbio raises $21 million to automate hiring and onboarding of frontline workers

    15 June 2026

    Why business AI will be the focus of VivaTech 2026

    10 June 2026

    How Justin Ernest invested nearly $500 million in hot startups without a traditional VC fund

    10 June 2026

    Mercor’s Brendan Foody calls out Sequoia, accusing it of “double pricing” valuation tricks.

    9 June 2026

    Founders share VC horror stories and some name names

    6 June 2026
  • Recommended Essentials
TechTost
You are at:Home»AI»A new AI coding challenge has just published its first results – and is not beautiful
AI

A new AI coding challenge has just published its first results – and is not beautiful

techtost.comBy techtost.com24 July 202503 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Email
A New Ai Coding Challenge Has Just Published Its First
Share
Facebook Twitter LinkedIn Pinterest Email

A new AI coding challenge revealed his first winner-and set a new bar for AI software engineers.

On Wednesday at 5 pm PST, the Laude Non -Profit Institute announced the first winner of the K award, a multilevel Coding Challenge started by Databricks and co -founder Andy Konwinski. The winner was a Brazilian engineer called Eduardo Rocha de Andrade, who will receive $ 50,000 for the prize. But more amazing than the victory was his final score: he won the right answers to just 7.5% of test questions.

“We are happy to have built a reference point that is really difficult,” Konwinski said. “The benchmarks should be difficult if they are going to matter,” he continued, adding: “The scores would be different if the big laboratories had entered their largest models, but this is the kind of point.

Konwinski is committed to $ 1 million in the first open source model that can rate higher than 90% in the test.

Similar to the well -known Swench system, the K Award Tests models against signs of Github issues as a test for how good models can deal with real world planning problems. However, while the Swench is based on a stable set of problems that can train models, the K award is designed as “version without SWENCH infection”, using a timed input system to protect against any special reference training. For the first round, the models are due to March 12th. The organizers of the K award then built the test using only GitHub issues highlighted after this date.

The 7.5% top score is intense in contrast to Swe Bench itself, which currently shows a top 75% top score in the easiest “verified” test and 34% of the toughest “complete” test. Konwinski is still not sure if inequality is due to the infection in the Swench or simply to challenge the collection of new issues from Github, but expects that the K will soon answer the question.

“As we have more routes of the thing, we will have a better feel,” he told TechCrunch, “because we expect people to adapt to the dynamics of competition every few months.”

TechCrunch event

Francisco
|
27-29 October 2025

It may seem like a strange place to remain, given the wide range of AI coding tools that are already available to the public – but with reference points to become very easy, many critics see projects such as the K as a necessary step towards resolving The growing AI evaluation problem.

“I am quite refreshing to build new tests for existing reference points,” says Princeton Sayash Kapoor researcher, who presented a similar idea In a recent document. “Without such experiments, we can’t really say if the issue is infection, or even just aiming at the table with man with a man in the loop.”

For Konwinski, it’s not just a better point of reference, but an open challenge for the rest of the industry. “If you hear the advertising campaign, it’s like seeing AI doctors and AI lawyers and AI software engineers, and that’s not true,” he says. “If we can’t even get more than 10% in a cooling infection, this is the control of reality for me.”

Andy Konwinski beautiful challenge Coding K prize Laude Institute published results
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleOpen Source X opponent Mastodon begins to raise funds with new in -app donation feature
Next Article Former Y Combinator, A16Z experts hold a summit for founders only for founders
bhanuprakash.cg
techtost.com
  • Website

Related Posts

Cybersecurity vets protest ‘dangerous’ US government ban on Anthropic’s most powerful models

15 June 2026

OpenAI is facing investigation by state attorneys general

15 June 2026

Meta is reportedly moving to loosen the $2bn Manus deal following Beijing’s demand

14 June 2026
Add A Comment

Leave A Reply Cancel Reply

Don't Miss

Sarvam becomes India’s newest AI unicorn with $234M funding round led by HCLTech

15 June 2026

Cybersecurity vets protest ‘dangerous’ US government ban on Anthropic’s most powerful models

15 June 2026

UK unveils sweeping social media ban on under-16s

15 June 2026
Stay In Touch
  • Facebook
  • YouTube
  • TikTok
  • WhatsApp
  • Twitter
  • Instagram
Fintech

Ramp raises $750M at $44B valuation as investors thirst for fintechs with AI history

5 June 2026

Last 24 hours to save up to $410 on your Disrupt 2026 ticket

29 May 2026

2 days left: Lock in up to $410 in ticket savings for Disrupt 2026

28 May 2026
Startups

Sarvam becomes India’s newest AI unicorn with $234M funding round led by HCLTech

As AI companies scramble to go public, who else is along for the ride?

Jedify Raises $24M To Help Companies Arm AI Agents With Their Business Context

© 2026 TechTost. All Rights Reserved
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer

Type above and press Enter to search. Press Esc to cancel.