An LLM prompted to “implement SQLite in Rust” will generate code that looks like an implementation of SQLite in Rust. It will have the right module structure and the right function names. But it cannot magically produce the performance invariants that exist only because someone profiled a real workload and found the bottleneck. The Mercury benchmark (NeurIPS 2024) confirmed this empirically: leading code LLMs achieve ~65% on correctness but under 50% when efficiency is also required.
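To make the gap between "looks right" and "performs right" concrete, here is a minimal sketch. It is not taken from SQLite or from any generated codebase: the `Row` type and the functions `find_by_scan`, `build_index`, and `find_by_index` are hypothetical names invented for illustration, and the sketch assumes row ids are unique. Both lookups return the same answer, so a correctness test cannot tell them apart; only the second keeps the invariant that a lookup by key must not touch every row.

```rust
use std::collections::BTreeMap;

/// Hypothetical row type, for illustration only.
#[derive(Debug, Clone, PartialEq)]
pub struct Row {
    pub id: i64,
    pub name: String,
}

/// Plausible-looking lookup: correct, but it scans the rows one by one,
/// so each query costs O(n) in the number of rows.
pub fn find_by_scan(rows: &[Row], id: i64) -> Option<&Row> {
    rows.iter().find(|r| r.id == id)
}

/// Builds an ordered index from id to position in `rows`.
/// Assumes ids are unique; with duplicates, the last position wins.
pub fn build_index(rows: &[Row]) -> BTreeMap<i64, usize> {
    rows.iter().enumerate().map(|(i, r)| (r.id, i)).collect()
}

/// Same result as `find_by_scan` (given unique ids), but each query
/// costs O(log n) because it goes through the ordered index.
pub fn find_by_index<'a>(
    rows: &'a [Row],
    index: &BTreeMap<i64, usize>,
    id: i64,
) -> Option<&'a Row> {
    index.get(&id).map(|&i| &rows[i])
}
```

A test suite that only compares outputs passes for both functions. The difference shows up only under a workload with many rows and many lookups, which is exactly the kind of evidence that comes from profiling rather than from the prompt.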