#qwen

3 posts tagged here.

Benchmarks on Local LLMs about Backend Generation, Monthly

AutoBe's first proper benchmark for backend generation, with controlled variables, a six-axis weighted rubric, and multi-dimensional precision. The function calling harness has effectively closed the gap between frontier and local models. Starting next month, expensive frontier models drop out and only small, cheap models compete. Frontend automation joins the leaderboard in two or three months.

#benchmark #function-calling #local-llm #qwen

4/30/2026 · Jeongho Nam

Qwen 3.5-27B Just Built Complete Backends from Scratch — 100% Compilation, 25x Cheaper

Qwen 3.5-27B generated complete backends with 100% compilation at 1/25th the cost of Claude Opus 4.6, and the benchmark shows nearly identical output quality.

#ai #typescript #qwen #opensource

4/8/2026 · Jeongho Nam

Function Calling Harness: From 6.75% to 100%

6.75% first-try function calling success becomes 100% compilation via type schemas, compilers, and structured feedback. Dissecting the harness engineering behind AutoBe and Typia.

#function-calling #qwen #seminar

3/26/2026 · Jeongho Nam