vibe-coding

A collection of 86 posts

Open-Source Maintainers Have a Name for What AI Coding Agents Are Doing to Their Inboxes: "AI Slop"

Open-source maintainers have coined a term for what AI coding agents are doing to their contribution queues: "AI slop." Low-quality AI-generated code, PRs, documentation, and bug reports are flooding development workflows at a pace that review processes weren't built to handle. A new qualitative study puts

31 Mar 2026 1 min read

vibe-coding

Agents Write Safer New Code Than Humans — But Maintaining Existing Code Is Where They Break Things

Coding agents are often described as risky for production codebases, but the risk is not uniformly distributed. A new comparative study of 7,191 agent-generated pull requests and 1,402 human-authored PRs from Python repositories in the AIDev dataset finds a counter-intuitive split: coding agents introduce fewer breaking changes than

31 Mar 2026 1 min read

vibe-coding

304,362 AI Commits, One Uncomfortable Finding: AI-Generated Code Accumulates Technical Debt Faster Than It's Cleaned Up

Most research on AI-generated code quality operates under controlled conditions — short-lived experiments, synthetic benchmarks, carefully curated prompts. What happens in production repositories over time is a different question, and until now it's been largely unanswered. A new large-scale empirical study changes that by tracking 304,362 verified AI-authored

31 Mar 2026 1 min read

vibe-coding

Your Coding Agent Is Reading Files When It Should Be Querying a Graph — Codebase-Memory Changes the Equation

The default pattern for coding agents exploring an unfamiliar codebase is expensive and surprisingly shallow. When an agent needs to understand what a function calls, where a class is used, or which modules depend on each other, it falls back on repeated file reads and grep searches — accumulating thousands of

31 Mar 2026 1 min read

vibe-coding

Stop Using AGENTS.md and CLAUDE.md — Research Says You're Probably Doing It Wrong

AGENTS.md and CLAUDE.md have become standard practice in AI-assisted development — the go-to mechanism for telling a coding agent what it needs to know about your project. A controlled study across real repository benchmarks now complicates that assumption. Developer-written context files produced a modest performance improvement. LLM-generated context files

31 Mar 2026 1 min read

vibe-coding

The Architecture of Agentic Governance: When Your Two Agents Lock Each Other in Deadlock

A supply-chain team deployed two AI agents simultaneously — one optimizing procurement speed, one minimizing daily capital spend. When demand spiked, the agents entered complete gridlock: the procurement agent generated purchase orders; the finance agent canceled them immediately. The cycle repeated thousands of times, burning compute tokens while delivering zero operational

31 Mar 2026 1 min read

vibe-coding

Long-Horizon Agents Are Here. Full Autopilot Isn't

Long-horizon agents — those capable of working through multi-step tasks over extended sessions — are no longer a research prototype. They are operational in real engineering environments today. But operational does not mean autonomous. The meaningful breakthrough of early 2026 is not that agents can now be left unsupervised; it is that

31 Mar 2026 1 min read

vibe-coding

Vision2Web: The First Benchmark That Measures Whether Coding Agents Can Actually Build a Real Website End-to-End

Vibe coding's most common real-world use case — "build me a website" — has had no rigorous benchmark until now. Vision2Web, accepted at ICML and built by Zehai He et al. at Zhipu AI, fills that gap with 193 tasks, 918 prototype images, and 1,255 test cases

30 Mar 2026 1 min read

vibe-coding

JetBrains Central: The Governance Layer That Shows Up When You're Juggling Claude Code, Codex, and Junie on the Same Team

JetBrains just announced Central, an open platform for unified AI agent management — and the timing reflects a specific operational crisis that's only visible once a team gets past the "our first agent works" milestone. The problem isn't agent capability. It's what happens

30 Mar 2026 1 min read

vibe-coding

Your Web-Augmented Coding Agent Is Being Misled by Bad Search Results — Sherlock Detects and Repairs It Automatically

Most teams running web-augmented coding agents — those that issue live search queries before generating code — have no systematic defense against bad search results quietly degrading output quality. A new paper from Guoqing Wang et al. documents exactly what happens when that defense is absent, and introduces Sherlock, an automated pipeline

30 Mar 2026 1 min read

vibe-coding

The Prompt Is the Bottleneck: Why 70% of Agentic Sessions Will Start Without a Human in the Loop

At Cognition, the team building Devin has started tracking a ratio that most engineering organizations don't measure: how many of their agent sessions are kicked off by a human versus by a machine signal. Right now, it's 70% human, 30% automated trigger. Nader Dabit, a senior

30 Mar 2026 1 min read

vibe-coding

Stop Copying Agent Skills Between Claude Code, Cursor, and Codex — Index In Place Instead

Any engineer running more than one AI coding tool has the problem itlackey describes in part five of his practical skills management series: three tools, three separate skills directories, none of them talking to each other. Claude Code has its skills in ~/.claude/skills/. Codex has a different path. Cursor

30 Mar 2026 1 min read

← Newer Posts Page 3 of 8 Older Posts →