xdotli's submissions | Hacker News

1.		Chaos of Agent (baulab.info)
		1 point by xdotli 2 days ago \| past \| 1 comment
2.		Native CLI scaffolds consistently outper-form OpenCode when using the same model (arxiv.org)
		1 point by xdotli 2 days ago \| past \| 1 comment
3.		We compare model quality in Cursor (cursor.com)
		2 points by xdotli 2 days ago \| past \| discuss
4.		Automatically Learning Skills for Coding Agents (gepa-ai.github.io)
		4 points by xdotli 19 days ago \| past
5.		We Reached 74.8% on terminal-bench with Terminus-KIRA (krafton-ai.github.io)
		2 points by xdotli 19 days ago \| past
6.		Self-generated skills don't do much for AI agents, but human-curated skills do (theregister.com)
		2 points by xdotli 20 days ago \| past \| 3 comments
7.		First Agent Skills Hackathon by the Authors of SkillsBench (skillathon.ai)
		2 points by xdotli 25 days ago \| past \| 1 comment
8.		The First Agent Skills Benchmark (huggingface.co)
		1 point by xdotli 25 days ago \| past \| 1 comment
9.		GPT-5.2 got worse on Terminal Bench 2.0, so is GPT-5.2 Pro (twitter.com/xdotli)
		1 point by xdotli 3 months ago \| past \| 1 comment
10.		Claude Skills as a Meta Tool (leehanchung.github.io)
		2 points by xdotli 3 months ago \| past
11.		Show HN: Chat with Claude Code on iMessage with Instaline (twitter.com/xdotli)
		2 points by xdotli 6 months ago \| past \| 4 comments
12.		Show HN: PokemonGym – 387 milestones designed to test agents and LLMs (twitter.com/xdotli)
		1 point by xdotli 11 months ago \| past
13.		Show HN: BenchFlow – run AI benchmarks as an API (github.com/benchflow-ai)
		24 points by xdotli 11 months ago \| past \| 1 comment
14.		Ask HN: Which CRM can help manually curated leads and automate lead discovery?
		1 point by xdotli on Feb 20, 2025 \| past \| 3 comments