Hacker Newsnew | past | comments | ask | show | jobs | submit | xdotli's submissionslogin
1.Chaos of Agent (baulab.info)
1 point by xdotli 2 days ago | past | 1 comment
2.Native CLI scaffolds consistently outper-form OpenCode when using the same model (arxiv.org)
1 point by xdotli 2 days ago | past | 1 comment
3.We compare model quality in Cursor (cursor.com)
2 points by xdotli 2 days ago | past | discuss
4.Automatically Learning Skills for Coding Agents (gepa-ai.github.io)
4 points by xdotli 19 days ago | past
5.We Reached 74.8% on terminal-bench with Terminus-KIRA (krafton-ai.github.io)
2 points by xdotli 19 days ago | past
6.Self-generated skills don't do much for AI agents, but human-curated skills do (theregister.com)
2 points by xdotli 20 days ago | past | 3 comments
7.First Agent Skills Hackathon by the Authors of SkillsBench (skillathon.ai)
2 points by xdotli 25 days ago | past | 1 comment
8.The First Agent Skills Benchmark (huggingface.co)
1 point by xdotli 25 days ago | past | 1 comment
9.GPT-5.2 got worse on Terminal Bench 2.0, so is GPT-5.2 Pro (twitter.com/xdotli)
1 point by xdotli 3 months ago | past | 1 comment
10.Claude Skills as a Meta Tool (leehanchung.github.io)
2 points by xdotli 3 months ago | past
11.Show HN: Chat with Claude Code on iMessage with Instaline (twitter.com/xdotli)
2 points by xdotli 6 months ago | past | 4 comments
12.Show HN: PokemonGym – 387 milestones designed to test agents and LLMs (twitter.com/xdotli)
1 point by xdotli 11 months ago | past
13.Show HN: BenchFlow – run AI benchmarks as an API (github.com/benchflow-ai)
24 points by xdotli 11 months ago | past | 1 comment
14.Ask HN: Which CRM can help manually curated leads and automate lead discovery?
1 point by xdotli on Feb 20, 2025 | past | 3 comments

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: