Hacker News
xdotli's submissions
1.
Chaos of Agent
(
baulab.info
)
1 point
by
xdotli
2 days ago
|
past
|
1 comment
2.
Native CLI scaffolds consistently outperform OpenCode when using the same model
(
arxiv.org
)
1 point
by
xdotli
2 days ago
|
past
|
1 comment
3.
We compare model quality in Cursor
(
cursor.com
)
2 points
by
xdotli
2 days ago
|
past
|
discuss
4.
Automatically Learning Skills for Coding Agents
(
gepa-ai.github.io
)
4 points
by
xdotli
19 days ago
|
past
5.
We Reached 74.8% on terminal-bench with Terminus-KIRA
(
krafton-ai.github.io
)
2 points
by
xdotli
19 days ago
|
past
6.
Self-generated skills don't do much for AI agents, but human-curated skills do
(
theregister.com
)
2 points
by
xdotli
20 days ago
|
past
|
3 comments
7.
First Agent Skills Hackathon by the Authors of SkillsBench
(
skillathon.ai
)
2 points
by
xdotli
25 days ago
|
past
|
1 comment
8.
The First Agent Skills Benchmark
(
huggingface.co
)
1 point
by
xdotli
25 days ago
|
past
|
1 comment
9.
GPT-5.2 got worse on Terminal Bench 2.0, so is GPT-5.2 Pro
(
twitter.com/xdotli
)
1 point
by
xdotli
3 months ago
|
past
|
1 comment
10.
Claude Skills as a Meta Tool
(
leehanchung.github.io
)
2 points
by
xdotli
3 months ago
|
past
11.
Show HN: Chat with Claude Code on iMessage with Instaline
(
twitter.com/xdotli
)
2 points
by
xdotli
6 months ago
|
past
|
4 comments
12.
Show HN: PokemonGym – 387 milestones designed to test agents and LLMs
(
twitter.com/xdotli
)
1 point
by
xdotli
11 months ago
|
past
13.
Show HN: BenchFlow – run AI benchmarks as an API
(
github.com/benchflow-ai
)
24 points
by
xdotli
11 months ago
|
past
|
1 comment
14.
Ask HN: Which CRM can help manually curated leads and automate lead discovery?
1 point
by
xdotli
on Feb 20, 2025
|
past
|
3 comments