dan 1 week ago

is there any open source skill benchmark workbench for claude code? i want to define multiple versions of a skill, acceptance criteria (to be evaluated by a separate agent), and a runner that does repeated runs of different versions of my skill to see if there’s a statsig improvement in any version

dan 1 week ago

seems like on ios, there’s a regression: if you go from feed to profile, back swipe no longer works reliably if you start it from a post. also quote embeds seem to maybe interfere with back swipe as well

dan 2 weeks ago

i don't remember why i made this https://morel.us-east.host.bsky.network/xrpc/com.atproto.sync.getBlob?did=did:plc:fpruhuo22xkm5o7ttr2ktxdo&cid=bafkreigu6iusqnlay75fnsp3gmhlkhg5kmwrqtyzjt7xb4i47ma64kn2vu

dan 2 weeks ago

wooooo! RE:

Bluesky Social

Google Open Source (@opensource.google)

New year's resolution, engage with open source developers on our new Bluesky account. ✨

dan 2 weeks ago

hmm i started seeing feed context hints in stories, is that expected for my account or did it accidentally get turned on for everyone RE:

Bluesky Social

Bluesky (@bsky.app)

📢 v1.113 is rolling out now! This release focuses on fixing bugs and improving the stability of the app. We also fixed an issue some of you rep...

View quoted note →

dan 2 weeks ago

i’m team em dashes for life the actual tell of most AI content is just that it doesn’t have anything to say and is boring to read people who can’t tell good writing from bad writing by content shouldn’t have any sway over how we write RE: