mozilla: "we choose the path of the most capital intensive boondoggle we can't afford"
it sort of sucks that now we are in an era where no new software fucking works and all changes to working software are to make it not work
So if I'm reading this right: (If one forgives the sampling limitations (one question, 100 positive responses, etc.), which I'm not sure you should...) The output of a language model is predictable enough that, even with simple ML models, if you use an LLM to generate prompts about a topic x and not about a topic x record the timestamps of when the LLM responded with the next chunk of tokens for each generated prompt train the ML model to classify "known to be/not be about x" from the timestamps then you can get effectively perfect precision of guessing when something is about x. Or, if Microsoft's own research is true, then the output of LLMs is not in fact a divinely powerful information oracle, but so stereotyped you can predict the topic by the timing of responses. The oracle only appears divine because we cannot see any of the other requests, and are led to believe our question is unique and personal. Or, this is a profound self own about the use of LLMs for information, period.
Aaron Swartz should have been 39 years old today #AaronSwartzDay
@npub1yasa...andl what have you done
i hadn't "taken poison on purpose and just full on vibe coded as a bit" in awhile, so i gave it a whirl trying to mimic the manic intensity of some of the PRs i see nowadays. i guess i didn't really appreciate how anyone who talks about using LLMs for code is eliding the fact that they are stupendously wealthy or otherwise have access to resources, because doing some stuff i do for free for like an hour on cheap mode cost me $15, which is the maximum i will spent on a bit of this caliber. if i was actually "working" with that there's no way it would cost me less than $200 a day to simply do things i do on my own now.
Here's another #FEP for representing torrents on activitypub :) short, sweet, and with a reference implementation and tests! towards a federated bittorrent tracker with #sciop ! PR: Discussion: (or this thread) #FEP_d8c8 #BitTorrentOverActivityPub #FederatedP2P #BitTorrent
it's never too late to drop out and dedicate your last academic breath spitting sand in the gears of the machine so that new fertile ground might bloom View quoted note β†’
I can't hang on bsky because it brings back the twitter in me. I saw someone saying that Wikipedia is bloated and inefficient because "why spend $100m on labor when the hosting is only $3m." Like a) thats an absolute bargain for a site that serves free high quality information in 27b views a month, and b) how do you think hosting costs are that cheap? Ive seen bloated nonprofits, but look at wikimedia's 990 - the top execs are taking like $400k. Thats half what society for neuroscience's CEO makes for a way more complicated and impactful organization
question: "is the title field in a work registered in the DOI system a cost-effective means of storage" answer: curl https://api.datacite.org/dois/10.57874/531s-1f11 | jq -r '.data.attributes.titles[0].title' | base64 --decode | play -t mp3 - https://www.octopus.ac/publications/n6k2-p549/versions/1