**Simon** · Dec 03, 2025, 17:28

Simon @spoltier@qoto.org

1.79K Posts

374 Following

29 Followers

Country: 🇨🇭

profile banner: https://showyourstripes.info/

code / data wrangler in Switzerland.
Recovering reply guy. Posts random photos once in a while.

Joined Jul 2023

Simon boosted

**Leshem Choshen** @LChoshen@sigmoid.social · Dec 03, 2025, 17:28

“Today, I have a vision, a vision of superintelligence from experience”

Presented in his humble way, @richardSutton shares his vision of what AI needs
General, experiential, discovers its own abstractions and not bitter🤢
#NeurIPS2025 #NeurIPS

[image]

**Simon** · Dec 03, 2025, 05:11

Simon boosted

**Tim Kellogg** @timkellogg.me@bsky.brid.gy · Dec 03, 2025, 05:11

in the words of Gemini 3: “It is basically a Frankenstein monster combining a CNN (Convolutional Neural Network) and a Transformer, organized like a mammalian brain” 0.5B, SYNTH huggingface.co/mkurman/Neur...

[image]

**Simon** @spoltier@qoto.org · Dec 03, 2025, 17:39

@timkellogg.me gemini 3 is like "yuck, *mammalians*"

**Simon** · Dec 02, 2025, 17:32

Simon boosted

**Simon Willison** @simon@simonwillison.net · Dec 02, 2025, 17:32

Four new models from Mistral today - all Apache 2 licensed, all vision-capable, and one of them is a 3GB model that can run in a web browser and answer questions about things it can see through the webcam! https://simonwillison.net/2025/Dec/2/introducing-mistral-3/

**Simon** · Nov 30, 2025, 16:23

Simon boosted

**Karsten Schmidt** @toxi@mastodon.thi.ng · Nov 30, 2025, 16:23

Hierarchies 😩... One of the biggest recurring time-consuming issues I sometimes encounter is making decisions about _where_ to put some (new or existing) code/feature, i.e. in which package, new or existing, considering: functional fit (topic), structural fit (pre-existing data format conventions with the rest of a package), and if possible, not introducing new dependencies as a result of new feature... Sometimes these three aspects are mutually blocking each other and it's so time consuming to figure out a solution...

I've got very similar issues with most other static hierarchies (e.g. directory-based file systems, hierarchical websearch directories etc.) and why I think tag-based systems (with intersection/union/negation ops, not just single categories) are a superior way to organize large collections of knowledge (counting source code here too as a form of encoded knowledge). It's also one of the reasons I've been experimenting with and building tools with completely flat collections/graphs and then use queries & transclusion to assemble/extract/select functionality on demand... Need to prepare some screen recordings to share more of those tools/experiments...

#Hierarchy #Tagging #SoftwareArchitecture
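
The tag queries described in the post above (intersection, union, negation over a flat collection) can be sketched roughly as follows. This is only an illustration, not the author's tooling: the `Item` type, the `query` function, its `all_of` / `any_of` / `none_of` parameters, and the sample items are all hypothetical names chosen for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Item:
    """A piece of code or knowledge in a flat collection (hypothetical type)."""
    name: str
    tags: frozenset[str]


def query(items, all_of=(), any_of=(), none_of=()):
    """Return names of items matching a tag query.

    all_of  -> intersection: every listed tag must be present
    any_of  -> union: at least one listed tag must be present (ignored if empty)
    none_of -> negation: no listed tag may be present
    """
    all_of, any_of, none_of = set(all_of), set(any_of), set(none_of)
    return [
        it.name
        for it in items
        if all_of <= it.tags
        and (not any_of or any_of & it.tags)
        and not (none_of & it.tags)
    ]


# Hypothetical sample data: one flat list, no package hierarchy.
ITEMS = [
    Item("parse_csv", frozenset({"io", "parsing", "no-deps"})),
    Item("render_svg", frozenset({"io", "graphics"})),
    Item("fit_curve", frozenset({"math", "graphics", "no-deps"})),
]

# Dependency-free items about io or math, excluding anything graphics-related.
print(query(ITEMS, all_of={"no-deps"}, any_of={"io", "math"}, none_of={"graphics"}))
```

In a sketch like this an item can carry any number of tags, so the "where does it go" decision becomes a labeling choice that can be revised later without moving anything.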

**Simon** · Nov 29, 2025, 06:01

Simon boosted

**Brewster Kahle** @brewsterkahle@mastodon.archive.org · Nov 29, 2025, 06:01

Why SUV when you can LSV? (Low Speed Vehicles)

25mph max car! street legal in San Francisco.

Can drive on almost all roads in the city.

This one is not that great, imho (but try it at gocar). I want to see more LSVs. Amsterdam has many, and many types.

all roads w/ 35mph limits or less are ok, almost all roads in SF. Here are all limits on all roads in SF:
https://docs.google.com/spreadsheets/d/1CYnKdHTLK2tolrgvIenq7-_Y2D6SlMARkTyvm7ZjTqU/edit?gid=2067870016#gid=2067870016

from: https://catalog.data.gov/dataset/speed-limits-per-street-segment/resource/d6034e92-4836-40e5-8e93-52bad2dfaf3d (go sf!)

**Simon** @spoltier@qoto.org · Nov 29, 2025, 12:21

@brewsterkahle Seen any microlinos?

[image]

**Simon** · Nov 24, 2025, 06:23

Simon boosted

**Bryan Cantrill** @bcantrill@mastodon.social · Nov 24, 2025, 06:23

On Monday, @ahl and I will be joined by members of the Oxide team to talk about a doozy: an 18-year-old ZFS data corruption bug that we recently nailed. We'll be at a special Europe-friendly time: 9a Pacific/noon Eastern/5p GMT -- join us for the wild tale!

https://discord.gg/QrcKGTTPrF?event=1442398752215400459

**Simon** @spoltier@qoto.org · Nov 28, 2025, 14:20

@bcantrill @ahl just caught up on this one, eagerly awaiting resolution on the DMA issue, was it the nerd Mandela effect or is it real?

**Simon** @spoltier@qoto.org · Nov 28, 2025, 11:38

Wrote about using #googlejules to migrate an #RStats test suite to #testthat

https://www.linkedin.com/posts/mirai-solutions-gmbh_from-runit-to-testthat-with-coding-agent-activity-7399043585066655744-yRX6

#aiagents

**Simon** @spoltier@qoto.org · Nov 26, 2025, 14:15

@deviantollam never flew as a kid, but used to love thunderstorms!

**Simon** @spoltier@qoto.org · Nov 26, 2025, 11:13

@timkellogg.me yeah, if the first attempt fails by wiring money to the wrong account, I don't care that it didn't cost as much in compute. They then omit the better agent / models in the table where they only compare accuracy...

**Simon** · Nov 25, 2025, 13:59

Simon boosted

**Tim Kellogg** @timkellogg.me@bsky.brid.gy · Nov 25, 2025, 13:59

community note: using cost on the y axis makes it appear like cheaper models are more capable on pass@3

**Simon** · Nov 21, 2025, 09:32 *

Simon boosted

**Electric Gremlin** @trysdyn@electric.marf.space · Nov 21, 2025, 09:32 *

Someone sent vee the most amazing study on a pediatric hospital contacting professional racing pit teams and asking them to advise on drafting a handoff procedure for ICU patients of the highest concern between wards.

And Ferrari and Williams went "You all are babies, let us show you how it's done" and cut the error rate in handoffs by like 20% and generally found that you need less training, not more, to do it correctly despite having a faster, more detailed protocol.

This is my shit, so much.

Edit: The link to the article is in a reply to prevent masto-hugging the host but people seem to not be seeing it: https://onlinelibrary.wiley.com/doi/pdf/10.1111/j.1460-9592.2006.02239.x

**Simon** @spoltier@qoto.org · Nov 23, 2025, 11:49

@timkellogg.me@bsky.brid.gy addendum:

[image]

**Simon** · Nov 22, 2025, 23:23

Simon boosted

**Tim Kellogg** @timkellogg.me@bsky.brid.gy · Nov 22, 2025, 23:23

he’s nice even when he’s trashing someone

[image]

**Simon** @spoltier@qoto.org · Nov 23, 2025, 07:40

@timkellogg.me is this Nano banana pro? Either it's a bit confused with greek vs latin alphabets, or it's making math / typographical puns

[image]

**Simon** · Nov 23, 2025, 02:19

Simon boosted

**Tim Kellogg** @timkellogg.me@bsky.brid.gy · Nov 23, 2025, 02:19

Evolutionary Algorithms for optimizing LLM weights

Gradient descent and backpropagation have a lot of problems, alignment becomes a nightmare. Evolutionary algos fix this, but they don’t scale

A recent paper, EGGROLL, makes it computationally feasible to do now

www.alphaxiv.org/abs/2511.16652

**Simon** · Nov 21, 2025, 12:18

Simon boosted

**The Europeans** @europeanspodcast@mastodon.social · Nov 21, 2025, 12:18

There's amazing new music coming out from European women artists at the moment. This week on the podcast we've got recommendations for Robyn, Oklou, Zaho de Sagazan and Lily Allen. Who should we be adding to our list?

#music #pop #europe #culture #podcast

**Simon** · Nov 22, 2025, 04:35 *

Simon boosted

**Terence Tao** @tao@mathstodon.xyz · Nov 22, 2025, 04:35 *

Over at the Erdos problem website, AI assistance is now becoming routine. Here is what happened recently regarding Erdos problem #367 https://www.erdosproblems.com/367 :

1. On Nov 20, Wouter van Doorn produced a (human-generated) disproof of the second part of this problem, contingent on a congruence identity that he thought was true, and was "sure someoneone here is able to verify... does indeed hold".

2. A few hours later, I posed this problem to Gemini Deepthink, which (after about ten minutes) produced a complete proof of the identity (and confirmed the entire argument): https://gemini.google.com/share/81a65aecfd70 . The argument used some p-adic algebraic number theory which was overkill for this problem. I then spent about half an hour converting the proof by hand into a more elementary proof, which I presented on the site. I then remarked that the resulting proof should be within range of "vibe formalizing" in Lean.

3. Two days later, Boris Alexeev used the Aristotle tool from Harmonic to complete the Lean formalization, making sure to formalize the final statement by hand to guard against AI exploits. This process took two to three hours, and the output can be found at https://borisalexeev.com/t/Erdos367.lean

EDIT: after making this post, I decided to round things out by making AI literature searches on this problem, which (after about fifteen minutes) turned up some related literature on consecutive powerful numbers, but nothing directly relating to #367. https://chatgpt.com/share/6921427d-9dc0-800e-b798-be8fc94a9240 https://gemini.google.com/share/0d296454bea0
