The Open/Closed Problem in AI

12 points by mempko


kornel

I don't see a problem with baking LLM inference into hardware.

Our capital overlords decided they're going to stuff LLMs into everything, even if it boils rivers dry, so any efficiency improvement is helpful.

The GPUs we have are ridiculously ill-suited for LLM inference. They have way too much compute compared to RAM bandwidth. Inference at large scale is a complicated slicing of models across GPUs with very fast interconnects, and even that only improves throughput, not latency and not efficiency.

I don't think model architecture changes will obsolete such ASICs before chip making process will. We haven't moved past the Transformer model since the original GPT. All the inference improvements can be boiled down to more matrix multiplications. The ASIC aren't literally a single model, just a dumb GPGPU without any pixels or triangles, just maximising matmul and memory.

colonelpanic

Our brains use a closed loop to learn. Our brains have a model of the outside world; they make predictions on what our senses should sense, and then check our senses to see how far off the prediction is. If the prediction is wrong, the brain is surprised and updates the model to make a more accurate prediction. In other words, there is no outside process for our brains to accumulate knowledge. It's done all inside our brain, a closed loop.

Wrong in a very fundamental way. The world is the outside process. Kant's division of phenomena and noumena forgets that we have immediate access to the objects of our contemplation and can make direct changes to them.

carlana

Interesting.

Could you make the same point about the stack register? Back in the day there used to be a lot of different programming paradigms, but now we all just use stacks. But stacks seem fine so it's not a problem?