the bug that taught me more about PyTorch than years of using it
26 points by knl
26 points by knl
Translation of the introduction from machine-learning speak:
If you think "wait if we have a search function that takes a bunch of inputs, and a meta-search function that tunes the search function, can't we just make another search function that searches for good inputs to the meta-search function?" then the answer is "yes, but people have been trying for 40+ years and afaik it still kinda sucks".
Re: hyperparameter search. I agree "it still kinda sucks". I found Bayesian Optimization in AlphaGo to be an interesting look into hyperparameter search in practice in a production system.
Wow, this was a great read. I did not guess the bug before I got to it! Good time of year too; "it's never a compiler* bug... until it is" makes for a good comp sci horror story setup.
* Using compiler here in a broad sense.