>>13340937yea thats pretty much how you do it.
I forgot to mention, I imagine these proprietary packages are also highly compute optimized. This is another bag of tricks that is maybe even more esoteric than the hyperparameter shit. To really make these things go fast you need to be optimizing on the register/cache level, minimizing cache misses etc. That's a very specialized skill set even among experienced programmers.
If there's custom or specialized hardware involved that could be a factor to watch out for too.