Great job the only think I didn't understand was do you ssh into some cloud machine with large computing power and it terms of deploying this where did you deploy the flask micro service.
It greatly depends on the requirements that the model has. Some models can even run off of cpu and system ram. But, there are many GPU VMs available with AWS, Azure, etc. and you’d deploy it similar to how you would your other applications.