Hi, I heard that offline batch inference is not recommended in Flyte, and that it's better to deploy the model to an endpoint and send requests there. Is that true?
freezing-airport-6809
06/25/2025, 5:06 PM
where did you hear this from
freezing-airport-6809
06/25/2025, 5:06 PM
this is in fact one of the biggest uses of flyte in the world today
freezing-airport-6809
06/25/2025, 5:06 PM
in fact you should not deploy an endpoint
steep-nest-3156
06/25/2025, 5:46 PM
Great, thank you!
It popped up in our internal conversations, maybe a misunderstanding of something. And I couldn't google any direct answer.
Just to be completely clear for our internal discussion. Even if the model requires GPU, right?
freezing-airport-6809
06/25/2025, 8:07 PM
that is indeed true
freezing-airport-6809
06/25/2025, 8:07 PM
flyte works really nicely with gpus
freezing-airport-6809
06/25/2025, 8:07 PM
now you may want to use Union's reusable containers if you have very short-running jobs