
I am building a Flutter app that has to run three separate TensorFlow Lite models on-device:

  1. An embedding model
  2. An action video detection model
  3. A DistilGPT2 RAG model

Currently, I bundle all .tflite models inside the assets/ folder and load them using tflite_flutter.
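
For reference, this is roughly what my loading code looks like (simplified; the asset file names are illustrative):

```dart
import 'package:tflite_flutter/tflite_flutter.dart';

// All three interpreters are loaded from assets bundled in the APK.
late Interpreter embeddingModel;
late Interpreter videoModel;
late Interpreter gpt2Model;

Future<void> loadModels() async {
  embeddingModel = await Interpreter.fromAsset('assets/embedding.tflite');
  videoModel = await Interpreter.fromAsset('assets/video_detection.tflite');
  gpt2Model = await Interpreter.fromAsset('assets/distilgpt2.tflite');
}
```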

Because all three models are bundled in the app, the APK/IPA size has become very large and runtime performance is also suffering.

What I've tried so far:

  • Applied post-training quantization (int8 and float16) to reduce model size.
  • Loaded the models with tflite_flutter in separate isolates.
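
For the isolate part, my setup looks roughly like this (simplified sketch; the asset path is illustrative):

```dart
import 'package:tflite_flutter/tflite_flutter.dart';

// Move inference off the UI isolate so heavy models don't block frames.
Future<IsolateInterpreter> loadModelInIsolate(String assetPath) async {
  final interpreter = await Interpreter.fromAsset(assetPath);
  // IsolateInterpreter runs invocations in a background isolate.
  return IsolateInterpreter.create(address: interpreter.address);
}
```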

Even so, the app is still very large, and running the models (particularly the video detection model and DistilGPT2) still causes noticeable lag.

My questions:

  1. What are the best practices for running multiple TFLite models in a Flutter app without bloating the app size?
  2. For a video model and a language model such as DistilGPT2, what is the best way to optimize on-device performance?

Environment:

  • Flutter 3.x
  • TensorFlow Lite
  • Target: Android

Any advice, optimization suggestions, or example strategies would be highly appreciated.
