Kevin Zhang 50c3664496
chore: add the ability of lru cache for api v3 to improve the inference speed when exchange model weights (#1058)
* chore: add the ability of lru cache for api v3 to improve the inference speed when exchange model weights

* chore: Dockerfile start api v3

* chore: api default port from 127.0.0.1 to 0.0.0.0

* chore: make gpu happy when do tts

* chore: rollback Dockerfile

* chore: fix

* chore: fix

---------

Co-authored-by: kevin.zhang <kevin.zhang@cardinfolink.com>
2024-05-19 17:15:56 +08:00
..
2024-03-08 23:41:59 +08:00
2024-03-08 23:41:59 +08:00
2024-03-08 23:41:59 +08:00
2024-01-16 17:38:48 +08:00
2024-02-08 21:38:38 +08:00
2024-02-17 16:45:31 +08:00