8 Commits

Author SHA1 Message Date
zpeng11
3e63595f0e feat:update kv cache to [len, head, dim] to allow linear size increasement 2025-08-26 17:03:29 -04:00
zpeng11
419909b443 failed , testing expand y 2025-08-25 21:57:36 -04:00
zpeng11
26228402e3 feat:solve unified kv cache shape handling, todo: clean up upper level to unify first and following step 2025-08-25 12:06:26 -04:00
zpeng11
e4d1894a8f feat:experiments with for onnx with attention, but does not work well todo:clean code and try v3v4 2025-08-24 00:46:29 -04:00
zpeng11
610b36561a feat:remove debug, todo:rewrite the onnx export interface 2025-08-17 19:22:11 -04:00
zpeng11
8c0f32da3e feat:v2pp onnx export ready testing... 2025-08-17 17:54:57 -04:00
XXXXRT666
53cac93589
Refactor: Format Code with Ruff and Update Deprecated G2PW Link (#2255)
* ruff check --fix

* ruff format --line-length 120 --target-version py39

* Change the link for G2PW Model

* update pytorch version and colab
2025-04-07 16:42:47 +08:00
Ναρουσέ·μ·γιουμεμί·Χινακάννα
7d1e94c8b0
Add AR Onnx Module 2024-01-25 02:31:08 +08:00