Commit Graph

17 Commits

Author       SHA1        Message                        Date
Woosuk Kwon  1ce1333573  Set default dtype to half      2023-02-23 21:31:39 +00:00
Woosuk Kwon  fdd0f2f472  Minor                          2023-02-23 20:23:47 +00:00
Woosuk Kwon  1f6c7ef437  Add controller                 2023-02-23 09:32:19 +00:00
Woosuk Kwon  343cea3dbc  Add seq_ids to input metadata  2023-02-23 09:25:01 +00:00
Woosuk Kwon  4b1ac23f53  Fix slot mapping               2023-02-23 00:10:07 +00:00
Woosuk Kwon  8290fce47d  Add Worker class               2023-02-22 19:01:38 +00:00
Woosuk Kwon  709a69176e  Move worker/models -> models   2023-02-22 18:03:48 +00:00
Woosuk Kwon  6f058c7ba8  Implement cache ops            2023-02-16 07:47:03 +00:00
Woosuk Kwon  a1c67e6db8  Minor                          2023-02-16 01:42:53 +00:00
Woosuk Kwon  9e68a6827e  Fix return type error          2023-02-16 01:33:03 +00:00
Woosuk Kwon  8edcabc737  Add warning                    2023-02-16 01:28:17 +00:00
Woosuk Kwon  2f4887de77  Fix KVCache shape              2023-02-16 01:24:45 +00:00
Woosuk Kwon  ee9442518d  Fix get_model                  2023-02-13 22:51:03 +00:00
Woosuk Kwon  fffa2e1f4b  Add model_utils                2023-02-13 09:36:12 +00:00
Woosuk Kwon  bb59a3e730  Fix cache engine               2023-02-13 09:35:48 +00:00
Woosuk Kwon  e7bee2aa81  Add cache engine               2023-02-09 11:28:02 +00:00
Woosuk Kwon  39161c98a0  Add OPT                        2023-02-09 11:25:37 +00:00