Commit graph

17 commits

Author            SHA1        Message                                              Date
Ashwin Bharambe   86c2993296  small rename                                         2024-07-11 00:07:19 -07:00
Ashwin Bharambe   631328f556  added DPO                                            2024-07-11 00:01:58 -07:00
Ashwin Bharambe   7cade3acc3  fixes                                                2024-07-10 23:33:57 -07:00
Ashwin Bharambe   ee86f2c75f  memory banks                                         2024-07-10 23:28:16 -07:00
Raghotham Murthy  6fb69efbe5  Added batch inference                                2024-07-10 23:25:23 -07:00
Raghotham Murthy  22d6093258  missed html/yaml                                     2024-07-10 23:01:00 -07:00
Ashwin Bharambe   c1f6816d76  some finetuning changes                              2024-07-10 22:09:13 -07:00
Raghotham Murthy  6ec7c47938  reward scoring model enum                            2024-07-10 21:59:01 -07:00
Raghotham Murthy  ebb59aa35f  reward scoring                                       2024-07-10 21:56:16 -07:00
Ashwin Bharambe   69ecf55de2  finetuning                                           2024-07-10 20:47:05 -07:00
Ashwin Bharambe   956f07b04c  fixes to reward stuff                                2024-07-10 19:22:33 -07:00
Ashwin Bharambe   beb2870750  initial client + server for agentic system           2024-07-09 22:17:36 -07:00
Ashwin Bharambe   13e1667e7a  add safety                                           2024-07-09 16:08:47 -07:00
Ashwin Bharambe   256f1d5991  agentic system stream changes                        2024-07-09 15:08:54 -07:00
Ashwin Bharambe   97f9b18aca  more work on agent definitions                       2024-07-09 13:53:09 -07:00
Ashwin Bharambe   6e4586ba7a  more definitions                                     2024-07-08 16:35:28 -07:00
Ashwin Bharambe   722d20c6de  making the API python based with a converter script  2024-07-08 15:01:05 -07:00