Commit graph

2248 commits

Author SHA1 Message Date
Krrish Dholakia
fff0228c20 fix(factory.py): support togethercomputer codellama pt 2023-12-06 19:02:58 -08:00
Krrish Dholakia
d4c6cc3920 test: fix testing 2023-12-06 18:52:49 -08:00
Krrish Dholakia
c0eedf28fc test: fix proxy server testing 2023-12-06 18:38:53 -08:00
ishaan-jaff
19b1deb200 (feat) proxy: protect health endpoint 2023-12-06 18:14:54 -08:00
ishaan-jaff
1b43c7120c (test) custom logger - check kwargs passed all the way 2023-12-06 18:08:48 -08:00
Krrish Dholakia
0295509b3b fix(factory.py): fix claude 2.1 prompt template to handle system, assistant, user prompt 2023-12-06 18:02:06 -08:00
ishaan-jaff
c8c45ef9c4 (test) async custom logger 2023-12-06 17:53:06 -08:00
Krrish Dholakia
45b4140615 test: fix config import for proxy testing 2023-12-06 17:40:38 -08:00
Krrish Dholakia
c789865c87 fix(langfuse.py): metadata none error 2023-12-06 17:24:59 -08:00
Krrish Dholakia
d814184bc3 test: fix test imports 2023-12-06 17:21:47 -08:00
Krrish Dholakia
d2de7ffe27 test(test_proxy_server_langfuse.py): add langfuse proxy testing 2023-12-06 17:19:17 -08:00
ishaan-jaff
b60dc20f4b (fix) proxy edit custom logger 2023-12-06 17:16:24 -08:00
ishaan-jaff
8adbf35623 (feat) add async loggers under custom logger 2023-12-06 17:16:24 -08:00
ishaan-jaff
b482b9002c (feat) Custom_logger add async success & async failure 2023-12-06 17:16:24 -08:00
ishaan-jaff
1bac052eca (fix) proxy use async logging 2023-12-06 17:16:24 -08:00
ishaan-jaff
dfb30d38fa (feat) proxy print set callbacks 2023-12-06 17:16:24 -08:00
Krrish Dholakia
58848841e1 fix(proxy_server.py): make headers json serializable 2023-12-06 17:09:02 -08:00
Krrish Dholakia
51492125b5 test: skip flaky test 2023-12-06 16:50:02 -08:00
Krrish Dholakia
ad922b205b fix(proxy_server.py): enable rate limiting concurrent user requests 2023-12-06 15:11:05 -08:00
ishaan-jaff
44bf51601a (feat) proxy - custom on failure callback 2023-12-06 14:43:47 -08:00
ishaan-jaff
b3f039627e (feat) litellm - add _async_failure_callback 2023-12-06 14:43:47 -08:00
ishaan-jaff
3b17fd3821 (feat) proxy - async_on_fail_logger 2023-12-06 14:43:47 -08:00
Krrish Dholakia
f1c1ec8523 fix(bedrock.py): fix embeddings call 2023-12-06 14:16:00 -08:00
ishaan-jaff
be15cf20b9 (chore) print verbose 2023-12-06 14:14:20 -08:00
ishaan-jaff
e1230627d0 (fix) print statements 2023-12-06 14:11:23 -08:00
ishaan-jaff
0598ab9b63 (fix) proxy /model/new writing to config 2023-12-06 14:11:23 -08:00
Krrish Dholakia
346551da29 fix(proxy_server.py): allow worker config to just be the config filepath 2023-12-06 14:03:25 -08:00
ishaan-jaff
368934d160 (feat) proxy: use async_callback function 2023-12-06 13:51:24 -08:00
Krrish Dholakia
b24c9b4cbf refactor: fix linting 2023-12-06 13:27:40 -08:00
Krrish Dholakia
d962d5d4c0 fix(bedrock.py): adding support for cohere embeddings 2023-12-06 13:25:18 -08:00
ishaan-jaff
cf6ecc03a5 (fix) linting 2023-12-06 13:14:26 -08:00
ishaan-jaff
f3c3a9860a (feat) /v1/model/info 2023-12-06 13:03:29 -08:00
ishaan-jaff
06255c6590 (feat) proxy add ext-embedding-ada-002 as a base model 2023-12-06 12:19:47 -08:00
ishaan-jaff
29fb97f88a (feat) proxy - define model info 2023-12-06 12:06:30 -08:00
Krrish Dholakia
102de97960 refactor: fix linting errors 2023-12-06 11:46:15 -08:00
ishaan-jaff
de58dcc016 (feat) proxy - allow setting cost, context window 2023-12-06 11:42:56 -08:00
Krrish Dholakia
94f065f83c feat(sagemaker.py): support huggingface embedding models 2023-12-06 11:41:38 -08:00
ishaan-jaff
aefa4f36f9 (docs) update yaml with chat/embedding/completion mode 2023-12-06 11:36:16 -08:00
ishaan-jaff
fd86876164 (feat) proxy: add mode in model info 2023-12-06 11:29:59 -08:00
ishaan-jaff
7c77cc3cfa (feat) add mode for config.yaml health checks 2023-12-06 11:16:29 -08:00
ishaan-jaff
4f02b3c161 (fix) print_verbose health check 2023-12-06 11:16:29 -08:00
Krrish Dholakia
f6546076b0 docs(quick_start.md): add docs on calling openai-compatible endpoint on proxy 2023-12-06 11:06:09 -08:00
ishaan-jaff
cc48b35a8d (test) router - read os.environ/ OpenAI 2023-12-06 10:56:27 -08:00
ishaan-jaff
8f47293ce8 (chore) linting fix 2023-12-06 10:48:01 -08:00
ishaan-jaff
1e2a8869a9 (docs) proxy config with azure, openai embedding models 2023-12-06 10:45:07 -08:00
ishaan-jaff
9f4928fae4 (feat) proxy - add health check for embeddings 2023-12-06 10:45:07 -08:00
ishaan-jaff
caf2a6b279 (fix) proxy - move new health check import 2023-12-06 10:13:06 -08:00
ishaan-jaff
01aa8941a5 (test) OTEL / traceloop - waiting for async support 2023-12-06 10:08:37 -08:00
ishaan-jaff
11a8713a50 (test) router - set sync stream client 2023-12-06 10:08:37 -08:00
Ishaan Jaff
a4cf4e7ca9
Merge pull request #1023 from PSU3D0/speedup_health_endpoint
(feat) Speedup health endpoint
2023-12-06 09:52:13 -08:00