Commit graph

3232 commits

Author SHA1 Message Date
ishaan-jaff
44bf51601a (feat) proxy - custom on failure callback 2023-12-06 14:43:47 -08:00
ishaan-jaff
b3f039627e (feat) litellm - add _async_failure_callback 2023-12-06 14:43:47 -08:00
ishaan-jaff
3b17fd3821 (feat) proxy - async_on_fail_logger 2023-12-06 14:43:47 -08:00
Krrish Dholakia
f1c1ec8523 fix(bedrock.py): fix embeddings call 2023-12-06 14:16:00 -08:00
ishaan-jaff
be15cf20b9 (chore) print verbose 2023-12-06 14:14:20 -08:00
ishaan-jaff
e1230627d0 (fix) print statements 2023-12-06 14:11:23 -08:00
ishaan-jaff
0598ab9b63 (fix) proxy /model/new writing to config 2023-12-06 14:11:23 -08:00
Krrish Dholakia
346551da29 fix(proxy_server.py): allow worker config to just be the config filepath 2023-12-06 14:03:25 -08:00
ishaan-jaff
368934d160 (feat) proxy: use async_callback function 2023-12-06 13:51:24 -08:00
Krrish Dholakia
b24c9b4cbf refactor: fix linting 2023-12-06 13:27:40 -08:00
Krrish Dholakia
d962d5d4c0 fix(bedrock.py): adding support for cohere embeddings 2023-12-06 13:25:18 -08:00
ishaan-jaff
cf6ecc03a5 (fix) linting 2023-12-06 13:14:26 -08:00
ishaan-jaff
f3c3a9860a (feat) /v1/model/info 2023-12-06 13:03:29 -08:00
ishaan-jaff
06255c6590 (feat) proxy add ext-embedding-ada-002 as a base model 2023-12-06 12:19:47 -08:00
ishaan-jaff
29fb97f88a (feat) proxy - define model info 2023-12-06 12:06:30 -08:00
Krrish Dholakia
102de97960 refactor: fix linting errors 2023-12-06 11:46:15 -08:00
ishaan-jaff
de58dcc016 (feat) proxy - allow setting cost, context window 2023-12-06 11:42:56 -08:00
Krrish Dholakia
94f065f83c feat(sagemaker.py): support huggingface embedding models 2023-12-06 11:41:38 -08:00
ishaan-jaff
aefa4f36f9 (docs) update yaml with chat/embedding/completion mode 2023-12-06 11:36:16 -08:00
ishaan-jaff
fd86876164 (feat) proxy: add mode in model info 2023-12-06 11:29:59 -08:00
ishaan-jaff
7c77cc3cfa (feat) add mode for config.yaml health checks 2023-12-06 11:16:29 -08:00
ishaan-jaff
4f02b3c161 (fix) print_verbose health check 2023-12-06 11:16:29 -08:00
Krrish Dholakia
f6546076b0 docs(quick_start.md): add docs on calling openai-compatible endpoint on proxy 2023-12-06 11:06:09 -08:00
ishaan-jaff
cc48b35a8d (test) router - read os.environ/ OpenAI 2023-12-06 10:56:27 -08:00
ishaan-jaff
8f47293ce8 (chore) linting fix 2023-12-06 10:48:01 -08:00
ishaan-jaff
1e2a8869a9 (docs) proxy config with azure, openai embedding models 2023-12-06 10:45:07 -08:00
ishaan-jaff
9f4928fae4 (feat) proxy - add health check for embeddings 2023-12-06 10:45:07 -08:00
ishaan-jaff
caf2a6b279 (fix) proxy - move new health check import 2023-12-06 10:13:06 -08:00
ishaan-jaff
01aa8941a5 (test) OTEL / traceloop - waiting for async support 2023-12-06 10:08:37 -08:00
ishaan-jaff
11a8713a50 (test) router - set sync stream client 2023-12-06 10:08:37 -08:00
Ishaan Jaff
a4cf4e7ca9
Merge pull request #1023 from PSU3D0/speedup_health_endpoint
(feat) Speedup health endpoint
2023-12-06 09:52:13 -08:00
ishaan-jaff
bd0579703c (test) router - reading os.environ/ with client 2023-12-06 09:26:21 -08:00
ishaan-jaff
13f9e78799 (fix) router - errors with reading timeout, stream timeout, max retries 2023-12-06 09:19:51 -08:00
ishaan-jaff
527aadd1ab (test) router - reading os.environ/ variables 2023-12-06 09:19:51 -08:00
ishaan-jaff
aab6be654e (fix) router - set read os.environ/ values 2023-12-06 08:59:33 -08:00
Krrish Dholakia
92b2cbcdc5 feat(proxy_server.py): adding /model/delete endpoint 2023-12-05 22:38:38 -08:00
ishaan-jaff
ff028111cf (fix) router len(num_retries) 2023-12-05 22:05:47 -08:00
ishaan-jaff
5e065ebb8f (test) router - explcitly call one deployment 2023-12-05 21:57:00 -08:00
ishaan-jaff
8e6c4c5310 (fix) router - allow users to call a specific_model explicit 2023-12-05 21:57:00 -08:00
Krrish Dholakia
acef6bd58d refactor: linting fixes 2023-12-05 21:43:02 -08:00
Krrish Dholakia
7b83238cb5 fix(router.py): log when a call is retried or fallback happens 2023-12-05 21:29:58 -08:00
Frank Colson
95e5331090 Use litellm logging convention 2023-12-05 22:28:23 -07:00
Frank Colson
fc31221b8a Speedup health endpoint 2023-12-05 22:09:01 -07:00
ishaan-jaff
642c62f7b7 (fix) proxy: better debugging when -debug is on 2023-12-05 18:19:15 -08:00
ishaan-jaff
48aa00d6c0 (fix) proxy - clean up print statement 2023-12-05 18:14:01 -08:00
ishaan-jaff
27d7d7ba9c (feat) proxy cli, better description of config yaml param 2023-12-05 18:11:29 -08:00
ishaan-jaff
56acded998 (router) better debugging using config.yaml 2023-12-05 18:07:27 -08:00
ishaan-jaff
155e99b9a3 (fix) prox cli: remove deprecated param 2023-12-05 18:04:08 -08:00
ishaan-jaff
cb52e3347e (fix) proxy: make yaml load print_verbose 2023-12-05 18:00:00 -08:00
Krrish Dholakia
648d41c96f fix(sagemaker.py): prompt templating fixes 2023-12-05 17:47:44 -08:00