Krrish Dholakia
|
fff0228c20
|
fix(factory.py): support togethercomputer codellama pt
|
2023-12-06 19:02:58 -08:00 |
|
Krrish Dholakia
|
d4c6cc3920
|
test: fix testing
|
2023-12-06 18:52:49 -08:00 |
|
Krrish Dholakia
|
c0eedf28fc
|
test: fix proxy server testing
|
2023-12-06 18:38:53 -08:00 |
|
ishaan-jaff
|
19b1deb200
|
(feat) proxy: protect health endpoint
|
2023-12-06 18:14:54 -08:00 |
|
ishaan-jaff
|
1b43c7120c
|
(test) custom logger - check kwargs passed all the way
|
2023-12-06 18:08:48 -08:00 |
|
Krrish Dholakia
|
0295509b3b
|
fix(factory.py): fix claude 2.1 prompt template to handle system, assistant, user prompt
|
2023-12-06 18:02:06 -08:00 |
|
ishaan-jaff
|
c8c45ef9c4
|
(test) async custom logger
|
2023-12-06 17:53:06 -08:00 |
|
Krrish Dholakia
|
45b4140615
|
test: fix config import for proxy testing
|
2023-12-06 17:40:38 -08:00 |
|
Krrish Dholakia
|
c789865c87
|
fix(langfuse.py): metadata none error
|
2023-12-06 17:24:59 -08:00 |
|
Krrish Dholakia
|
d814184bc3
|
test: fix test imports
|
2023-12-06 17:21:47 -08:00 |
|
Krrish Dholakia
|
d2de7ffe27
|
test(test_proxy_server_langfuse.py): add langfuse proxy testing
|
2023-12-06 17:19:17 -08:00 |
|
ishaan-jaff
|
b60dc20f4b
|
(fix) proxy edit custom logger
|
2023-12-06 17:16:24 -08:00 |
|
ishaan-jaff
|
8adbf35623
|
(feat) add async loggers under custom logger
|
2023-12-06 17:16:24 -08:00 |
|
ishaan-jaff
|
b482b9002c
|
(feat) Custom_logger add async success & async failure
|
2023-12-06 17:16:24 -08:00 |
|
ishaan-jaff
|
1bac052eca
|
(fix) proxy use async logging
|
2023-12-06 17:16:24 -08:00 |
|
ishaan-jaff
|
dfb30d38fa
|
(feat) proxy print set callbacks
|
2023-12-06 17:16:24 -08:00 |
|
Krrish Dholakia
|
58848841e1
|
fix(proxy_server.py): make headers json serializable
|
2023-12-06 17:09:02 -08:00 |
|
Krrish Dholakia
|
51492125b5
|
test: skip flaky test
|
2023-12-06 16:50:02 -08:00 |
|
Krrish Dholakia
|
ad922b205b
|
fix(proxy_server.py): enable rate limiting concurrent user requests
|
2023-12-06 15:11:05 -08:00 |
|
ishaan-jaff
|
44bf51601a
|
(feat) proxy - custom on failure callback
|
2023-12-06 14:43:47 -08:00 |
|
ishaan-jaff
|
b3f039627e
|
(feat) litellm - add _async_failure_callback
|
2023-12-06 14:43:47 -08:00 |
|
ishaan-jaff
|
3b17fd3821
|
(feat) proxy - async_on_fail_logger
|
2023-12-06 14:43:47 -08:00 |
|
Krrish Dholakia
|
f1c1ec8523
|
fix(bedrock.py): fix embeddings call
|
2023-12-06 14:16:00 -08:00 |
|
ishaan-jaff
|
be15cf20b9
|
(chore) print verbose
|
2023-12-06 14:14:20 -08:00 |
|
ishaan-jaff
|
e1230627d0
|
(fix) print statements
|
2023-12-06 14:11:23 -08:00 |
|
ishaan-jaff
|
0598ab9b63
|
(fix) proxy /model/new writing to config
|
2023-12-06 14:11:23 -08:00 |
|
Krrish Dholakia
|
346551da29
|
fix(proxy_server.py): allow worker config to just be the config filepath
|
2023-12-06 14:03:25 -08:00 |
|
ishaan-jaff
|
368934d160
|
(feat) proxy: use async_callback function
|
2023-12-06 13:51:24 -08:00 |
|
Krrish Dholakia
|
b24c9b4cbf
|
refactor: fix linting
|
2023-12-06 13:27:40 -08:00 |
|
Krrish Dholakia
|
d962d5d4c0
|
fix(bedrock.py): adding support for cohere embeddings
|
2023-12-06 13:25:18 -08:00 |
|
ishaan-jaff
|
cf6ecc03a5
|
(fix) linting
|
2023-12-06 13:14:26 -08:00 |
|
ishaan-jaff
|
f3c3a9860a
|
(feat) /v1/model/info
|
2023-12-06 13:03:29 -08:00 |
|
ishaan-jaff
|
06255c6590
|
(feat) proxy add ext-embedding-ada-002 as a base model
|
2023-12-06 12:19:47 -08:00 |
|
ishaan-jaff
|
29fb97f88a
|
(feat) proxy - define model info
|
2023-12-06 12:06:30 -08:00 |
|
Krrish Dholakia
|
102de97960
|
refactor: fix linting errors
|
2023-12-06 11:46:15 -08:00 |
|
ishaan-jaff
|
de58dcc016
|
(feat) proxy - allow setting cost, context window
|
2023-12-06 11:42:56 -08:00 |
|
Krrish Dholakia
|
94f065f83c
|
feat(sagemaker.py): support huggingface embedding models
|
2023-12-06 11:41:38 -08:00 |
|
ishaan-jaff
|
aefa4f36f9
|
(docs) update yaml with chat/embedding/completion mode
|
2023-12-06 11:36:16 -08:00 |
|
ishaan-jaff
|
fd86876164
|
(feat) proxy: add mode in model info
|
2023-12-06 11:29:59 -08:00 |
|
ishaan-jaff
|
7c77cc3cfa
|
(feat) add mode for config.yaml health checks
|
2023-12-06 11:16:29 -08:00 |
|
ishaan-jaff
|
4f02b3c161
|
(fix) print_verbose health check
|
2023-12-06 11:16:29 -08:00 |
|
Krrish Dholakia
|
f6546076b0
|
docs(quick_start.md): add docs on calling openai-compatible endpoint on proxy
|
2023-12-06 11:06:09 -08:00 |
|
ishaan-jaff
|
cc48b35a8d
|
(test) router - read os.environ/ OpenAI
|
2023-12-06 10:56:27 -08:00 |
|
ishaan-jaff
|
8f47293ce8
|
(chore) linting fix
|
2023-12-06 10:48:01 -08:00 |
|
ishaan-jaff
|
1e2a8869a9
|
(docs) proxy config with azure, openai embedding models
|
2023-12-06 10:45:07 -08:00 |
|
ishaan-jaff
|
9f4928fae4
|
(feat) proxy - add health check for embeddings
|
2023-12-06 10:45:07 -08:00 |
|
ishaan-jaff
|
caf2a6b279
|
(fix) proxy - move new health check import
|
2023-12-06 10:13:06 -08:00 |
|
ishaan-jaff
|
01aa8941a5
|
(test) OTEL / traceloop - waiting for async support
|
2023-12-06 10:08:37 -08:00 |
|
ishaan-jaff
|
11a8713a50
|
(test) router - set sync stream client
|
2023-12-06 10:08:37 -08:00 |
|
Ishaan Jaff
|
a4cf4e7ca9
|
Merge pull request #1023 from PSU3D0/speedup_health_endpoint
(feat) Speedup health endpoint
|
2023-12-06 09:52:13 -08:00 |
|