Chen Wang
|
ebc889d77a
|
Use max_input_token for trim_messages
|
2024-04-16 13:36:25 +01:00 |
|
Krrish Dholakia
|
40d9167cae
|
test(test_amazing_vertex_completion.py): fix vertex anthropic streaming text
|
2024-04-15 14:37:22 -07:00 |
|
Krrish Dholakia
|
1ec7118e1f
|
fix(vertex_ai_anthropic.py): set vertex_credentials for vertex ai anthropic calls
allows setting vertex credentials as a json string for vertex ai anthropic calls
|
2024-04-15 14:16:28 -07:00 |
|
Krrish Dholakia
|
50081479f9
|
fix(main.py): accept vertex service account credentials as json string
allows us to dynamically set vertex ai credentials
|
2024-04-15 13:28:59 -07:00 |
|
Krish Dholakia
|
72b54eaad7
|
Merge pull request #3028 from BerriAI/litellm_anthropic_text_completion_fix
fix(anthropic_text.py): add support for async text completion calls
|
2024-04-15 09:26:28 -07:00 |
|
Ishaan Jaff
|
c8832489d6
|
test groq tool calling
|
2024-04-15 08:41:53 -07:00 |
|
Ishaan Jaff
|
ea575ef62d
|
fix test groq function call
|
2024-04-15 08:40:39 -07:00 |
|
Krrish Dholakia
|
26286a54b8
|
fix(anthropic_text.py): add support for async text completion calls
|
2024-04-15 08:15:00 -07:00 |
|
Ishaan Jaff
|
017127a704
|
test - groq tool calling
|
2024-04-15 08:13:05 -07:00 |
|
Stefan Dobrev
|
1cfea955aa
|
Fix Anthropic system message handling
|
2024-04-14 16:23:59 +03:00 |
|
Krrish Dholakia
|
bef24cd4ab
|
fix(caching.py): don't decode a string
|
2024-04-13 18:48:03 -07:00 |
|
Krrish Dholakia
|
4e81acf2c6
|
feat(prometheus_services.py): monitor health of proxy adjacent services (redis / postgres / etc.)
|
2024-04-13 18:15:02 -07:00 |
|
Ishaan Jaff
|
5856ec03c6
|
(ci/cd) run again
|
2024-04-12 20:48:26 -07:00 |
|
Ishaan Jaff
|
f617f5ebb5
|
fix - test caching atext_completion
|
2024-04-12 20:37:56 -07:00 |
|
Ishaan Jaff
|
11cd1ec6cf
|
test - atext_completion + caching
|
2024-04-12 12:32:21 -07:00 |
|
Krrish Dholakia
|
77097f8e7d
|
test(test_jwt.py): add testing to make sure user api key auth returns the expected values
|
2024-04-11 20:18:55 -07:00 |
|
Krrish Dholakia
|
84d43484c6
|
fix(router.py): make sure pre call rpm check runs even when model not in model cost map
|
2024-04-11 09:27:46 -07:00 |
|
Krish Dholakia
|
83e7ed94ce
|
Merge pull request #2942 from BerriAI/litellm_fix_router_loading
Router Async Improvements
|
2024-04-10 20:16:53 -07:00 |
|
Ishaan Jaff
|
8bc02b34c2
|
test -base64 cache hits
|
2024-04-10 16:46:56 -07:00 |
|
Krrish Dholakia
|
a47a719caa
|
fix(router.py): generate consistent model id's
having the same id for a deployment, lets redis usage caching work across multiple instances
|
2024-04-10 15:23:57 -07:00 |
|
Krrish Dholakia
|
a311788f0d
|
test(test_completion.py): handle api instability
|
2024-04-09 21:58:48 -07:00 |
|
Krrish Dholakia
|
a6b004f10b
|
test(test_completion.py): change model
|
2024-04-09 21:38:17 -07:00 |
|
Krish Dholakia
|
2fa47cf416
|
Merge pull request #2924 from BerriAI/litellm_text_completion_response
fix(main.py): handle translating text completion openai to chat completion for async requests
|
2024-04-09 21:13:30 -07:00 |
|
Krrish Dholakia
|
8857c9b978
|
test(test_key_generate_prisma.py): add better unit testing for spend logs on proxy server
|
2024-04-09 17:58:18 -07:00 |
|
Krrish Dholakia
|
855e7ed9d2
|
fix(main.py): handle translating text completion openai to chat completion for async requests
also adds testing for this, to prevent future regressions
|
2024-04-09 16:47:49 -07:00 |
|
Krrish Dholakia
|
4495f6c058
|
test(test_key_generate_prisma.py): fix check for raising error if key generate max budget > key upperbound
|
2024-04-09 13:07:57 -07:00 |
|
Krrish Dholakia
|
ef4c85522c
|
test(test_llm_guard.py): fix test
|
2024-04-08 21:55:35 -07:00 |
|
Krrish Dholakia
|
b6cd200676
|
fix(llm_guard.py): enable request-specific llm guard flag
|
2024-04-08 21:15:33 -07:00 |
|
Krrish Dholakia
|
48bfc45cb0
|
fix(utils.py): fix reordering of items for cached embeddings
ensures cached embedding item is returned in correct order
|
2024-04-08 12:18:24 -07:00 |
|
Ishaan Jaff
|
a5aef6ec00
|
Merge pull request #2879 from BerriAI/litellm_async_anthropic_api
[Feat] Async Anthropic API 97.5% lower median latency
|
2024-04-07 09:56:52 -07:00 |
|
Krrish Dholakia
|
6773341a08
|
test(test_alerting.py): remove duplicate test
|
2024-04-06 19:33:01 -07:00 |
|
Ishaan Jaff
|
d51e853b60
|
undo adding next-asyncio
|
2024-04-06 19:28:51 -07:00 |
|
Krish Dholakia
|
83f608dc5d
|
Merge pull request #2880 from BerriAI/litellm_api_base_alerting
feat(proxy/utils.py): return api base for request hanging alerts
|
2024-04-06 19:17:18 -07:00 |
|
Krrish Dholakia
|
6234982fd9
|
test(test_custom_callback_input.py): fix assert
|
2024-04-06 18:48:33 -07:00 |
|
Krrish Dholakia
|
5f910c1811
|
test(test_streaming.py): fix vertex project name
|
2024-04-06 18:45:17 -07:00 |
|
Ishaan Jaff
|
f08486448c
|
fix - test streaming
|
2024-04-06 18:28:07 -07:00 |
|
Ishaan Jaff
|
2622f0351b
|
(ci/cd) run again
|
2024-04-06 18:26:52 -07:00 |
|
Ishaan Jaff
|
4512510d37
|
Revert "fix test streaming"
This reverts commit f10aba1007 .
|
2024-04-06 18:24:00 -07:00 |
|
Ishaan Jaff
|
f10aba1007
|
fix test streaming
|
2024-04-06 18:23:10 -07:00 |
|
Ishaan Jaff
|
9be6b7ec7c
|
ci/cd run again
|
2024-04-06 18:07:41 -07:00 |
|
Krrish Dholakia
|
a6f987e065
|
test(test_amazing_vertex_completion.py): fix project name
|
2024-04-06 17:59:49 -07:00 |
|
Ishaan Jaff
|
548b2b6861
|
test - async claude streaming
|
2024-04-06 17:55:26 -07:00 |
|
Krrish Dholakia
|
6110d32b1c
|
feat(proxy/utils.py): return api base for request hanging alerts
|
2024-04-06 15:58:53 -07:00 |
|
Ishaan Jaff
|
a2c63075ef
|
Merge pull request #2877 from BerriAI/litellm_fix_text_completion
[Feat] Text-Completion-OpenAI - Re-use OpenAI Client
|
2024-04-06 12:15:52 -07:00 |
|
Krrish Dholakia
|
b49e47b634
|
ci(vertex_key.json): switch gcloud accounts
|
2024-04-06 12:14:46 -07:00 |
|
Ishaan Jaff
|
3b622556be
|
test - setting org for openai text completion
|
2024-04-06 11:53:26 -07:00 |
|
Ishaan Jaff
|
a55f3cdace
|
test - router re-use openai client
|
2024-04-06 11:33:17 -07:00 |
|
Krish Dholakia
|
9119858f4a
|
Merge pull request #2798 from CLARKBENHAM/main
add test for rate limits - Router isn't coroutine safe
|
2024-04-06 08:47:40 -07:00 |
|
Ishaan Jaff
|
3d298fc549
|
(test) completion
|
2024-04-05 21:03:04 -07:00 |
|
Ishaan Jaff
|
83580dc00a
|
(ci/cd) run again
|
2024-04-05 20:44:51 -07:00 |
|