Krrish Dholakia
|
18e3cf8bff
|
fix(utils.py): support azure mistral function calling
|
2024-04-17 19:10:26 -07:00 |
|
Krrish Dholakia
|
0d2b400e91
|
test(test_function_calling.py): handle for when model returns a text response
|
2024-04-17 18:32:34 -07:00 |
|
Krrish Dholakia
|
fabc398acd
|
test(test_function_calling.py): fix test to check if tool call in response
|
2024-04-17 18:28:59 -07:00 |
|
Krrish Dholakia
|
c613a47c66
|
test(test_max_tpm_rpm_limiter.py): fix test to use hashed api key in cache check
|
2024-04-17 18:10:18 -07:00 |
|
Krrish Dholakia
|
15ae7a8314
|
fix(utils.py): fix streaming special character flushing logic
|
2024-04-17 18:03:40 -07:00 |
|
Krrish Dholakia
|
7d0086d742
|
fix(utils.py): ensure streaming output parsing only applied for hf / sagemaker models
selectively applies the <s> </s> checking
|
2024-04-17 17:43:41 -07:00 |
|
Krrish Dholakia
|
473e667bdf
|
test: refactor testing to handle the hash token fix
|
2024-04-17 17:31:39 -07:00 |
|
Krrish Dholakia
|
bafb008b44
|
fix(_types.py): hash api key, if given
fixes https://github.com/BerriAI/litellm/issues/2114
|
2024-04-17 16:43:35 -07:00 |
|
Krrish Dholakia
|
53df916f69
|
fix(utils.py): accept {custom_llm_provider}/{model_name} in get_model_info
fixes https://github.com/BerriAI/litellm/issues/3100
|
2024-04-17 16:38:53 -07:00 |
|
Ishaan Jaff
|
409bd5b4ab
|
ci/cd run again
|
2024-04-17 08:01:39 -07:00 |
|
Ishaan Jaff
|
5e955fb52c
|
fix - azure test - don't pop api key
|
2024-04-17 07:54:16 -07:00 |
|
Krish Dholakia
|
8febe2f573
|
Merge pull request #3062 from cwang/cwang/trim-messages-fix
Use `max_input_token` for `trim_messages`
|
2024-04-16 22:29:45 -07:00 |
|
Krrish Dholakia
|
f68a6b6e13
|
test(test_amazing_vertex_completion.py): handle api errors
|
2024-04-16 21:47:02 -07:00 |
|
Ishaan Jaff
|
70f1dc2bb9
|
(ci/cd) run again
|
2024-04-16 21:44:11 -07:00 |
|
Ishaan Jaff
|
860d97c342
|
fix test_key_name_null
|
2024-04-16 21:37:31 -07:00 |
|
Ishaan Jaff
|
5393930701
|
fix function calling prompt - ask llm to respond in fahrenheit
|
2024-04-16 21:09:53 -07:00 |
|
Ishaan Jaff
|
6498bbbd09
|
fix - gemini function calling
|
2024-04-16 20:54:22 -07:00 |
|
Ishaan Jaff
|
9db529b2f8
|
test vertex ai exception mapping
|
2024-04-16 20:00:32 -07:00 |
|
Ishaan Jaff
|
7e99854d05
|
Merge pull request #3079 from BerriAI/litellm_router_save_settings_ui
UI - Save / Edit router settings UI
|
2024-04-16 16:57:42 -07:00 |
|
Krrish Dholakia
|
13cd252f3e
|
fix(proxy_server.py): ensure id used in delete deployment matches id used in litellm Router
|
2024-04-16 15:17:18 -07:00 |
|
Ishaan Jaff
|
e271ce8030
|
router - get settings
|
2024-04-16 14:22:54 -07:00 |
|
Ishaan Jaff
|
2ad2d6cf9f
|
return correct response from user/new
|
2024-04-16 08:17:45 -07:00 |
|
Krish Dholakia
|
2837b6b2f3
|
Merge pull request #3019 from ligaz/fix-anthropic-system-msg-handling
Fix Anthropic system message handling
|
2024-04-16 07:18:47 -07:00 |
|
Chen Wang
|
ebc889d77a
|
Use max_input_token for trim_messages
|
2024-04-16 13:36:25 +01:00 |
|
Krrish Dholakia
|
40d9167cae
|
test(test_amazing_vertex_completion.py): fix vertex anthropic streaming text
|
2024-04-15 14:37:22 -07:00 |
|
Krrish Dholakia
|
1ec7118e1f
|
fix(vertex_ai_anthropic.py): set vertex_credentials for vertex ai anthropic calls
allows setting vertex credentials as a json string for vertex ai anthropic calls
|
2024-04-15 14:16:28 -07:00 |
|
Krrish Dholakia
|
50081479f9
|
fix(main.py): accept vertex service account credentials as json string
allows us to dynamically set vertex ai credentials
|
2024-04-15 13:28:59 -07:00 |
|
Krish Dholakia
|
72b54eaad7
|
Merge pull request #3028 from BerriAI/litellm_anthropic_text_completion_fix
fix(anthropic_text.py): add support for async text completion calls
|
2024-04-15 09:26:28 -07:00 |
|
Ishaan Jaff
|
c8832489d6
|
test groq tool calling
|
2024-04-15 08:41:53 -07:00 |
|
Ishaan Jaff
|
ea575ef62d
|
fix test groq function call
|
2024-04-15 08:40:39 -07:00 |
|
Krrish Dholakia
|
26286a54b8
|
fix(anthropic_text.py): add support for async text completion calls
|
2024-04-15 08:15:00 -07:00 |
|
Ishaan Jaff
|
017127a704
|
test - groq tool calling
|
2024-04-15 08:13:05 -07:00 |
|
Stefan Dobrev
|
1cfea955aa
|
Fix Anthropic system message handling
|
2024-04-14 16:23:59 +03:00 |
|
Krrish Dholakia
|
bef24cd4ab
|
fix(caching.py): don't decode a string
|
2024-04-13 18:48:03 -07:00 |
|
Krrish Dholakia
|
4e81acf2c6
|
feat(prometheus_services.py): monitor health of proxy adjacent services (redis / postgres / etc.)
|
2024-04-13 18:15:02 -07:00 |
|
Ishaan Jaff
|
5856ec03c6
|
(ci/cd) run again
|
2024-04-12 20:48:26 -07:00 |
|
Ishaan Jaff
|
f617f5ebb5
|
fix - test caching atext_completion
|
2024-04-12 20:37:56 -07:00 |
|
Ishaan Jaff
|
11cd1ec6cf
|
test - atext_completion + caching
|
2024-04-12 12:32:21 -07:00 |
|
Krrish Dholakia
|
77097f8e7d
|
test(test_jwt.py): add testing to make sure user api key auth returns the expected values
|
2024-04-11 20:18:55 -07:00 |
|
Krrish Dholakia
|
84d43484c6
|
fix(router.py): make sure pre call rpm check runs even when model not in model cost map
|
2024-04-11 09:27:46 -07:00 |
|
Krish Dholakia
|
83e7ed94ce
|
Merge pull request #2942 from BerriAI/litellm_fix_router_loading
Router Async Improvements
|
2024-04-10 20:16:53 -07:00 |
|
Ishaan Jaff
|
8bc02b34c2
|
test - base64 cache hits
|
2024-04-10 16:46:56 -07:00 |
|
Krrish Dholakia
|
a47a719caa
|
fix(router.py): generate consistent model id's
having the same id for a deployment lets redis usage caching work across multiple instances
|
2024-04-10 15:23:57 -07:00 |
|
Krrish Dholakia
|
a311788f0d
|
test(test_completion.py): handle api instability
|
2024-04-09 21:58:48 -07:00 |
|
Krrish Dholakia
|
a6b004f10b
|
test(test_completion.py): change model
|
2024-04-09 21:38:17 -07:00 |
|
Krish Dholakia
|
2fa47cf416
|
Merge pull request #2924 from BerriAI/litellm_text_completion_response
fix(main.py): handle translating text completion openai to chat completion for async requests
|
2024-04-09 21:13:30 -07:00 |
|
Krrish Dholakia
|
8857c9b978
|
test(test_key_generate_prisma.py): add better unit testing for spend logs on proxy server
|
2024-04-09 17:58:18 -07:00 |
|
Krrish Dholakia
|
855e7ed9d2
|
fix(main.py): handle translating text completion openai to chat completion for async requests
also adds testing for this, to prevent future regressions
|
2024-04-09 16:47:49 -07:00 |
|
Krrish Dholakia
|
4495f6c058
|
test(test_key_generate_prisma.py): fix check for raising error if key generate max budget > key upperbound
|
2024-04-09 13:07:57 -07:00 |
|
Krrish Dholakia
|
ef4c85522c
|
test(test_llm_guard.py): fix test
|
2024-04-08 21:55:35 -07:00 |
|