Commit graph

416 commits

Author SHA1 Message Date
Krish Dholakia
18d8edc145
Merge pull request #4461 from t968914/litellm-fix-vertexaibeta
fix: Include vertex_ai_beta in vertex_ai param mapping/Do not use google auth project_id
2024-07-04 15:27:20 -07:00
Krrish Dholakia
3d61a316cb fix(router.py): bump azure default api version
Allows 'tool_choice' to be passed to azure
2024-07-03 12:00:00 -07:00
Krish Dholakia
21d3a28e51
Merge branch 'main' into litellm_support_dynamic_rpm_limiting 2024-07-02 17:51:18 -07:00
Tiger Yu
26630cd263
Merge branch 'main' into litellm-fix-vertexaibeta 2024-07-02 09:49:44 -07:00
Krrish Dholakia
6b529d4e0e fix(dynamic_rate_limiter.py): support setting priority + reserving tpm/rpm 2024-07-01 23:08:54 -07:00
Ishaan Jaff
4b7feb3261 feat - return response headers for async openai requests 2024-07-01 17:01:42 -07:00
Krish Dholakia
58d0330cd7
Merge pull request #4478 from BerriAI/litellm_support_response_schema_param_vertex_ai_old
feat(vertex_httpx.py): support the 'response_schema' param for older vertex ai models
2024-06-29 20:17:39 -07:00
Ishaan Jaff
f9ba3cf668 fix bedrock claude test 2024-06-29 18:46:06 -07:00
Krrish Dholakia
b699d9a8b9 fix(utils.py): support json schema validation 2024-06-29 15:05:52 -07:00
Krrish Dholakia
05dfc63b88 feat(vertex_httpx.py): support the 'response_schema' param for older vertex ai models - pass as prompt (user-controlled)
if 'response_schema' is not supported for vertex model (e.g. gemini-1.5-flash) pass in prompt
2024-06-29 13:25:27 -07:00
Tiger Yu
b0c1d235be Include vertex_ai_beta in vertex_ai param mapping 2024-06-28 10:36:58 -07:00
Krrish Dholakia
d421486a45 fix(token_counter.py): New `get_modified_max_tokens' helper func
Fixes https://github.com/BerriAI/litellm/issues/4439
2024-06-27 15:38:09 -07:00
Ishaan Jaff
d213f81b4c add initial support for volcengine 2024-06-26 16:53:44 -07:00
Ishaan Jaff
aa2e5d6288 add volcengine as provider to litellm 2024-06-26 16:47:23 -07:00
Ishaan Jaff
1cfe03c820 add fireworks ai param mapping 2024-06-26 06:43:18 -07:00
Ishaan Jaff
07829514d1 feat - add param mapping for nvidia nim 2024-06-25 09:13:08 -07:00
Ishaan Jaff
abf919eef2 add nvidia nim to __init__ 2024-06-25 08:53:06 -07:00
Krrish Dholakia
a31a05d45d feat(dynamic_rate_limiter.py): working e2e 2024-06-22 14:41:22 -07:00
Krrish Dholakia
532f24bfb7 refactor: instrument 'dynamic_rate_limiting' callback on proxy 2024-06-22 00:32:29 -07:00
Krrish Dholakia
16941eee43 fix(utils.py): re-integrate separate gemini optional param mapping (google ai studio)
Fixes https://github.com/BerriAI/litellm/issues/4333
2024-06-21 09:01:32 -07:00
Wonseok Lee (Jack)
c4c7d1b367
Merge branch 'main' into feat/friendliai 2024-06-21 10:50:03 +09:00
Krish Dholakia
63216f42b8
Merge branch 'main' into litellm_gemini_refactoring 2024-06-17 19:50:56 -07:00
Ishaan Jaff
4a75e57b6f fix logfire 2024-06-17 19:21:35 -07:00
Krrish Dholakia
3d9ef689e7 fix(vertex_httpx.py): check if model supports system messages before sending separately 2024-06-17 17:32:38 -07:00
Krish Dholakia
a80520004e
Merge branch 'main' into litellm_gemini_refactoring 2024-06-17 17:28:50 -07:00
Ishaan Jaff
fc23399b6f
Merge pull request #4247 from BerriAI/litellm_add_codestral_fim
[Feat] Add Codestral FIM API
2024-06-17 16:35:47 -07:00
Ishaan Jaff
364492297d feat - add fim codestral api 2024-06-17 13:46:03 -07:00
Krrish Dholakia
be66800a98 feat(main.py): initial commit - refactoring google ai studio to just use vertex httpx
Uses the same calling logic for google ai studio/vertex ai. Simplifies logic, gives google ai studio integration all of vertex
ai features.
2024-06-17 13:31:46 -07:00
Ishaan Jaff
ad47fee181 feat add text completion config for mistral text 2024-06-17 12:48:46 -07:00
Ishaan Jaff
3ae05c0404 vo - init commit adding codestral API 2024-06-17 11:05:24 -07:00
Krrish Dholakia
b886812787 fix(__init__.py): add gemini models to all model list
Fixes https://github.com/BerriAI/litellm/issues/4240
2024-06-17 10:54:28 -07:00
Krrish Dholakia
115adc7c30 fix(init.py): fix imports 2024-06-15 11:31:09 -07:00
Krrish Dholakia
4f91205530 refactor(utils.py): refactor Logging to it's own class. Cut down utils.py to <10k lines.
Easier debugging

 Reference: https://github.com/BerriAI/litellm/issues/4206
2024-06-15 10:57:20 -07:00
Krrish Dholakia
6f715b4782 feat(router.py): support content policy fallbacks
Closes https://github.com/BerriAI/litellm/issues/2632
2024-06-14 17:15:44 -07:00
Krrish Dholakia
83d8711f16 feat(__init__.py): allow setting drop_params as an env
Closes https://github.com/BerriAI/litellm/issues/4175
2024-06-13 16:00:14 -07:00
Krish Dholakia
05e21441a6
Merge branch 'main' into litellm_vertex_completion_httpx 2024-06-12 21:19:22 -07:00
Krrish Dholakia
c426d75e91 fix(vertex_httpx.py): add function calling support to httpx route 2024-06-12 21:11:00 -07:00
Ishaan Jaff
e128dc4e1f feat - add azure ai studio models on litellm ui 2024-06-12 20:28:16 -07:00
Krrish Dholakia
3955b058ed fix(vertex_httpx.py): support streaming via httpx client 2024-06-12 19:55:14 -07:00
Wonseok Lee (Jack)
776c75c1e5
Merge branch 'main' into feat/friendliai 2024-06-13 09:59:56 +09:00
Ishaan Jaff
7eeef7ec1f feat - add mistral embedding config 2024-06-12 15:00:00 -07:00
Ishaan Jaff
f09158504b feat - support vertex ai dimensions 2024-06-12 09:29:51 -07:00
Krrish Dholakia
54f9faac79 fix(__init__.py): fix linting error 2024-06-11 18:42:01 -07:00
Krish Dholakia
a53ba9b2fb
Merge pull request #4134 from BerriAI/litellm_azure_ai_route
Azure AI support all models
2024-06-11 18:24:05 -07:00
Krrish Dholakia
6305d2dbcf fix(__init__.py): add 'log_raw_request_response' flag to init 2024-06-11 17:26:03 -07:00
Krrish Dholakia
88e567af2c fix(utils.py): add new 'azure_ai/' route
supports azure's openai compatible api endpoint
2024-06-11 14:06:56 -07:00
Krrish Dholakia
7eae0ff7e3 fix(utils.py): allow user to opt in to raw request logging to langfuse 2024-06-11 13:35:22 -07:00
wslee
18cc703aa2 change friendli_ai -> friendliai 2024-06-11 16:17:30 +09:00
wslee
fe8d59f5eb add friendli_ai provider 2024-06-10 17:27:15 +09:00
Krrish Dholakia
b26c3c7d22 fix(cost_calculator.py): fixes tgai unmapped model pricing
Fixes error where tgai helper function returned None. Enforces stronger type hints, refactors code, adds more unit testing.
2024-06-08 19:43:57 -07:00