11876 commits

Author SHA1 Message Date
Krrish Dholakia
d9ad7c6218 fix(router.py): fix validation error for default fallback 2024-05-15 13:23:00 -07:00
Ishaan Jaff
59e18f23e0 fix - show litellm_debug_info 2024-05-15 13:07:04 -07:00
Ishaan Jaff
74f093bb4b Merge pull request #3653 from BerriAI/litellm_fix_text_completions
[Fix] - Alerting on `/completions` -  don't raise hanging request alert for /completions
2024-05-15 11:28:55 -07:00
Krrish Dholakia
dba713ea43 fix(router.py): add validation for how router fallbacks are setup
prevent user errors
2024-05-15 10:44:16 -07:00
Ishaan Jaff
efca96baf8 Merge pull request #3647 from paul-gauthier/openrouter-gpt-4o
cloned gpt-4o models into openrouter/openai in costs&context.json
2024-05-15 10:33:19 -07:00
Ishaan Jaff
2e81347607 fix - don't raise hanging request alert for /completions 2024-05-15 10:27:02 -07:00
Ishaan Jaff
5177e4408e Merge pull request #3651 from BerriAI/litellm_improve_load_balancing
[Feat] Proxy + router - don't cooldown on 4XX error that are not 429, 408, 401
2024-05-15 10:24:34 -07:00
Ishaan Jaff
f17f0a09d8 feat - router use _is_cooldown_required 2024-05-15 10:03:55 -07:00
Krrish Dholakia
f5d73547c7 fix(lowest_latency.py): allow ttl to be a float 2024-05-15 09:59:21 -07:00
Krrish Dholakia
5dcf3d672c feat(proxy_server.py): new /end_user/info endpoint
get spend for a specific end-user
2024-05-15 09:50:52 -07:00
Ishaan Jaff
ae80148c12 test - router cooldowns 2024-05-15 09:43:30 -07:00
Ishaan Jaff
52f8c39bbf feat - don't cooldown deployment on BadRequestError 2024-05-15 09:03:27 -07:00
Krrish Dholakia
f43da3597d test: fix test 2024-05-15 08:51:40 -07:00
Krrish Dholakia
fffb2427f3 fix(types/init.py): don't import openai assistants types by default 2024-05-15 08:50:31 -07:00
Krrish Dholakia
51a02de4cf refactor(proxy_server.py): update doc string for /user/update 2024-05-15 08:25:14 -07:00
Krrish Dholakia
1840919ebd fix(main.py): testing fix 2024-05-15 08:23:00 -07:00
Krrish Dholakia
1a3b001432 docs(langfuse_integration.md): cleanup docs 2024-05-15 07:37:04 -07:00
Krrish Dholakia
8117af664c fix(huggingface_restapi.py): fix task extraction from model name 2024-05-15 07:28:19 -07:00
Krrish Dholakia
900bb9aba8 test(test_token_counter.py): fix load test 2024-05-15 07:12:43 -07:00
Paul Gauthier
e0152c0b61 cloned gpt-4o models into openrouter/openai 2024-05-15 06:20:51 -07:00
Edwin Jose George
81836ebe5d fix: custom_llm_provider needs to be set before setting timeout 2024-05-15 22:36:15 +09:30
Krrish Dholakia
4ff0703a31 fix(slack_alerting.py): fix timezone utc issue 2024-05-14 22:54:33 -07:00
Krrish Dholakia
b06f989871 refactor(main.py): trigger new build 2024-05-14 22:46:44 -07:00
Krrish Dholakia
e0c1fe91f5 test(test_end_users.py): fix end user test 2024-05-14 22:34:26 -07:00
Krrish Dholakia
8f3bf584be docs(vertex.md): add gemini 1.5 flash to vertex docs 2024-05-14 22:26:56 -07:00
Krrish Dholakia
83ba819602 bump: version 1.37.9 → 1.37.10 2024-05-14 22:17:52 -07:00
Krrish Dholakia
3b5c06747d refactor(main.py): trigger new build 2024-05-14 22:17:40 -07:00
Krrish Dholakia
54587db402 fix(alerting.py): fix datetime comparison logic 2024-05-14 22:10:09 -07:00
Ishaan Jaff
0bac40b0f2 ci/cd run again 2024-05-14 21:53:14 -07:00
Ishaan Jaff
6290de36df (ci/cd) run again 2024-05-14 21:39:09 -07:00
Krrish Dholakia
73b6b5e804 test(test_token_counter.py): fix token counting test 2024-05-14 21:35:28 -07:00
Ishaan Jaff
faa58c7938 (ci/cd) run again 2024-05-14 20:45:07 -07:00
Ishaan Jaff
e7af8d61cd fix check_request_disconnected = None case 2024-05-14 20:38:32 -07:00
Ishaan Jaff
6d1ae5b9c4 (ci/cd) run again 2024-05-14 20:18:12 -07:00
Ishaan Jaff
aaea02dee8 Merge pull request #3640 from BerriAI/litellm_fix_client_side_disconnecting_reqs
[Feat] Proxy - cancel tasks when fast api request is cancelled
2024-05-14 20:14:42 -07:00
Ishaan Jaff
4466982507 feat - cancel tasks when fast api request is cancelled 2024-05-14 19:58:51 -07:00
Krrish Dholakia
0262c480be refactor(main.py): trigger new build 2024-05-14 19:52:23 -07:00
Krrish Dholakia
9eee2f3889 docs(prod.md): add 'disable load_dotenv' tutorial to docs 2024-05-14 19:13:22 -07:00
Krrish Dholakia
1ab4974773 fix: disable 'load_dotenv' for prod environments 2024-05-14 19:09:36 -07:00
Krrish Dholakia
298fd9b25c fix(main.py): ignore model_config param 2024-05-14 19:03:17 -07:00
Krrish Dholakia
6de358cf21 build(model_prices_and_context_window.json): support new gemini 1.5 preview models 2024-05-14 18:40:14 -07:00
Krrish Dholakia
a1dd341ca1 fix(utils.py): default claude-3 to tiktoken (0.8s faster than hf tokenizer) 2024-05-14 18:37:14 -07:00
Krish Dholakia
46a81524ab Merge pull request #3637 from BerriAI/revert-3444-main
Revert "Logfire Integration"
2024-05-14 17:39:34 -07:00
Krish Dholakia
b04a8d878a Revert "Logfire Integration" 2024-05-14 17:38:47 -07:00
Krrish Dholakia
c0d701a51e test(test_config.py): fix linting error 2024-05-14 17:32:31 -07:00
Krrish Dholakia
4e30d7cf5e build(model_prices_and_context_window.json): add gemini 1.5 flash model info 2024-05-14 17:30:31 -07:00
Krrish Dholakia
dd0b4b8644 fix(utils.py): fix pydantic v1 error 2024-05-14 17:17:20 -07:00
Krrish Dholakia
1db1af1154 fix(types): fix typing 2024-05-14 17:09:36 -07:00
Krrish Dholakia
888c53e774 fix(proxy/_types.py): fix linting errors 2024-05-14 17:02:11 -07:00
Krrish Dholakia
ad7e289802 fix(types/router.py): fix python3.8 typing issue 2024-05-14 16:56:07 -07:00