Commit graph

5330 commits

Author SHA1 Message Date
Krrish Dholakia
eda65a25e3 test(test_caching.py): fix test_redis_cache_acompletion_stream 2024-03-25 21:36:47 -07:00
Krrish Dholakia
be055f31ee test(test_azure_astreaming_and_function_calling): fix test to handle caching 2024-03-25 19:33:57 -07:00
Krrish Dholakia
f604a6155f fix(utils.py): persist system fingerprint across chunks 2024-03-25 19:24:09 -07:00
Krrish Dholakia
e9096ee922 fix(test_amazing_vertex_completion.py): fix test to check if content is none 2024-03-25 19:11:39 -07:00
Krrish Dholakia
c5bd4d4233 fix(utils.py): log success event for streaming 2024-03-25 19:03:10 -07:00
Krrish Dholakia
fed6ee1f35 test(test_custom_logger.py): cleanup test 2024-03-25 18:32:12 -07:00
Krrish Dholakia
cbf4c95e5f fix(utils.py): persist response id across chunks 2024-03-25 18:20:43 -07:00
Krrish Dholakia
ecc0cf5d9c fix(utils.py): fix text completion streaming 2024-03-25 16:47:17 -07:00
Krrish Dholakia
26dbb76d53 fix(utils.py): ensure last chunk is always empty delta w/ finish reason
makes sure we're openai-compatible with our streaming. Adds stricter tests for this as well
2024-03-25 16:33:41 -07:00
Krrish Dholakia
c667e437b9 fix(utils.py): allow user to disable streaming logging
fixes event loop issue for litellm.disable_streaming_logging
2024-03-25 14:28:46 -07:00
Krrish Dholakia
fec92767bb fix(caching.py): support default ttl for caching 2024-03-25 13:40:17 -07:00
Krish Dholakia
f8d1d41b34 Merge pull request #2687 from BerriAI/litellm_jwt_auth_fixes_2
Litellm jwt auth fixes
2024-03-25 13:27:19 -07:00
Krrish Dholakia
10ecf31827 test(test_jwt.py): add unit tests for jwt auth integration 2024-03-25 13:24:39 -07:00
Krrish Dholakia
30c45b83bf fix(handle_jwt.py): cache public keys
caches jwt public keys - reducing need for making http calls on every request
2024-03-25 12:36:32 -07:00
Ishaan Jaff
033d89373c Merge pull request #2683 from BerriAI/litellm_f_string
(feat) Perf Improvement - stop eagerly evaluating fstring
2024-03-25 12:35:30 -07:00
Krrish Dholakia
72a0f0213d fix(handle_jwt.py): support scopes being a list
allow scopes in jwt to be a list, not just a space-separated string
2024-03-25 12:30:40 -07:00
Krrish Dholakia
fe3a39351f fix(handle_jwt.py): don't require kid to be set 2024-03-25 12:28:16 -07:00
Krrish Dholakia
38bdf31652 fix(handle_jwt.py): allow setting proxy admin role string for jwt auth 2024-03-25 12:20:14 -07:00
Krrish Dholakia
d3e7376e10 fix(handle_jwt.py): enable user to set jwt admin scope string in config 2024-03-25 11:34:49 -07:00
Ishaan Jaff
a6065e4853 (fix) stop using f strings in verbose logger 2024-03-25 10:55:30 -07:00
Ishaan Jaff
f0992c2dbd (fix) stop using f strings with logger 2024-03-25 10:47:18 -07:00
Max Deichmann
efa599b0ee push 2024-03-25 17:43:55 +01:00
Ishaan Jaff
2c01457a4b (feat) stop eagerly evaluating fstring 2024-03-25 09:01:42 -07:00
Krrish Dholakia
8821b3d243 feat(main.py): support router.chat.completions.create
allows using router with instructor

https://github.com/BerriAI/litellm/issues/2673
2024-03-25 08:26:28 -07:00
Krish Dholakia
c291b0d2d6 Merge pull request #2666 from BerriAI/litellm_model_fix_v_3
fix(proxy_server.py): fix model info check
2024-03-23 19:39:03 -07:00
Krish Dholakia
c46de42339 Merge pull request #2669 from BerriAI/litellm_router_pre_call_checks
feat(router.py): enable pre-call checks
2024-03-23 19:38:09 -07:00
Krrish Dholakia
8c6402b02d docs(routing.md): add pre-call checks to docs 2024-03-23 19:10:34 -07:00
Krrish Dholakia
292cdd81e4 fix(router.py): fix pre call check logic 2024-03-23 18:56:08 -07:00
Ishaan Jaff
cd650bf4b2 Merge pull request #2668 from BerriAI/litellm_update_deploy_docs
[Docs] Add Docs on deploying to EKS Cluster + K8
2024-03-23 18:42:36 -07:00
Krrish Dholakia
4e70a3e09a feat(router.py): enable pre-call checks
filter models outside of context window limits of a given message for a model group

 https://github.com/BerriAI/litellm/issues/872
2024-03-23 18:03:30 -07:00
Ishaan Jaff
0aaa5fcfe6 (fix) update locustfile 2024-03-23 17:59:54 -07:00
Ishaan Jaff
40f00cac0d (feat) update config.yaml 2024-03-23 17:59:16 -07:00
Krrish Dholakia
47b2b0df17 fix(bedrock.py): fix supported openai params for bedrock claude 3 2024-03-23 16:02:15 -07:00
Krrish Dholakia
ed3681d9bd fix(proxy_server.py): fix model info check 2024-03-23 15:59:17 -07:00
Krish Dholakia
6c3ef60c93 Merge pull request #2659 from vivek-athina/athina-doc-updates-and-a-minor-fix
Athina docs updated with information about additional fields and a minor fix in the callback
2024-03-23 15:54:32 -07:00
Krrish Dholakia
4e61fbf7d7 test(test_completion.py): make default claude 3 test message multi-turn 2024-03-23 14:34:42 -07:00
Krrish Dholakia
3d9a883190 build(test_python_38.py): add testing for litellm cli import 2024-03-23 10:42:19 -07:00
Krrish Dholakia
2e88a22d63 test(test_completion.py): fix claude multi-turn conversation test 2024-03-23 00:56:41 -07:00
Krrish Dholakia
44c0a7f63e fix(factory.py): fix anthropic check 2024-03-23 00:27:24 -07:00
Vivek Aditya
6f81cc97a0 Athina docs updated with information about additional fields and a minor fix in the callback 2024-03-23 12:42:07 +05:30
Ishaan Jaff
9ace9183f7 Merge pull request #2657 from BerriAI/litellm_improve_perf
(feat) remove litellm.telemetry - improve perf on EC2 tiny machines by 90%
2024-03-22 22:15:41 -07:00
Ishaan Jaff
07067db5a1 (feat) remove litellm.telemetry 2024-03-22 20:58:14 -07:00
Krrish Dholakia
16368fec98 fix(anthropic.py): support async claude 3 tool calling + streaming
https://github.com/BerriAI/litellm/issues/2644
2024-03-22 19:57:01 -07:00
Ishaan Jaff
c6ffd456ff (fix) add some better load testing 2024-03-22 19:48:54 -07:00
Ishaan Jaff
9bd53cec6a (fix) update load test used 2024-03-22 19:48:54 -07:00
Krish Dholakia
08bfb0950a Merge pull request #2650 from BerriAI/litellm_jwt_auth_fixes
feat(handle_jwt.py): enable jwt-project based auth
2024-03-22 19:32:46 -07:00
Krrish Dholakia
4dad400b57 fix(anthropic.py): handle multiple system prompts 2024-03-22 18:14:15 -07:00
Krrish Dholakia
d275c1183e fix(proxy_server.py): enable jwt-auth for users
allow a user to auth into the proxy via jwt's and call allowed routes
2024-03-22 17:08:10 -07:00
Krrish Dholakia
6eb0183776 fix(handle_jwt.py): add more logging for jwt header 2024-03-22 16:33:32 -07:00
Krrish Dholakia
4d539295de fix(ollama_chat.py): don't pop from dictionary while iterating through it 2024-03-22 08:18:22 -07:00