Commit graph

9111 commits

Author SHA1 Message Date
Ishaan Jaff
a4d90fb42d Merge pull request #2693 from BerriAI/litellm_batch_write_redis_cache
[FEAT] batch write redis cache
2024-03-25 18:47:24 -07:00
Krrish Dholakia
fed6ee1f35 test(test_custom_logger.py): cleanup test 2024-03-25 18:32:12 -07:00
Ishaan Jaff
c898ffe636 (feat) improve cache debugging litellm 2024-03-25 18:26:58 -07:00
Krrish Dholakia
cbf4c95e5f fix(utils.py): persist response id across chunks 2024-03-25 18:20:43 -07:00
Ishaan Jaff
a0601723e5 (test) batch writing to cache 2024-03-25 18:04:04 -07:00
Ishaan Jaff
412a56eea4 (fix) print verbose in batch writing redis 2024-03-25 18:02:31 -07:00
Krrish Dholakia
ecc0cf5d9c fix(utils.py): fix text completion streaming 2024-03-25 16:47:17 -07:00
Ishaan Jaff
853ed0278f Merge branch 'main' into litellm_batch_write_redis_cache 2024-03-25 16:41:29 -07:00
Ishaan Jaff
ec0435bdea (feat) batch write redis cache output 2024-03-25 16:39:47 -07:00
Krrish Dholakia
26dbb76d53 fix(utils.py): ensure last chunk is always empty delta w/ finish reason
makes sure we're openai-compatible with our streaming. Adds stricter tests for this as well
2024-03-25 16:33:41 -07:00
Ishaan Jaff
21b9a0ead3 (fix) locust load test use uuid 2024-03-25 15:36:30 -07:00
Ishaan Jaff
c986842f26 (feat) v0 batch redis cache writes 2024-03-25 15:20:10 -07:00
Krrish Dholakia
c667e437b9 fix(utils.py): allow user to disable streaming logging
fixes event loop issue for litellm.disable_streaming_logging
2024-03-25 14:28:46 -07:00
Krrish Dholakia
a9737adcc0 docs(proxy/caching.md): add ttl param to proxy/caching.md 2024-03-25 13:46:52 -07:00
Krrish Dholakia
fec92767bb fix(caching.py): support default ttl for caching 2024-03-25 13:40:17 -07:00
Krrish Dholakia
fdaeef32c1 bump: version 1.34.1 → 1.34.2 2024-03-25 13:28:25 -07:00
Krish Dholakia
f8d1d41b34 Merge pull request #2687 from BerriAI/litellm_jwt_auth_fixes_2
Litellm jwt auth fixes
2024-03-25 13:27:19 -07:00
Krrish Dholakia
10ecf31827 test(test_jwt.py): add unit tests for jwt auth integration 2024-03-25 13:24:39 -07:00
Krrish Dholakia
6a14909e5e docs(token_auth.md): add renaming jwt scope string to docs 2024-03-25 12:49:44 -07:00
Krish Dholakia
edddfc2520 Merge pull request #2660 from dragosMC91/fix-pip-install-extra-proxy
Fix pip install 'litellm[extra_proxy]'
2024-03-25 12:42:13 -07:00
Krrish Dholakia
30c45b83bf fix(handle_jwt.py): cache public keys
caches jwt public keys - reducing need for making http calls on every request
2024-03-25 12:36:32 -07:00
Ishaan Jaff
033d89373c Merge pull request #2683 from BerriAI/litellm_f_string
(feat) Perf Improvement - stop eagerly evaluating fstring
2024-03-25 12:35:30 -07:00
Krrish Dholakia
72a0f0213d fix(handle_jwt.py): support scopes being a list
allow scopes in jwt to be a list, not just a space-separated string
2024-03-25 12:30:40 -07:00
Krrish Dholakia
fe3a39351f fix(handle_jwt.py): don't require kid to be set 2024-03-25 12:28:16 -07:00
Krrish Dholakia
38bdf31652 fix(handle_jwt.py): allow setting proxy admin role string for jwt auth 2024-03-25 12:20:14 -07:00
Krrish Dholakia
d3e7376e10 fix(handle_jwt.py): enable user to set jwt admin scope string in config 2024-03-25 11:34:49 -07:00
Dragos Campean
cb918a3d4f Merge remote-tracking branch 'upstream/main' into fix-pip-install-extra-proxy 2024-03-25 20:22:17 +02:00
Ishaan Jaff
a6065e4853 (fix) stop using f strings in verbose logger 2024-03-25 10:55:30 -07:00
Krish Dholakia
ee6507dfee Merge pull request #2672 from RoniGurvich/main
`cryptography` and `python-multipart` version bump
2024-03-25 10:54:43 -07:00
Krish Dholakia
d2d6f6d0aa Merge pull request #2685 from maxdeichmann/log-langfuse-instantiation
add langfuse logging
2024-03-25 10:48:16 -07:00
Ishaan Jaff
f0992c2dbd (fix) stop using f strings with logger 2024-03-25 10:47:18 -07:00
Max Deichmann
efa599b0ee push 2024-03-25 17:43:55 +01:00
onukura
6ee8f26746 Fix ollama embedding response 2024-03-25 16:26:49 +00:00
onukura
3423038601 Fix ollama api_base to enable remote url 2024-03-25 16:26:40 +00:00
Ishaan Jaff
2c01457a4b (feat) stop eagerly evaluating fstring 2024-03-25 09:01:42 -07:00
Krrish Dholakia
1970742023 bump: version 1.34.0 → 1.34.1 2024-03-25 08:37:01 -07:00
Krrish Dholakia
552dbb8d04 docs(instructor.md): tutorial on using litellm with instructor 2024-03-25 08:35:11 -07:00
Krrish Dholakia
8821b3d243 feat(main.py): support router.chat.completions.create
allows using router with instructor

https://github.com/BerriAI/litellm/issues/2673
2024-03-25 08:26:28 -07:00
Krrish Dholakia
f886608e93 docs(routing.md): add fallbacks being done in order 2024-03-24 12:13:19 -07:00
Roni Gurvich
3aa818ea8c cryptography and python-multipart version bump 2024-03-24 08:49:36 +00:00
Roni Gurvich
a232bcc8d5 cryptography and python-multipart version bump 2024-03-24 08:44:36 +00:00
Dragos Campean
c248229f19 Merge remote-tracking branch 'upstream/main' into fix-pip-install-extra-proxy 2024-03-24 08:53:54 +02:00
Dragos Campean
d2af93518f Remove dependency which was removed on main 2024-03-24 08:53:43 +02:00
Krrish Dholakia
6750f7dfe8 bump: version 1.33.9 → 1.34.0 2024-03-23 22:35:18 -07:00
Krrish Dholakia
131b7060ee docs(routing.md): add url 2024-03-23 20:03:42 -07:00
Krrish Dholakia
511df58cf6 docs(routing.md): add proxy example to pre-call checks in routing docs 2024-03-23 20:00:50 -07:00
Krish Dholakia
c291b0d2d6 Merge pull request #2666 from BerriAI/litellm_model_fix_v_3
fix(proxy_server.py): fix model info check
2024-03-23 19:39:03 -07:00
Ishaan Jaff
f36827a6d7 Merge pull request #2670 from BerriAI/litellm_docs_best_practices_prod
(docs) best prod practices
2024-03-23 19:38:22 -07:00
Krish Dholakia
c46de42339 Merge pull request #2669 from BerriAI/litellm_router_pre_call_checks
feat(router.py): enable pre-call checks
2024-03-23 19:38:09 -07:00
Ishaan Jaff
010cf5dc55 (docs) prod best perf 2024-03-23 19:36:26 -07:00