Commit graph

1547 commits

Author SHA1 Message Date
ishaan-jaff
08525ce200 (ci/cd) use 3 retries for image generation 2024-01-09 10:07:09 +05:30
ishaan-jaff
9be7e34cb0 (ci/cd) pytest skip slow replicate test 2024-01-09 09:57:06 +05:30
Krrish Dholakia
88d498a54a fix(ollama.py): use tiktoken as backup for prompt token counting 2024-01-09 09:47:18 +05:30
Krrish Dholakia
a5147f9e06 feat(lowest_latency.py): support expanded time window for latency based routing
uses a 1hr avg. of latency for deployments, to determine which to route to

https://github.com/BerriAI/litellm/issues/1361
2024-01-09 09:38:04 +05:30
ishaan-jaff
6263103680 (ci/cd) run again 2024-01-08 22:42:31 +05:30
Krrish Dholakia
8edd3fe651 test(test_proxy_startup.py): fix gunicorn test 2024-01-08 19:55:18 +05:30
Krrish Dholakia
e305dcf0a6 test(test_proxy_startup.py): separate tests 2024-01-08 17:58:37 +05:30
Krrish Dholakia
8f8f961941 fix(proxy_server.py): add support for passing in config file via worker_config directly + testing 2024-01-08 16:47:15 +05:30
Ishaan Jaff
a70626d6e9
Merge pull request #1356 from BerriAI/litellm_improve_proxy_logs
[Feat] Improve Proxy Logging
2024-01-08 14:41:01 +05:30
ishaan-jaff
96e8c2b4cf (test) tg - ai - set max_tokens=5, fast test 2024-01-08 14:03:31 +05:30
ishaan-jaff
bf30e8fdb2 (test) router- verbose logs with fallbacks 2024-01-08 14:00:12 +05:30
Krrish Dholakia
6333fbfe56 fix(main.py): support cost calculation for text completion streaming object 2024-01-08 12:41:43 +05:30
Krrish Dholakia
9b46412279 fix(utils.py): fix logging for text completion streaming 2024-01-08 12:05:28 +05:30
Krrish Dholakia
b1fd0a164b fix(huggingface_restapi.py): support timeouts for huggingface + openai text completions
https://github.com/BerriAI/litellm/issues/1334
2024-01-08 11:40:56 +05:30
Krrish Dholakia
c720870f80 docs(gemini.md,-deploy.md): doc updates 2024-01-08 11:02:12 +05:30
Krish Dholakia
4ea3e778f7
Merge pull request #1315 from spdustin/feature_allow_claude_prefill
Adds "pre-fill" support for Claude
2024-01-08 10:48:15 +05:30
Krrish Dholakia
3469b5b911 fix(utils.py): map optional params for gemini 2024-01-08 07:38:55 +05:30
Krrish Dholakia
79264b0dab fix(gemini.py): better error handling 2024-01-08 07:32:26 +05:30
Krrish Dholakia
888e21e8e7 test(test_google_ai_studio_gemini.py): use an image url that will work on ci/cd 2024-01-06 22:58:37 +05:30
Krrish Dholakia
1507217725 fix(factory.py): more logging around the image loading for gemini 2024-01-06 22:50:44 +05:30
Krrish Dholakia
5fd2f945f3 fix(factory.py): support gemini-pro-vision on google ai studio
https://github.com/BerriAI/litellm/issues/1329
2024-01-06 22:36:22 +05:30
ishaan-jaff
edac4130bb (fix) s3 + os.environ/ cache test 2024-01-06 16:33:29 +05:30
ishaan-jaff
174248fc71 (test) add back test for counting stream completion tokens 2024-01-06 16:08:32 +05:30
Krish Dholakia
8d32f08858
Merge pull request #1342 from BerriAI/litellm_dockerfile_updates
build(Dockerfile): moves prisma logic to dockerfile
2024-01-06 16:03:25 +05:30
ishaan-jaff
f999b63d05 (test) using os.environ/ on cache + proxy 2024-01-06 15:54:50 +05:30
ishaan-jaff
c2b061acb2 (feat) cache+proxy - set os.environ/ on proxy config 2024-01-06 15:54:16 +05:30
ishaan-jaff
0d152b3748 (fix) cloudflare tests 2024-01-06 15:35:49 +05:30
Krrish Dholakia
9375570547 test(test_async_fn.py): skip cloudflare test - flaky 2024-01-06 15:17:42 +05:30
Krrish Dholakia
04c04d62e3 test(test_stream_chunk_builder.py): remove completion assert, the test is for prompt tokens 2024-01-06 14:12:44 +05:30
Krrish Dholakia
5c45e69a5e test(test_proxy_server_keys.py): add logic for connecting/disconnecting from http server 2024-01-06 14:09:10 +05:30
ishaan-jaff
4a076350cc (ci/cd) move to old version of test_proxy_server_keys.py 2024-01-06 13:03:12 +05:30
ishaan-jaff
41bfd43a48 (ci/cd) pin anyio / async dependencies 2024-01-06 12:38:56 +05:30
ishaan-jaff
3bb49447bc (ci/cd) fix event loop bug proxy_test 2024-01-06 12:30:43 +05:30
ishaan-jaff
79fd2380bb (ci/cd) run again 2024-01-06 12:11:31 +05:30
ishaan-jaff
0ebd0653c5 (ci/cd) make prisma tests async 2024-01-06 11:43:23 +05:30
spdustin@gmail.com
6201ab2c21 Update factory (and tests) for Claude 2.1 via Bedrock 2024-01-05 23:32:32 +00:00
spdustin@gmail.com
5d074f5b56 Adds tests and updates docs for Claude "pre-fill" 2024-01-05 22:58:41 +00:00
ishaan-jaff
ae54e6d8b0 (ci/cd) proxy:test_add_new_key 2024-01-05 22:53:03 +05:30
ishaan-jaff
40aaac69cc (ci/cd) add print_verbose for /key/generate 2024-01-05 22:38:46 +05:30
ishaan-jaff
dfdd329ddf (ci/cd) pytest event loop fixture 2024-01-05 22:28:34 +05:30
ishaan-jaff
050c289ed1 (ci/cd) test fixture 2024-01-05 22:15:08 +05:30
ishaan-jaff
6f9d3fc3bc (ci/cd) retry hosted ollama + stream test 3 times 2024-01-05 18:02:20 +05:30
ishaan-jaff
0eb899c087 (test) hosted ollama - retry 3 times 2024-01-05 17:58:59 +05:30
ishaan-jaff
76b2db4492 (ci/cd) run test again 2024-01-05 16:40:56 +05:30
ishaan-jaff
69bac0dbf6 (ci/cd) test proxy - init prisma in test 2024-01-05 16:18:23 +05:30
ishaan-jaff
4679c7b99a (fix) caching use same "created" in response_object 2024-01-05 16:03:56 +05:30
ishaan-jaff
f211009263 (test) openai embedding cost calculation 2024-01-05 15:22:17 +05:30
ishaan-jaff
20256c45ad (fix) retry cloudflare ai workers 3 times 2024-01-05 13:55:47 +05:30
ishaan-jaff
6694975ec3 (test) azure completion_cost 2024-01-05 13:53:08 +05:30
ishaan-jaff
13201edc4b (test) test reading configs on proxy 2024-01-05 13:37:31 +05:30