Commit graph

59 commits

Author SHA1 Message Date
Krish Dholakia
7d2a9f9222 Merge pull request #2236 from BerriAI/litellm_faster_usage_logs_pg_db
fix(proxy_server.py): introduces a beta endpoint for admin to view global spend
2024-02-28 17:08:55 -08:00
Krrish Dholakia
129e84f0c5 test: removing aws tests - account suspended - pending their approval 2024-02-28 13:46:20 -08:00
ishaan-jaff
3310bc9ee5 (test) hf catch rate limit errors 2024-02-28 12:02:54 -08:00
Krrish Dholakia
265bc82471 test(test_async_fn.py): handle hf tgi model not being loaded yet 2024-02-15 18:37:32 -08:00
Krrish Dholakia
23c410a548 feat(sagemaker.py): initial commit of working sagemaker with aioboto3 2024-02-12 17:25:57 -08:00
ishaan-jaff
cea0d6c8b0 (fix) litellm.acompletion with type hints 2024-01-11 10:47:12 +05:30
Krrish Dholakia
e4a5a3395c fix(huggingface_restapi.py): support timeouts for huggingface + openai text completions
https://github.com/BerriAI/litellm/issues/1334
2024-01-08 11:40:56 +05:30
Krish Dholakia
8cfae7bb1a Merge pull request #1342 from BerriAI/litellm_dockerfile_updates
build(Dockerfile): moves prisma logic to dockerfile
2024-01-06 16:03:25 +05:30
ishaan-jaff
61ab6e6ab6 (fix) cloudflare tests 2024-01-06 15:35:49 +05:30
Krrish Dholakia
0cb166b9c2 test(test_async_fn.py): skip cloudflare test - flaky 2024-01-06 15:17:42 +05:30
ishaan-jaff
f650f979c6 (fix) retry cloudflare ai workers 3 times 2024-01-05 13:55:47 +05:30
ishaan-jaff
55aa5fdb31 (ci/cd) run cloudflare test 3 retries 2024-01-05 11:55:12 +05:30
Krrish Dholakia
7d55a563ee fix(main.py): don't set timeout as an optional api param 2023-12-30 11:47:07 +05:30
ishaan-jaff
a21f135fff (test) async + stream clooudflare 2023-12-29 12:03:29 +05:30
ishaan-jaff
b7539df9b3 (test) async cloudflare 2023-12-29 11:50:09 +05:30
Krrish Dholakia
9cf43cd5dc refactor: move async text completion testing to test_text_completion.py 2023-12-29 11:46:40 +05:30
Krrish Dholakia
a88f07dc60 fix(main.py): fix async text completion streaming + add new tests 2023-12-29 11:33:42 +05:30
Krrish Dholakia
79978c44ba refactor: add black formatting 2023-12-25 14:11:20 +05:30
Krrish Dholakia
0e1c747f12 fix(router.py): support cloudflare ai gateway for azure models on router 2023-11-30 14:09:06 -08:00
Krrish Dholakia
60d6b6bc37 fix(router.py): fix exponential backoff to use retry-after if present in headers 2023-11-28 17:25:03 -08:00
Krrish Dholakia
e495a8a9c2 fix(main.py): fix acompletion for anyscale, openrouter, deepinfra, perplexity endpoints 2023-11-22 13:22:58 -08:00
Krrish Dholakia
efc2bfe295 fix(utils.py): add response ms for async calls 2023-11-21 19:59:00 -08:00
ishaan-jaff
2f0304f100 (test) async for chunk 2023-11-18 17:17:39 -08:00
ishaan-jaff
30fcc1f9a0 (test) do nothing if token==None in stream chunk 2023-11-18 17:13:16 -08:00
Krrish Dholakia
7ea80de2f1 test(test_async_fn.py): catch timeouts 2023-11-17 18:01:52 -08:00
Krrish Dholakia
7d70bf84a7 test(test_langfuse.py): handle timeouts 2023-11-17 17:05:46 -08:00
Krrish Dholakia
02ed97d0b2 fix(acompletion): support client side timeouts + raise exceptions correctly for async calls 2023-11-17 15:39:47 -08:00
Krrish Dholakia
48a508bab6 feat: global client for sync + async calls (openai + Azure only) 2023-11-16 14:44:13 -08:00
Krrish Dholakia
a6e9f147d3 fix(openai.py): handling extra headers 2023-11-16 12:48:21 -08:00
Krrish Dholakia
e54056f0ed fix(azure.py): use openai client sdk for handling sync+async calling 2023-11-16 12:08:12 -08:00
Krrish Dholakia
f14bd24b46 fix(openai.py): fix linting issues 2023-11-16 11:01:28 -08:00
Krrish Dholakia
9c7cc84eb0 fix(openai.py): supporting openai client sdk for handling sync + async calls (incl. for openai-compatible apis) 2023-11-16 10:35:03 -08:00
Krrish Dholakia
e28b9534e4 fix(huggingface_restapi.py): async implementation 2023-11-15 16:54:15 -08:00
Krrish Dholakia
e35ce15a89 refactor(huggingface_restapi.py): moving async completion + streaming to real async calls 2023-11-15 15:14:21 -08:00
Krrish Dholakia
04ce14e404 fix(utils.py): fix langfuse integration 2023-11-15 14:05:40 -08:00
Krrish Dholakia
29a0c29eb3 fix(utils.py): await async function in client wrapper 2023-11-14 22:07:28 -08:00
Krrish Dholakia
7b4e21e574 refactor(bedrock.py-+-cohere.py): making bedrock and cohere compatible with openai v1 sdk 2023-11-11 17:33:19 -08:00
Krrish Dholakia
c0a757a25f refactor(azure.py): working azure completion calls with openai v1 sdk 2023-11-11 16:44:39 -08:00
Krrish Dholakia
1ec07c0aba refactor(openai.py): working openai chat + text completion for openai v1 sdk 2023-11-11 16:25:10 -08:00
Krrish Dholakia
a5ec85b1f2 refactor(openai.py): making it compatible for openai v1
BREAKING CHANGE:
2023-11-11 15:33:02 -08:00
ishaan-jaff
4319aa415a (test) async_fn with stream 2023-11-10 17:47:15 -08:00
Krrish Dholakia
272a6dc9b0 refactor(azure.py): enabling async streaming with aiohttp 2023-11-09 16:41:06 -08:00
Krrish Dholakia
9b278f567b refactor(openai.py): support aiohttp streaming 2023-11-09 16:15:30 -08:00
Krrish Dholakia
1d46891ceb fix(azure.py): adding support for aiohttp calls on azure + openai 2023-11-09 10:40:33 -08:00
Krrish Dholakia
53b5409fac test(test_async_fn.py): fix import issues 2023-10-30 20:57:24 -07:00
Krrish Dholakia
93b1cd7ab2 test(test_async_fn.py): test logging 2023-10-30 19:59:36 -07:00
Krrish Dholakia
2c274c5ad6 test(test_async_fn.py): addding sync test 2023-10-30 19:38:31 -07:00
Krrish Dholakia
9c9b205a7b test(test_async_fn.py): more logging 2023-10-30 18:39:10 -07:00
Krrish Dholakia
6eb7205aba test(test_async_fn.py): add better error logging 2023-10-30 18:18:11 -07:00
Krrish Dholakia
e55b20b1e4 test(test_async_fn.py): adding additional logging for tests 2023-10-24 12:54:18 -07:00