Commit graph

3353 commits

Author SHA1 Message Date
Keegan McCallum
3b719b2afd
Allow overriding headers for anthropic 2024-01-18 20:12:59 -08:00
Krrish Dholakia
f7694bc193 Merge branch 'main' into litellm_tpm_rpm_rate_limits 2024-01-18 19:10:07 -08:00
ishaan-jaff
5698be0df1 (fix) safe access litellm_params, proxy_server_request 2024-01-18 18:05:51 -08:00
Krrish Dholakia
94ce524c63 test(test_completion.py): handle together ai timeout 2024-01-18 17:54:16 -08:00
ishaan-jaff
16f3d7e0ed (feat) use user_id passed to request - cost track 2024-01-18 17:51:48 -08:00
ishaan-jaff
697c511e76 (feat) support user param for all providers 2024-01-18 17:45:59 -08:00
ishaan-jaff
ddd9ca86a7 (feat) proxy - track cost for user_ids that do not exist 2024-01-18 17:44:39 -08:00
Krrish Dholakia
e0aaa94f28 fix(main.py): read azure ad token from optional params extra body 2024-01-18 17:14:03 -08:00
Krrish Dholakia
1e5efdfa37 fix(proxy_server.py): support setting tpm/rpm limits per user / per key 2024-01-18 17:03:18 -08:00
Ishaan Jaff
a8ba5df90e
Merge pull request #1500 from BerriAI/litellm_create_keys_with_team_id
[Feat] /key/generate - create keys with`team_id`
2024-01-18 16:35:14 -08:00
Jakob
7ba7272a9a add headers to budget manager 2024-01-18 16:10:45 -08:00
Krrish Dholakia
5dac2402ef test(test_parallel_request_limiter.py): unit testing for tpm/rpm rate limits 2024-01-18 15:28:28 -08:00
Krrish Dholakia
22c3321208 test(test_parallel_request_limiter.py): unit testing for tpm/rpm rate limits 2024-01-18 15:25:29 -08:00
ishaan-jaff
340706565f (fix) add team_id to doc string 2024-01-18 15:23:05 -08:00
ishaan-jaff
2b6972111e (feat) write team_id to User Table 2024-01-18 14:42:46 -08:00
ishaan-jaff
5beef6dbcd (test) setting team_id 2024-01-18 14:33:13 -08:00
Ishaan Jaff
a26267851f
Merge pull request #1498 from BerriAI/litellm_spend_tracking_logs
[Feat] Proxy - Add Spend tracking logs
2024-01-18 14:21:51 -08:00
ishaan-jaff
90509a159a (fix) write team_id to key table 2024-01-18 13:54:08 -08:00
Krrish Dholakia
aef59c554f feat(parallel_request_limiter.py): add support for tpm/rpm limits 2024-01-18 13:52:15 -08:00
ishaan-jaff
42ad12b2bd (fix) support team_id for /key/generate 2024-01-18 13:48:52 -08:00
ishaan-jaff
4294657b99 (fix) use get_logging_payload 2024-01-18 13:40:48 -08:00
ishaan-jaff
ea32a8757b (feat) set team_id on virtual_keys 2024-01-18 13:34:51 -08:00
ishaan-jaff
7bdf0a0a63 (chore) cleanup utils.py 2024-01-18 13:21:51 -08:00
ishaan-jaff
73938080f2 (feat) track - api_key in spendLogs 2024-01-18 13:16:25 -08:00
Duarte OC
daa399bc60
adds s3 folder prefix to cache 2024-01-18 21:57:47 +01:00
Krrish Dholakia
1ea3833ef7 fix(parallel_request_limiter.py): decrement count for failed llm calls
https://github.com/BerriAI/litellm/issues/1477
2024-01-18 12:42:14 -08:00
ishaan-jaff
5b54bcc712 (feat) spendLogs table DynamoDB 2024-01-18 12:39:11 -08:00
ishaan-jaff
4eb9221121 (fix) track EmbeddingResponse cost 2024-01-18 12:29:21 -08:00
ishaan-jaff
88cdfedf84 (feat) track cost streaming 2024-01-18 12:21:56 -08:00
ishaan-jaff
b0e18e92b6 (fix) when kwargs==None 2024-01-18 12:05:08 -08:00
ishaan-jaff
d14d36af9a (v0 ) working - writing /chat/completion spend tracking 2024-01-18 11:54:15 -08:00
Krrish Dholakia
c8dd36db9e fix(proxy_server.py): show all models user has access to in /models 2024-01-18 10:56:37 -08:00
ishaan-jaff
4a5f987512 (feat) insert_data to spend table 2024-01-18 10:09:02 -08:00
ishaan-jaff
4821fa9201 (v0) add schema.prisma 2024-01-18 10:04:34 -08:00
Krish Dholakia
658fd4de38
Merge pull request #1495 from puffo/litellm_ollama_chat_fix
fix(ollama_chat.py): use tiktoken as backup for prompt token counting
2024-01-18 10:02:27 -08:00
Ishaan Jaff
143e225194
Merge pull request #1496 from BerriAI/litellm_unit_test_key_endpoints
[Test+Fix] /Key/Info, /Key/Update - Litellm unit test key endpoints
2024-01-18 09:55:30 -08:00
ishaan-jaff
08ee65f894 (test) /key/update, /key/info 2024-01-18 09:35:02 -08:00
ishaan-jaff
fc1eb36f24 (fix) /key/update overwriting metadata 2024-01-18 09:32:56 -08:00
Krrish Dholakia
96122a4f88 fix(proxy/utils.py): fix isoformat to string logic 2024-01-18 09:32:30 -08:00
Ishaan Jaff
7db04afaca
Merge pull request #1494 from duarteocarmo/patch-1
Update s3 cache to support folder
2024-01-18 09:16:19 -08:00
puffo
becff369dc fix(ollama_chat.py): use tiktoken as backup for prompt token counting 2024-01-18 10:47:24 -06:00
Duarte OC
dbadd64395 revert comment 2024-01-18 17:26:38 +01:00
Krrish Dholakia
71034099c9 fix(proxy/utils.py): prisma client fix get data to handle list return 2024-01-18 07:49:13 -08:00
Duarte OC
578256a6a2
Update s3 cache to support folder 2024-01-18 11:38:05 +01:00
ishaan-jaff
85b5395692 (test) use os.environ/ for azure vision enhance 2024-01-17 21:26:47 -08:00
ishaan-jaff
79c412cab5 (feat) set Azure vision enhancement params using os.environ 2024-01-17 21:23:40 -08:00
ishaan-jaff
0414e40d4a (docs) also test gpt-4 vision enhancements 2024-01-17 18:46:41 -08:00
ishaan-jaff
debef7544d (feat) return Azure enahncements used 2024-01-17 18:46:41 -08:00
Krish Dholakia
e9ac001005
Merge pull request #1483 from BerriAI/litellm_model_access_groups_feature
feat(proxy_server.py): support model access groups
2024-01-17 18:16:53 -08:00
Ishaan Jaff
15ae9182db
Merge pull request #1484 from BerriAI/litellm_access_key_metadata_in_callbacks
[Feat] Proxy - Access Key metadata in callbacks
2024-01-17 18:08:08 -08:00