ishaan-jaff
|
739d9e7a78
|
(fix) vertex ai - use usage from response
|
2023-12-29 16:30:25 +05:30 |
|
ishaan-jaff
|
e6a7212d10
|
(fix) counting streaming prompt tokens - azure
|
2023-12-29 16:13:52 +05:30 |
|
ishaan-jaff
|
d1049c8922
|
bump: version 1.16.6 → 1.16.7
|
2023-12-29 16:04:18 +05:30 |
|
ishaan-jaff
|
8c03be59a8
|
(fix) token_counter for tool calling
|
2023-12-29 15:54:03 +05:30 |
|
ishaan-jaff
|
73f60b7315
|
(test) stream chunk builder - azure prompt tokens
|
2023-12-29 15:45:41 +05:30 |
|
ishaan-jaff
|
b1077ebc38
|
(test) test_token_counter_azure
|
2023-12-29 15:37:46 +05:30 |
|
ishaan-jaff
|
037dcbbe10
|
(fix) use openai token counter for azure llms
|
2023-12-29 15:37:46 +05:30 |
|
Krrish Dholakia
|
cbdfae1267
|
fix(router.py): support wait_for for async completion calls
|
2023-12-29 15:27:20 +05:30 |
|
Krrish Dholakia
|
1f76b0e721
|
docs(routing.md): add retry_after to docs
|
2023-12-29 15:22:12 +05:30 |
|
Krrish Dholakia
|
4882325c35
|
feat(router.py): support 'retry_after' param, to set min timeout before retrying a failed request (default 0)
|
2023-12-29 15:18:28 +05:30 |
|
ishaan-jaff
|
4a028d012a
|
(test) token_counter - prompt tokens == tokens from API
|
2023-12-29 15:15:39 +05:30 |
|
ishaan-jaff
|
a300ab9152
|
(feat) azure stream - count correct prompt tokens
|
2023-12-29 15:15:39 +05:30 |
|
Krrish Dholakia
|
1e07f0fce8
|
fix(caching.py): hash the cache key to prevent key too long errors
|
2023-12-29 15:03:33 +05:30 |
|
Krrish Dholakia
|
6e68cd1125
|
docs(load_test.md): add litellm load test script to docs
|
2023-12-29 13:41:44 +05:30 |
|
ishaan-jaff
|
35b29d35b8
|
(fix) pyproject streamlit errors
|
2023-12-29 13:36:10 +05:30 |
|
ishaan-jaff
|
68868f065c
|
(fix) pyproject, streamlit errors
|
2023-12-29 13:36:10 +05:30 |
|
Krrish Dholakia
|
f47bc7dfa0
|
build(model_prices_and_context_window.json): add bedrock/cohere light, embeddings, and dedicated capacity pricing
|
2023-12-29 13:07:37 +05:30 |
|
Krrish Dholakia
|
d70645b5e7
|
build(model_prices_and_context_window.json): add new anthropic bedrock pricing
|
2023-12-29 12:55:23 +05:30 |
|
ishaan-jaff
|
6e76872595
|
bump: version 1.16.5 → 1.16.6
|
2023-12-29 12:48:47 +05:30 |
|
ishaan-jaff
|
3973b9c8e4
|
(feat) cloudflare - add exception mapping
|
2023-12-29 12:31:10 +05:30 |
|
Ishaan Jaff
|
299d5cb787
|
(docs) readme embedding
|
2023-12-29 12:19:57 +05:30 |
|
Ishaan Jaff
|
ee8c430eb0
|
(docs) add voyage ai to readme
|
2023-12-29 12:19:14 +05:30 |
|
Ishaan Jaff
|
d8b03e0d3d
|
(docs) add async embeddings for providers
|
2023-12-29 12:17:10 +05:30 |
|
ishaan-jaff
|
4dc9b85436
|
(docs) add cloudflare ai workers to readme
|
2023-12-29 12:13:20 +05:30 |
|
ishaan-jaff
|
52a9696303
|
(docs) cloudflare
|
2023-12-29 12:10:32 +05:30 |
|
ishaan-jaff
|
243ad31e90
|
(test) async + stream clooudflare
|
2023-12-29 12:03:29 +05:30 |
|
ishaan-jaff
|
ee682be093
|
(feat) add cloudflare streaming
|
2023-12-29 12:01:26 +05:30 |
|
ishaan-jaff
|
a999e80b46
|
(test) async cloudflare
|
2023-12-29 11:50:09 +05:30 |
|
ishaan-jaff
|
dde6bc4fb6
|
(feat) cloudflare - add optional params
|
2023-12-29 11:50:09 +05:30 |
|
Krrish Dholakia
|
3d7b1b9468
|
bump: version 1.16.4 → 1.16.5
|
2023-12-29 11:46:55 +05:30 |
|
Krrish Dholakia
|
e06840b571
|
refactor: move async text completion testing to test_text_completion.py
|
2023-12-29 11:46:40 +05:30 |
|
ishaan-jaff
|
5fc9524a46
|
(test) test cloudflare completion
|
2023-12-29 11:34:58 +05:30 |
|
ishaan-jaff
|
8fcfb7df22
|
(feat) cloudflare ai workers - add completion support
|
2023-12-29 11:34:58 +05:30 |
|
Krrish Dholakia
|
6f2734100f
|
fix(main.py): fix async text completion streaming + add new tests
|
2023-12-29 11:33:42 +05:30 |
|
ishaan-jaff
|
2b8e2bd937
|
(ci/cd) set num retries for HF test
|
2023-12-29 10:52:45 +05:30 |
|
ishaan-jaff
|
367e9913dc
|
(feat) v0 adding cloudflare
|
2023-12-29 09:32:29 +05:30 |
|
ishaan-jaff
|
daf32f3bd4
|
(fix) tg AI cost tracking - zero-one-ai/Yi-34B-Chat
|
2023-12-29 09:14:07 +05:30 |
|
ishaan-jaff
|
d79df3a1e9
|
(fix) together_ai cost tracking
|
2023-12-28 22:11:08 +05:30 |
|
Krrish Dholakia
|
a351211d03
|
docs(users.md): add user rate limits to docs
|
2023-12-28 19:28:32 +05:30 |
|
Krrish Dholakia
|
17d47ea1ed
|
bump: version 1.16.3 → 1.16.4
|
2023-12-28 19:09:44 +05:30 |
|
Krrish Dholakia
|
86403cd14e
|
fix(vertex_ai.py): support function calling for gemini
|
2023-12-28 19:07:04 +05:30 |
|
ishaan-jaff
|
a1484171b5
|
(docs) voyage ai embeddings
|
2023-12-28 17:15:16 +05:30 |
|
ishaan-jaff
|
aa2bd93166
|
(docs) add voyage ai
|
2023-12-28 17:12:58 +05:30 |
|
ishaan-jaff
|
95e6d2fbba
|
(feat) add voyage ai embeddings
|
2023-12-28 17:10:15 +05:30 |
|
Krrish Dholakia
|
cbcf406fd0
|
feat(admin_ui.py): support creating keys on admin ui
|
2023-12-28 16:59:11 +05:30 |
|
ishaan-jaff
|
01f7e85057
|
(docs) add mistral embeddings
|
2023-12-28 16:54:26 +05:30 |
|
ishaan-jaff
|
7f74a0331c
|
(docs) add mistral-embed
|
2023-12-28 16:50:52 +05:30 |
|
ishaan-jaff
|
9777db67e2
|
(feat) add pricing for mistral-embed
|
2023-12-28 16:48:08 +05:30 |
|
ishaan-jaff
|
41d5dceaec
|
(test) mistral-embed
|
2023-12-28 16:42:36 +05:30 |
|
ishaan-jaff
|
78f0c0228b
|
(feat) add mistral api embeddings
|
2023-12-28 16:41:55 +05:30 |
|