Commit graph

445 commits

Author SHA1 Message Date
ishaan-jaff
79ab1aa35b (fix) undo - model_dump_json() before logging 2024-01-05 11:47:16 +05:30
ishaan-jaff
40b9f1dcb1 (fix) proxy - log response before model_dump_json 2024-01-05 11:00:02 +05:30
ishaan-jaff
234c057e97 (fix) azure+cf gateway, health check 2024-01-04 12:34:07 +05:30
Krrish Dholakia
0f7d03f761 fix(proxy/rules.md): add docs on setting post-call rules on the proxy 2024-01-04 11:16:50 +05:30
ishaan-jaff
d1e8d13c4f (fix) init_bedrock_client 2024-01-01 22:48:56 +05:30
Krrish Dholakia
a6719caebd fix(aimage_generation): fix response type 2023-12-30 12:53:24 +05:30
Krrish Dholakia
750432457b fix(openai.py): fix async image gen call 2023-12-30 12:44:54 +05:30
Krrish Dholakia
c33c1d85bb fix: support dynamic timeouts for openai and azure 2023-12-30 12:14:02 +05:30
Krrish Dholakia
77be3e3114 fix(main.py): don't set timeout as an optional api param 2023-12-30 11:47:07 +05:30
ishaan-jaff
739d9e7a78 (fix) vertex ai - use usage from response 2023-12-29 16:30:25 +05:30
ishaan-jaff
dde6bc4fb6 (feat) cloudflare - add optional params 2023-12-29 11:50:09 +05:30
ishaan-jaff
8fcfb7df22 (feat) cloudflare ai workers - add completion support 2023-12-29 11:34:58 +05:30
ishaan-jaff
367e9913dc (feat) v0 adding cloudflare 2023-12-29 09:32:29 +05:30
ishaan-jaff
d79df3a1e9 (fix) together_ai cost tracking 2023-12-28 22:11:08 +05:30
Krrish Dholakia
86403cd14e fix(vertex_ai.py): support function calling for gemini 2023-12-28 19:07:04 +05:30
Krrish Dholakia
cbcf406fd0 feat(admin_ui.py): support creating keys on admin ui 2023-12-28 16:59:11 +05:30
Krrish Dholakia
c4fc28ab0d fix(utils.py): use local tiktoken copy 2023-12-28 11:22:33 +05:30
Krrish Dholakia
3b1685e7c6 feat(health_check.py): more detailed health check calls 2023-12-28 09:12:57 +05:30
Krrish Dholakia
31148922b3 fix(azure.py): raise streaming exceptions 2023-12-27 15:43:13 +05:30
Krrish Dholakia
c9fdbaf898 fix(azure.py,-openai.py): correctly raise errors if streaming calls fail 2023-12-27 15:08:37 +05:30
Krrish Dholakia
c88a8d71f0 fix: fix linting issues 2023-12-27 12:21:31 +05:30
dan
c4dfd9be7c updated oobabooga to new api and support for embeddings 2023-12-26 19:45:28 -05:00
ishaan-jaff
3f6e6e7f55 (fix) ollama_chat - support function calling + fix for comp 2023-12-26 20:07:55 +05:30
ishaan-jaff
3839213d28 (feat) ollama_chat acompletion without streaming 2023-12-26 20:01:51 +05:30
ishaan-jaff
837ce269ae (feat) ollama_chat add async stream 2023-12-25 23:45:27 +05:30
ishaan-jaff
916ba9a6b3 (feat) ollama_chat - add streaming support 2023-12-25 23:38:01 +05:30
ishaan-jaff
03de92eec0 (feat) ollama/chat 2023-12-25 23:04:17 +05:30
ishaan-jaff
d85c19394f (feat) ollama use /api/chat 2023-12-25 14:29:10 +05:30
ishaan-jaff
da4ec6c8b6 (feat) add ollama_chat v0 2023-12-25 14:27:10 +05:30
Krrish Dholakia
4905929de3 refactor: add black formatting 2023-12-25 14:11:20 +05:30
Krrish Dholakia
1262d89ab3 feat(gemini.py): add support for completion calls for gemini-pro (google ai studio) 2023-12-24 09:42:58 +05:30
Krrish Dholakia
eaaad79823 feat(ollama.py): add support for async ollama embeddings 2023-12-23 18:01:25 +05:30
Krish Dholakia
03fd5da5ae
Merge pull request #1203 from Manouchehri/bedrock-cloudflare-ai-gateway-1
Add aws_bedrock_runtime_endpoint support
2023-12-23 11:44:04 +05:30
Krish Dholakia
81617534b6
Merge pull request #1213 from neubig/vertex_chat_generate_content
Make vertex ai work with generate_content
2023-12-23 11:40:43 +05:30
Krrish Dholakia
eb2d13e2fb test(test_completion.py-+-test_streaming.py): add ollama endpoint to ci/cd pipeline 2023-12-22 12:21:33 +05:30
Krrish Dholakia
57607f111a fix(ollama.py): use litellm.request timeout for async call timeout 2023-12-22 11:22:24 +05:30
Graham Neubig
2362544344
Update the request_str 2023-12-21 09:58:06 -05:00
Krrish Dholakia
be68796eba fix(router.py): add support for async image generation endpoints 2023-12-21 14:38:44 +05:30
Krrish Dholakia
812f9ca1b3 fix(azure.py): correctly raise async exceptions 2023-12-21 12:23:07 +05:30
David Manouchehri
93c4556eb0
Add aws_bedrock_runtime_endpoint support. 2023-12-20 19:31:43 -05:00
Graham Neubig
6e9267ca66 Make vertex_chat work with generate_content 2023-12-20 15:32:44 -05:00
Graham Neubig
482b3b5bc3 Add a default for safety settings in vertex AI 2023-12-20 13:12:50 -05:00
Krrish Dholakia
04bbd0649f fix(router.py): only do sync image gen fallbacks for now
The customhttptransport we use for dall-e-2 only works for sync httpx calls, not async. Will need to spend some time writing the async version

n
2023-12-20 19:10:59 +05:30
Krrish Dholakia
f355e03515 feat(main.py): add async image generation support 2023-12-20 16:58:40 +05:30
Krrish Dholakia
b3962e483f feat(azure.py): add support for azure image generations endpoint 2023-12-20 16:37:21 +05:30
Krrish Dholakia
f0df28362a feat(ollama.py): add support for ollama function calling 2023-12-20 14:59:55 +05:30
Graham Neubig
2d15e5384b Add partial support of vertexai safety settings 2023-12-19 22:26:55 -05:00
ishaan-jaff
9995229b97 (fix) proxy + ollama - raise exception correctly 2023-12-19 18:48:34 +05:30
Krish Dholakia
408f232bd7
Merge branch 'main' into main 2023-12-18 17:54:34 -08:00
Krrish Dholakia
34509d8dda fix(main.py): return async completion calls 2023-12-18 17:41:54 -08:00