Krrish Dholakia
3b1685e7c6
feat(health_check.py): more detailed health check calls
2023-12-28 09:12:57 +05:30
Krrish Dholakia
31148922b3
fix(azure.py): raise streaming exceptions
2023-12-27 15:43:13 +05:30
Krrish Dholakia
c9fdbaf898
fix(azure.py,-openai.py): correctly raise errors if streaming calls fail
2023-12-27 15:08:37 +05:30
Krrish Dholakia
c88a8d71f0
fix: fix linting issues
2023-12-27 12:21:31 +05:30
dan
c4dfd9be7c
updated oobabooga to new api and support for embeddings
2023-12-26 19:45:28 -05:00
ishaan-jaff
3f6e6e7f55
(fix) ollama_chat - support function calling + fix for comp
2023-12-26 20:07:55 +05:30
ishaan-jaff
3839213d28
(feat) ollama_chat acompletion without streaming
2023-12-26 20:01:51 +05:30
ishaan-jaff
837ce269ae
(feat) ollama_chat add async stream
2023-12-25 23:45:27 +05:30
ishaan-jaff
916ba9a6b3
(feat) ollama_chat - add streaming support
2023-12-25 23:38:01 +05:30
ishaan-jaff
03de92eec0
(feat) ollama/chat
2023-12-25 23:04:17 +05:30
ishaan-jaff
d85c19394f
(feat) ollama use /api/chat
2023-12-25 14:29:10 +05:30
ishaan-jaff
da4ec6c8b6
(feat) add ollama_chat v0
2023-12-25 14:27:10 +05:30
Krrish Dholakia
4905929de3
refactor: add black formatting
2023-12-25 14:11:20 +05:30
Krrish Dholakia
1262d89ab3
feat(gemini.py): add support for completion calls for gemini-pro (google ai studio)
2023-12-24 09:42:58 +05:30
Krrish Dholakia
eaaad79823
feat(ollama.py): add support for async ollama embeddings
2023-12-23 18:01:25 +05:30
Krish Dholakia
03fd5da5ae
Merge pull request #1203 from Manouchehri/bedrock-cloudflare-ai-gateway-1
...
Add aws_bedrock_runtime_endpoint support
2023-12-23 11:44:04 +05:30
Krish Dholakia
81617534b6
Merge pull request #1213 from neubig/vertex_chat_generate_content
...
Make vertex ai work with generate_content
2023-12-23 11:40:43 +05:30
Krrish Dholakia
eb2d13e2fb
test(test_completion.py-+-test_streaming.py): add ollama endpoint to ci/cd pipeline
2023-12-22 12:21:33 +05:30
Krrish Dholakia
57607f111a
fix(ollama.py): use litellm.request timeout for async call timeout
2023-12-22 11:22:24 +05:30
Graham Neubig
2362544344
Update the request_str
2023-12-21 09:58:06 -05:00
Krrish Dholakia
be68796eba
fix(router.py): add support for async image generation endpoints
2023-12-21 14:38:44 +05:30
Krrish Dholakia
812f9ca1b3
fix(azure.py): correctly raise async exceptions
2023-12-21 12:23:07 +05:30
David Manouchehri
93c4556eb0
Add aws_bedrock_runtime_endpoint support.
2023-12-20 19:31:43 -05:00
Graham Neubig
6e9267ca66
Make vertex_chat work with generate_content
2023-12-20 15:32:44 -05:00
Graham Neubig
482b3b5bc3
Add a default for safety settings in vertex AI
2023-12-20 13:12:50 -05:00
Krrish Dholakia
04bbd0649f
fix(router.py): only do sync image gen fallbacks for now
...
The customhttptransport we use for dall-e-2 only works for sync httpx calls, not async. Will need to spend some time writing the async version
n
2023-12-20 19:10:59 +05:30
Krrish Dholakia
f355e03515
feat(main.py): add async image generation support
2023-12-20 16:58:40 +05:30
Krrish Dholakia
b3962e483f
feat(azure.py): add support for azure image generations endpoint
2023-12-20 16:37:21 +05:30
Krrish Dholakia
f0df28362a
feat(ollama.py): add support for ollama function calling
2023-12-20 14:59:55 +05:30
Graham Neubig
2d15e5384b
Add partial support of vertexai safety settings
2023-12-19 22:26:55 -05:00
ishaan-jaff
9995229b97
(fix) proxy + ollama - raise exception correctly
2023-12-19 18:48:34 +05:30
Krish Dholakia
408f232bd7
Merge branch 'main' into main
2023-12-18 17:54:34 -08:00
Krrish Dholakia
34509d8dda
fix(main.py): return async completion calls
2023-12-18 17:41:54 -08:00
ishaan-jaff
6b272076d7
(feat) openrouter set transforms=[]
default
2023-12-18 09:16:33 +05:30
ishaan-jaff
b15682bc1f
(feat) set default openrouter configs
2023-12-18 08:55:51 +05:30
Joel Eriksson
e214e6ab47
Fix bug when iterating over lines in ollama response
...
async for line in resp.content.iter_any() will return
incomplete lines when the lines are long, and that
results in an exception being thrown by json.loads()
when it tries to parse the incomplete JSON
The default behavior of the stream reader for aiohttp
response objects is to iterate over lines, so just
removing .iter_any() fixes the bug
2023-12-17 20:23:26 +02:00
Krrish Dholakia
a3c7a340a5
fix(ollama.py): fix sync ollama streaming
2023-12-16 21:23:21 -08:00
Krrish Dholakia
13d088b72e
feat(main.py): add support for image generation endpoint
2023-12-16 21:07:29 -08:00
Krrish Dholakia
7c2fad2d57
fix(azure.py): fix azure streaming logging
2023-12-16 18:06:08 -08:00
Krrish Dholakia
3923c389fd
build(Dockerfile): fixing build requirements
2023-12-16 17:52:30 -08:00
Krrish Dholakia
4e828ff541
fix(health.md): add background health check details to docs
2023-12-16 10:31:59 -08:00
ishaan-jaff
5ee6b87f2e
(fix) vertexai - gemini
2023-12-16 22:15:41 +05:30
ishaan-jaff
764f31c970
(feat) add async, async+stream for gemini
2023-12-16 18:58:12 +05:30
ishaan-jaff
efe8b75200
(fix) use litellm.vertex_vision_models
2023-12-16 18:39:40 +05:30
ishaan-jaff
774a725ccb
(feat) add vertex ai gemini-pro-vision
2023-12-16 18:31:03 +05:30
ishaan-jaff
20b5505476
(feat) show POST request for HF embeddings
2023-12-16 13:09:49 +05:30
ishaan-jaff
287633887e
(feat) add ollama/llava
2023-12-16 10:35:27 +05:30
Krrish Dholakia
add153d110
fix(huggingface_restapi.py): add support for additional hf embedding formats
2023-12-15 21:02:41 -08:00
Krrish Dholakia
4791dda66f
feat(proxy_server.py): enable infinite retries on rate limited requests
2023-12-15 20:03:41 -08:00
Krrish Dholakia
e5268fa6bc
fix(router.py): support openai-compatible endpoints
2023-12-15 14:47:54 -08:00