From an AI integration perspective, I am hopeful that Cloudflare may be able to improve "performance on the cheap" for Replicate's models a little bit.
Replicate has had multiple ways to deploy with autoscaling, and you can just keep running a model periodically to keep it booted and warm, but that always seemed too expensive for a broke bootstrapper like me, so I avoided it, and model popularity became a big deciding factor. Because of that and the potential for cold boots, I generally avoided it for latency-sensitive things.
I guess there is a limit to what you can do. At some point someone has to spend the money to have the resources stay ready.
But with Cloudflare, theoretically the pool of potential users goes up, and it becomes more likely for someone to have already booted your model.
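To put a rough number on that intuition, here is a back-of-the-envelope sketch. It assumes requests for a model arrive as a Poisson process and that an instance stays warm for a fixed idle timeout after the last request. Both are my simplifying assumptions, and the 5 minute timeout is a made-up placeholder, not a documented Replicate or Cloudflare setting. It also ignores boot time and request duration.

```python
import math


def warm_hit_probability(requests_per_hour: float, idle_timeout_minutes: float) -> float:
    """Chance that a request finds the model already warm.

    Assumes Poisson arrivals: the model is warm if the previous request
    (from anyone) arrived within the idle timeout, which happens with
    probability 1 - exp(-rate * timeout).
    """
    rate_per_minute = requests_per_hour / 60.0
    return 1.0 - math.exp(-rate_per_minute * idle_timeout_minutes)


if __name__ == "__main__":
    # Hypothetical traffic levels; the 5 minute timeout is an assumption.
    for requests_per_hour in (1, 10, 100):
        p = warm_hit_probability(requests_per_hour, idle_timeout_minutes=5.0)
        print(f"{requests_per_hour:>4} req/hour -> warm about {p:.0%} of the time")
```

The point is only the shape of the curve: under these assumptions the warm-hit chance climbs quickly with total traffic, so a bigger shared user pool helps unpopular models the most.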
At the moment I am especially interested in performant and easy ways to run models like "sensefvg/InteractiveOmni-8B" or Qwen 2.5 Omni or models that are even more all in one than that like OpenAI Realtime or Gemini Live.
Now that Ernie 5 launched with (Omni) multimodality built in, I think within six months developers are going to start to expect speech-to-speech capability from major AI lab releases or product lineups. I feel like eventually the spatial-temporal understanding of video models will be merged in too, to make the models understand the world better. But speech in and speech out is closer to being a standard expectation.
Instead of running three models for STT->LLM->TTS, with a bunch of tricks like eager end of turn or speculative decoding that basically mean you run the LLM twice or on two different models, and possibly getting shut down by API rate limits, a speech-to-speech model is a single model that both understands and generates audio as well as text, such as for function calls.
This is probably an annoying comment because I am immediately trying to raise the requirements from every model for cheap to every model for cheap in a low-latency, real-time streaming way. I just happen to have a contract now that has shown me that multimodal, like voice to voice, is much more convenient but also much more expensive, with fewer options.
Replicate has been so awesome though. Within like a day of me requesting InteractiveOmni, lucataco had it up. So another annoying comment, I sure hope he got paid.
I spent a moment wondering why Cloudflare bought an on-prem tooling startup: https://www.replicated.com
I was wondering if/when Cloudflare would jump into doing “more AI”. Of all the paths they could have taken, this seems like a reasonable one. Hope that the dev team made some money/Cloudflare stock from this transaction.
As a Replicate customer, I’m excited about this news. Cloudflare has the strategic incentive to invest in this line of business and it puts Replicate in the hands of a very good product marketing and sales team to get the volume it needs for sustainability.
Great match up of technologies. Congrats to Ben and the team! I had the privilege of working with Replicate when I was at AWS, such a great team.
Are there indicators in Cloudflare's culture or history to suggest that Replicate's strengths (docs, api, design) will remain in the long-term?
I wonder if they'll continue to support and develop Cog [0], which I've found quite useful for local deployments of models (as well as to Replicate's servers).
Very smart move. I assume Workers will get tight integration with the Replicate APIs.
I really like both these companies. Replicate just make it so easy to bundle a bunch of different AIs up into a prototype - but I think they have tonnes of untapped potential that Cloudflare could fulfil.
Replicate is my preferred go-to for image models and easy proofs of concept for language models, so congratulations to them!
Cloudflare is one of those companies, similar to Palantir, where the valuation just makes you scratch your head. But bringing in AI to try and fill out the AI valuation makes some sense, I suppose.
Congrats Ben and team. Looking forward to seeing what you build with Cloudflare's backing!
A very smart move for Cloudflare and a huge missed opportunity for Render [0].
> Replicate’s going to carry on as a distinct brand
Uh huh
Acquisition or acquihire? I don't see a price tag, which seems to imply the latter?
How many players are there in this space? Replicate, RunPod, Modal, Northflank, FAL, ... Who are the big ones? It's pretty crowded, right?
FAL was smart. They ditched the "run any model" to focus on just image and video, and now they dominate that space. They raised a pretty substantial round recently. Though I don't think there's any moat and they'll soon face competition too.
What about these vs. the routers like "Open"Router?
I love witnessing companies getting acquired to form monopolies.
> all that’ll happen is that it’s going to get way better
Every acquisition announcement says something like this. It is almost never true. In fact, the opposite often happens.
Congrats Ben and team!
I think this is Cloudflare's most notable acquisition yet? From Wikipedia it looks like they've previously mainly acquired smaller cybersecurity firms like https://en.wikipedia.org/wiki/Area_1_Security