Why Developers Should Try Predictive AI


Everyone’s experimenting with generative artificial intelligence (Gen AI), but developers should consider incorporating other forms of machine learning into their applications, according to Rita Kozlov, vice president of product for Cloudflare’s developer platform and AI.

“Everyone is experimenting, and I think that gives a sense that there’s more happening in production, in really meaty use cases, than there actually are,” Kozlov said. “What we are seeing is there are a lot of chatbots that are being thrown into the forefront of the user experience.”

Alternatives to AI Chatbots

That’s not necessarily the right approach, she added. Pushing chatbots just to use generative AI can make it seem like a company is “just checking the AI checkbox” without regard for whether the feature is actually helpful to users, she said.

Predictive AI is another option developers should explore, she suggested. Predictive AI uses machine learning algorithms to identify patterns in past events and make predictions about future events. It’s used for tasks such as fraud detection, credit risk assessment, demand forecasting, and even disease diagnosis.

For instance, a calendar app could use machine learning to help someone find available appointment times.

“You don’t necessarily need to lean into generative AI for that,” Kozlov said, adding, “Obviously, there are a lot of use cases where generative AI is extremely useful to people today.”

Predictive AI relies on inference, which is the act of using a trained model to generate predictions on new data. Running inference requires fewer resources than running training workloads, she said.
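To make the distinction concrete, here is a minimal sketch of training versus inference for a toy fraud detector, one of the predictive tasks mentioned above. It assumes a hand-rolled logistic regression and invented transaction data; the feature names, learning rate, and epoch count are illustrative choices, not anything Kozlov or Cloudflare described.

```python
import math

def predict(weights, bias, features):
    """Inference: a single forward pass through an already-trained model."""
    z = bias + sum(w * x for w, x in zip(weights, features))
    return 1.0 / (1.0 + math.exp(-z))

def train(samples, labels, epochs=200, lr=0.1):
    """Training: repeated gradient-descent passes over the whole dataset."""
    weights = [0.0] * len(samples[0])
    bias = 0.0
    forward_passes = 0
    for _ in range(epochs):
        for features, label in zip(samples, labels):
            error = predict(weights, bias, features) - label
            forward_passes += 1
            weights = [w - lr * error * x for w, x in zip(weights, features)]
            bias -= lr * error
    return weights, bias, forward_passes

# Hypothetical past events: [amount in thousands, is_foreign]; label 1 means fraud.
samples = [[0.1, 0], [0.2, 0], [0.3, 1], [2.5, 1], [3.0, 1], [2.8, 0]]
labels = [0, 0, 0, 1, 1, 1]

weights, bias, training_passes = train(samples, labels)

# Scoring one new transaction costs a single forward pass.
fraud_score = predict(weights, bias, [2.9, 1])
```

In this sketch, training performs 1,200 forward passes plus the same number of weight updates, while scoring a new transaction takes one pass. Real models differ enormously in scale, but the asymmetry is the same in kind.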

Cloudflare’s Workers AI manages the backend and gives developers access to serverless AI inference via APIs. The platform also provides access to a variety of open source models, she added.
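As a rough sketch of what serverless inference over an API can look like, the snippet below builds an HTTP request for a hosted model without sending it. The URL shape and the model identifier are assumptions based on Cloudflare's public REST documentation at the time of writing, and the account ID and token are placeholders; check the current docs before relying on any of it.

```python
import json
import urllib.request

# Assumed base URL for Cloudflare's REST API; verify against current docs.
API_BASE = "https://api.cloudflare.com/client/v4"

def build_inference_request(account_id, api_token, model, payload):
    """Build (but do not send) a POST request asking a hosted model for a prediction."""
    url = f"{API_BASE}/accounts/{account_id}/ai/run/{model}"
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )

request = build_inference_request(
    account_id="YOUR_ACCOUNT_ID",  # placeholder
    api_token="YOUR_API_TOKEN",    # placeholder
    model="@cf/huggingface/distilbert-sst-2-int8",  # assumed catalog entry
    payload={"text": "The checkout flow was quick and painless."},
)

# Sending it would be: urllib.request.urlopen(request), then json.load() on the response.
```

The point of the sketch is that the developer's side is one HTTP call; no virtual machines or GPUs are provisioned by the caller.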

Wider Adoption of Smaller Models

Another trend Kozlov has noticed is that organizations are deploying smaller models with fewer parameters rather than reaching for the largest model possible.

“People realized, great, there are models that have 400 billion parameters, but you can’t run them practically. They are way too expensive,” she said. “We’re also seeing a shift back towards smaller models.”

Developers are finding that it’s easy enough to use a popular GenAI provider, which typically offers only a few models, through that provider’s API.

Where it gets tricky is when developers want to deploy certain open source large language models or models that are internally trained, Kozlov said. Then, they need to provision virtual machines for infrastructure.

“If you’re looking to leverage some of the really incredible open source models, or a model that you’ve trained yourself,… you’re back to having to provision VMs and really thinking about what’s the peak capacity that I’m going to get, or what’s the peak load that I’m going to get, and provisioning capacity around that.”

Most workloads are not running at 100% all the time; that’s incredibly rare, she said. That means developers are doing a lot of guesswork when provisioning and end up overpaying for capacity that sits idle.
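A quick back-of-the-envelope calculation shows why sizing a fixed fleet for peak load leads to overpaying. The hourly request counts and per-VM capacity below are invented for illustration only.

```python
def peak_provisioning(hourly_load, capacity_per_vm):
    """Size a fixed fleet for the peak hour and report its average utilization."""
    peak = max(hourly_load)
    vms_needed = -(-peak // capacity_per_vm)  # ceiling division
    provisioned = vms_needed * capacity_per_vm * len(hourly_load)
    used = sum(hourly_load)
    return vms_needed, used / provisioned

# Hypothetical requests per hour across an eight-hour window.
hourly_requests = [200, 150, 100, 100, 300, 900, 1000, 400]

vms_needed, utilization = peak_provisioning(hourly_requests, capacity_per_vm=250)
idle_share = 1 - utilization
```

With these made-up numbers, four VMs are needed to cover the 1,000-request peak, yet the fleet averages about 39% utilization, so roughly 61% of the paid-for capacity goes unused.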

“You’re slowing yourself down because you have to think about all that stuff and manage it, and set up load balancing, routing, all that, instead of doing the thing that motivated you to use AI in the first place, which is you want to get to market as quickly as possible, and give yourself that competitive advantage of having AI integrated into your application,” she said.

Cloudflare’s Use of Predictive AI

Cloudflare has been using predictive AI to, for example, distinguish real attacks from legitimate spikes in web traffic, Kozlov said.

“We realized we could take the same network that we’ve built out to help protect and accelerate applications, and use that network to offer developers services to build their applications directly on top of it, instead,” she said.

Besides APIs to connect to inference AI, the cloud-based web platform offers developers other tools, including an AI gateway.

“The AI gateway helps you monitor all these things and experiment, but in a way that allows you to then get and compare the results, and really narrow in on what’s important to you for a particular workload, whether it is cost or whether it’s performance or whether it’s accuracy, and seeing the responses that your users are getting,” she said.
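The kind of comparison Kozlov describes can be sketched as a small script over logged requests. This is not the AI gateway's API; the log fields, model names, and numbers are all hypothetical, and the sketch only shows how the "best" model changes depending on whether cost, latency, or accuracy matters most.

```python
from collections import defaultdict
from statistics import mean

def summarize(logs):
    """Average cost, latency, and accuracy per model from logged requests."""
    grouped = defaultdict(list)
    for entry in logs:
        grouped[entry["model"]].append(entry)
    return {
        model: {
            "cost_usd": mean(e["cost_usd"] for e in entries),
            "latency_ms": mean(e["latency_ms"] for e in entries),
            "accuracy": mean(1.0 if e["correct"] else 0.0 for e in entries),
        }
        for model, entries in grouped.items()
    }

def best_model(summary, metric):
    """Pick the model that wins on one metric; only accuracy is higher-is-better."""
    pick = max if metric == "accuracy" else min
    return pick(summary, key=lambda model: summary[model][metric])

# Hypothetical request logs for two models serving the same workload.
logs = [
    {"model": "small", "cost_usd": 0.001, "latency_ms": 120, "correct": True},
    {"model": "small", "cost_usd": 0.001, "latency_ms": 100, "correct": False},
    {"model": "large", "cost_usd": 0.010, "latency_ms": 800, "correct": True},
    {"model": "large", "cost_usd": 0.010, "latency_ms": 900, "correct": True},
]

summary = summarize(logs)
cheapest = best_model(summary, "cost_usd")
fastest = best_model(summary, "latency_ms")
most_accurate = best_model(summary, "accuracy")
```

Here the smaller model wins on cost and latency while the larger one wins on accuracy, which is the trade-off a team would then weigh for its particular workload.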

