ONNX Runtime Web now supports WebGPU, a web API that enables hardware acceleration for machine learning models running in web browsers. ONNX Runtime Web is a JavaScript library that will allow web developers to deploy machine learning models directly in web browsers, offering multiple backends leveraging hardware acceleration. For CPU inference, it compiles the native […]
Read More: Microsoft’s ONNX Runtime Web brings Generative AI to the web browser