Tagged in

Deep Learning

OctoAI

OctoAI is a Seattle-based startup that delivers infrastructure to run, tune, and scale generative AI.

More information

Followers

334

Elsewhere

More, on Medium

Deep Learning

Jason Knight in OctoAI

Jan 15, 2021

In the cloud — Sparsity on GPUs provides 5X speedup

Author: Tristan Konolige and Sayce Falk

This post now lives at: https://octoml.ai/blog/in-the-cloud-sparsity-on-gpus-provides-5x-speedup/

Sayce Falk in OctoAI

Dec 16, 2020

On the Apple M1, Beating Apple’s Core ML 4 With 50% Model Performance Improvements

Co-Authors: Bing Xu, Lianmin Zheng, Jared Roesch, Sayce Falk

This post now lives at: https://octoml.ai/blog/on-the-apple-m1-beating-apple-s-core-ml-4-with-50-model-performance-improvements/

Thierry Moreau in OctoAI

Dec 3, 2020

Amplify ML Hardware Design Productivity with TVM-driven Hardware Simulation

Chapter 1: Hello, Hardware World!

This post now lives at: https://octoml.ai/blog/amplify-ml-hardware-design-productivity-with-tvm-driven-hardware-simulation/

Sayce Falk in OctoAI

Dec 3, 2020

Octomizer Early Access Is Here!

We’re excited to announce that the Octomizer is now open for Early Access!

This post now lives at: https://octoml.ai/blog/octoml-early-access-is-here/

Sayce Falk in OctoAI

Nov 5, 2020

Unlocking 10X Performance Improvements on Computer Vision Models

At OctoML, we love working with teams that are changing our world through the application and productization of deep learning models. We…

Luis Ceze in OctoAI

Aug 11, 2020

Build ML models once, run anywhere.

Apache TVM democratizes efficient machine learning with a unified software foundation. OctoML is building an MLops automation platform on…

Jason Knight in OctoAI

Jul 17, 2020

Using Sparsity in Apache TVM to halve your cloud bill for NLP

By Joshua Fromm, Bing Xu, Morgan Funtowicz and Jason Knight

Leveraging block sparsity with Apache TVM to halve your cloud bill for NLP

Zachary Tatlock in OctoAI

Mar 30, 2020

Riptide: Fast, Full Binarization in TVM

The first end to end, optimized, open source framework for binary deep learning.

This post now lives at: https://octoml.ai/blog/riptide-fast-full-binarization-in-tvm/

Jason Knight in OctoAI

Mar 24, 2020

Deep Learning

In the cloud — Sparsity on GPUs provides 5X speedup

On the Apple M1, Beating Apple’s Core ML 4 With 50% Model Performance Improvements

Amplify ML Hardware Design Productivity with TVM-driven Hardware Simulation

Chapter 1: Hello, Hardware World!

Octomizer Early Access Is Here!

We’re excited to announce that the Octomizer is now open for Early Access!

Unlocking 10X Performance Improvements on Computer Vision Models

At OctoML, we love working with teams that are changing our world through the application and productization of deep learning models. We…

Build ML models once, run anywhere.

Apache TVM democratizes efficient machine learning with a unified software foundation. OctoML is building an MLops automation platform on…

Using Sparsity in Apache TVM to halve your cloud bill for NLP

By Joshua Fromm, Bing Xu, Morgan Funtowicz and Jason Knight

Riptide: Fast, Full Binarization in TVM

The first end to end, optimized, open source framework for binary deep learning.

OctoML ≔ Easier machine learning

Machine learning and deep learning (ML/DL) are making large impacts across the computing field and the horizon is bright with the rapidly…