Skip to content

Hi there 👋

We're a team that set out to build a local-first consumer AI apps, but after thousands of users and 6 months, we realized the hardware and software aren't there yet. Running near-realtime workloads on consumer CPUs and GPUs can be too slow and drains battery life for most consumer hardware.

While some solutions exist for running local AI models on edge devices, most are only partially open or integrate poorly with native applications. We found this frustrating, so instead of waiting for others to solve the problem, we decided to tackle it ourselves and share our models and SDKs with everyone.

Join our Discord or checkout our models on Huggingface:

Discord Models

Read our blogs here

Pinned Loading

  1. FluidAudio FluidAudio Public

    Native Swift and CoreML SDK for local speaker diarization, VAD, and speech-to-text for real-time workloads. Works on iOS and macOS.

    Swift 685 83

  2. mobius mobius Public

    Run models on native runtimes

    Python 6 1

  3. swift-scribe swift-scribe Public

    Fully local, no dependency scribe. Speak into your microphone and summarize. Requires iOS 26 and MacOS 26 to use the advanced transcription model and foundational model for summaries

    Swift 254 27

Repositories

Showing 9 of 9 repositories
  • FluidAudio Public

    Native Swift and CoreML SDK for local speaker diarization, VAD, and speech-to-text for real-time workloads. Works on iOS and macOS.

    FluidInference/FluidAudio’s past year of commit activity
    Swift 685 Apache-2.0 83 3 1 Updated Sep 27, 2025
  • openvino.genai Public Forked from openvinotoolkit/openvino.genai

    Run Generative AI models with simple C++/Python API and using OpenVINO Runtime

    FluidInference/openvino.genai’s past year of commit activity
    C++ 0 Apache-2.0 288 0 2 Updated Sep 27, 2025
  • mobius Public

    Run models on native runtimes

    FluidInference/mobius’s past year of commit activity
    Python 6 Apache-2.0 1 0 0 Updated Sep 26, 2025
  • fluid-server Public

    Local AI server for your Windows apps.

    FluidInference/fluid-server’s past year of commit activity
    Python 3 Apache-2.0 0 0 0 Updated Sep 25, 2025
  • .github Public
    FluidInference/.github’s past year of commit activity
    0 0 0 0 Updated Sep 25, 2025
  • Fluid.OpenVINO.GenAI Public

    OpenVINO and OpenVINO GenAI, Interop in .NET for GenAI workloads

    FluidInference/Fluid.OpenVINO.GenAI’s past year of commit activity
    C# 6 Apache-2.0 0 0 7 Updated Sep 15, 2025
  • swift-scribe Public

    Fully local, no dependency scribe. Speak into your microphone and summarize. Requires iOS 26 and MacOS 26 to use the advanced transcription model and foundational model for summaries

    FluidInference/swift-scribe’s past year of commit activity
    Swift 254 MIT 27 0 1 Updated Aug 10, 2025
  • swift-parakeet-mlx Public archive

    Based on senstella/parakeet-mlx

    FluidInference/swift-parakeet-mlx’s past year of commit activity
    Swift 47 MIT 7 0 0 Updated Jul 18, 2025
  • fluidtop Public

    MacOS hardware performance monitoring CLI tool with a focus on AI Workloads

    FluidInference/fluidtop’s past year of commit activity
    Python 7 MIT 0 0 0 Updated Jul 6, 2025

People

This organization has no public members. You must be a member to see who’s a part of this organization.