FUTO Swipe: Open-Source Swipe Typing Models Match Big Tech Accuracy

✍️ OpenClawRadar📅 Published: June 24, 2026🔗 Source

FUTO has released FUTO Swipe, a family of open-source swipe typing models and algorithms, along with an inference library. The models power the offline FUTO Keyboard for Android, but can be used independently under the FUTO Model License.

Architecture & Benchmarks

FUTO Swipe uses three model types:

Encoder (635,140 params) – layout-agnostic, language-agnostic, universal predictor.
ContextLM (1,498,472 params total, 1.1M embeddings) – tiny language model trained per language to filter nonsensical words given context. Only needs text data.
Decoder (304,155 params) – language- and layout-specific. Currently only QWERTY English trained on real swipe data.

Combined (1,364,271 active / 2,494,767 total params), the models achieve a top-4 fail rate of ~4% on the test set. Excluding out-of-vocabulary words, the error rate is below 1%. This matches big-tech keyboards, according to FUTO. The benchmarks depend on the dataset, but the paper is forthcoming.

Dataset

FUTO released a dataset of 1 million real QWERTY English swipes under the MIT license, collected from voluntary users on swipe.futo.org. Available on HuggingFace.

Inference Library

The accompanying swipe-library (C++, GPL) handles inference, decoding, and dictionary-constrained beam search. With a beam width of 300, it converts swipe paths to ranked word candidates. The library runs on-device in milliseconds, even on low-end devices.

How to Use

Install FUTO Keyboard v0.1.29 from futo.org for an out-of-box experience with offline swipe typing.
Download models from HuggingFace and integrate via swipe-library.
Attribution required per the FUTO Model License.

FUTO is working on a paper detailing training and architecture. The dataset and models are available now for developers to build upon.

📖 Read the full source: HN AI Agents

👀 See Also

Tools

Benchmarking Nemotron 3 Super 120B with 1M token context on M1 Ultra

A user tested Nemotron 3 Super 120B with a Q4_K_M quantized model using llama.cpp on an M1 Ultra, achieving a 1 million token context window that consumed approximately 90GB of VRAM. Performance benchmarks show token generation speeds ranging from 255 t/s at 512 prompt processing down to 22.37 t/s at 100,000 token context.

Mar 12, 2026, 02:45 PM UTC

OpenClawRadar

Tools

Open Source AI Memory Storage for NodeJS Projects

Mind Palace is an open source memory storage and retrieval system for NodeJS that persists information across LLM chat sessions. It supports major LLMs and vector stores, automatically extracting and vectorizing summarized memories from interactions.

Apr 18, 2026, 04:45 PM UTC

OpenClawRadar

Tools

Building a Self-Improving Knowledge System with Claude Code and Obsidian

A developer built a 25-tool system that gives Claude Code persistent memory through semantic search, knowledge graphs, and spaced repetition over an Obsidian vault. The system indexes content with bge-m3 embeddings, detects contradictions, auto-prunes stale notes, and generates Obsidian Canvas maps automatically.

Apr 13, 2026, 03:22 PM UTC

OpenClawRadar

Tools

Open Source Agent Skill for TypeScript, React, and Next.js Patterns

A developer has released a 4,000-line, 17-file structured markdown reference designed for AI agents like Claude Code to follow when generating or reviewing TypeScript, React, and Next.js code. It addresses common issues like improper API response validation and misuse of 'use client' directives.

Apr 16, 2026, 04:45 PM UTC

OpenClawRadar