LightOn's Avatar

LightOn

@lightonai

LightOn is a leading European generative AI company delivering secure on-prem RAG for document intelligence, enabling safe use of sensitive data behind firewall

92
Followers
12
Following
267
Posts
18.11.2024
Joined
Posts Following

Latest posts by LightOn @lightonai

Preview
Day Zero of Multi-Vector Retrieval - LightOn Introducing ColBERT-Zero: late interaction model trained from scratch with PyLate

Models, checkpoints, training code under Apache 2.0.

πŸ§‘β€πŸ³ Kudos to the whole team @nohtow.bsky.social Luca Arnaboldi @amelietabatta.bsky.social @krzakalaf.bsky.social

πŸ”— Dive into the release: www.lighton.ai/lighton-blog...

19.02.2026 16:14 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

πŸ₯‡ SOTA on BEIR, <150M params
⚑ Supervised-first β†’ distill = most of the gains for a fraction of the cost
🧠 Prompt alignment is non-negotiable to preserve peak performance through fine-tuning

19.02.2026 16:14 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

In collaboration with @epfl-ai-center.bsky.social and the Swiss AI initiative, LightOn pre-trained it end-to-end for late-interaction retrieval

19.02.2026 16:14 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image

Day Zero for Multi-Vector Retrieval.
Today we’re flipping the retrieval playbook: no dense model adaptation, no retrofit.

πŸ—οΈMulti-vector from scratch, powered by PyLate.

Meet ColBERT-Zero

19.02.2026 16:14 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Preview
LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling - LightOn The "Stronger Grep" for Modern Development While AI coding assistants like Claude Code have transformed how code is written, their ability to navigate large codebases efficiently is often limited by k...

Give your coding agent the search it deserves.

Huge kudos to @nohtow.bsky.social and @raphaelsty.bsky.social

Read more: www.lighton.ai/lighton-blog...

12.02.2026 16:31 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

What we measured with Claude Code:
πŸš€ 70% win rate vs. vanilla grep
πŸ“‰ ~60k tokens saved per question
🀏 56% fewer search operations

Built in Rust with Next-Plaid - 100% local - No code leaves your machine.

12.02.2026 16:31 πŸ‘ 2 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

ColGrep is powered by LateOn-Code-edge (17M) and LateOn-Code (130M), the first late-interaction models purpose-built for code.

πŸ† They top MTEB Code,
outperforming models up to 17x their size while running instantly on a laptop.

12.02.2026 16:31 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

ColGrep mirrors the grep interface your agents already use, but replaces pattern matching with semantic scoring, and supports hybrid queries that combine both. It plugs straight into Claude Code, OpenCode, or Codex.

12.02.2026 16:31 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Preview
LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling - LightOn The "Stronger Grep" for Modern Development While AI coding assistants like Claude Code have transformed how code is written, their ability to navigate large codebases efficiently is often limited by k...

πŸ”₯ Stop burning tokens on blind grep searches. Give your coding agent semantic eyes.

Meet LateOn-Code & ColGrep:
a Rust-powered search tool and two SOTA late-interaction models that bring intent-level code retrieval directly to your terminal.

12.02.2026 16:31 πŸ‘ 4 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Preview
Introducing LightOn NextPlaid - LightOn Multi-Vector Database Built for Sharper Retrieval and Frugal Inference

Huge kudos to @raphaelsty.bsky.social for shipping this breakthrough! πŸ™Œ

Read the full article here πŸ‘‰ www.lighton.ai/lighton-blog...

10.02.2026 15:54 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

NextPlaid represents the "Blanc" milestone in our Bleu/Blanc/Rouge roadmap for enterprise document intelligence. It follows the "Bleu" release, LightOnOCR-2, a SOTA 1B OCR model which converts complex documents into clean, usable text.

10.02.2026 15:54 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

βš™οΈ Production Ready:
Built in Rust and optimized for CPUs, it supports incremental index updates and concurrent reads/writesβ€”capabilities missing from standard implementations.

10.02.2026 15:54 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

πŸš€ Seamless Integration:
NextPlaid runs alongside your existing vector database. You can add multi-vector retrieval to your established RAG pipeline without ripping anything out.

10.02.2026 15:54 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

πŸ“‰ Frugal Inference: High-signal context reduces the amount of noise sent to your LLM, allowing it to answer with fewer, more accurate tokens.

10.02.2026 15:54 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

Why NextPlaid is the missing layer for your RAG stack:

🎯 Precision Matching: Retrieval matches at the token level, surfacing the exact passage that answers your question rather than just a document that vaguely relates.

10.02.2026 15:54 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

By representing documents as sets of vectors, one per token, we preserve the distinct concepts and precise details that other search engines average away.

10.02.2026 15:54 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image

πŸ”πŸͺ‘To find the needle, you better index every straw of the haystack.

Today, LightOn is launching LightOn NextPlaid: a CPU-optimized multi-vector database that indexes at the token level.

10.02.2026 15:54 πŸ‘ 3 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Preview
Comment donner une mΓ©moire fiable aux intelligences artificielles ? avec AmΓ©lie Chatelain, Head of Knowledge & Search chez LightOn

En entreprise :
πŸ“„ vos documents sont vivants,
πŸ” l’observabilitΓ© est indispensable,
🌳 le bruit coûte cher et les GPUs ne poussent pas sur les arbres !

Un Γ©pisode dense et sans langue de bois sur l'IA en entreprise.

🎧 Γ‰couter l'Γ©pisode
πŸ‘‰ Spotify : open.spotify.com/episode/4Dtt...

09.02.2026 16:19 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

@amelietabatta.bsky.social Head of Knowledge & Search chez @lightonai.bsky.social est l’invitΓ©e de Laurent Nicolas-Guennoc pour le podcast Converteo β€œChangement d’époque”

Face au narratif "bigger context = better", AmΓ©lie remet les pendules Γ  l'heure.

09.02.2026 16:19 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image

πŸŽ™οΈβ€œMettre tous vos documents dans le contexte d'un modΓ¨le, c'est comme inviter 30 personnes Γ  une rΓ©union oΓΉ une seule suffit : Γ§a coΓ»te cher, Γ§a fait du bruit, et au final le rΓ©sultat est moins prΓ©cis !”

09.02.2026 16:19 πŸ‘ 2 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Preview
Introducing Ettin Suite: the SoTA open recipe to outperform existing Generative & Retrieval Models - LightOn Introducing Ettin, the first ever SOTA suite of paired encoder & decoder models, developed by Johns Hopkins University in collaboration with LightOn.

Congrats to @orionweller.bsky.social @jhuclsp.bsky.social @nohtow.bsky.social for pushing the boundaries of useful AI.

πŸ§‘β€πŸ³ Read the open recipe here: lighton.ai/lighton-blog...

28.01.2026 11:13 πŸ‘ 2 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Size matters less than the right architecture choice.
That’s why the smallest Ettin model is already being massively adopted to build high-performance Edge AI.

It’s time to stop forcing "Decoder-only" models on every problem. For high-value tasks, specialized engineering beats generic scale.

28.01.2026 11:13 πŸ‘ 2 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

Ettin was built as the first-ever SOTA suite of paired encoder-only & decoder-only models to prove a point:

πŸ” Encoders for classification & retrieval
✏️ Decoders for text generation

28.01.2026 11:13 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image

The Ettin suite paper has been accepted to
@iclr-conf.bsky.social

It highlights the Elephant in the room:
πŸ—οΈ Architecture matters.

28.01.2026 11:13 πŸ‘ 2 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Preview
RAG isn’t Dead, Yours is! - LightOn Static ingestion, stale answers, lost trust.

The "G" in RAG only amplifies what the "R" provides.
If your retrieval layer is static, your AI is hallucinating on facts.

Here is how LightOn approaches RAG as critical infrastructure, not just a chatbot feature.
πŸ‘‰ www.lighton.ai/lighton-blog...

16.01.2026 11:59 πŸ‘ 2 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

When you treat it as a simple add-on:

πŸ“‰ Relevance drops as document versions change.
πŸ” Security blocks you because access control wasn't enforced at query time.
⚠️ Trust erodes because the system generates confident answers based on last week's data.

16.01.2026 11:59 πŸ‘ 2 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image

Stop building RAG like a feature. It is infrastructure.

RAG inherits every constraint of your organization: scale, heterogeneous data, and strict governance

16.01.2026 11:59 πŸ‘ 2 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Preview
lightonai/LightOnOCR-1B-1025 Β· Hugging Face We’re on a journey to advance and democratize artificial intelligence through open source and open science.

This makes high-performance OCR far more accessible for edge deployments, privacy-sensitive use cases, and cost-efficient production setups.

πŸ”— Model weights available here huggingface.co/lightonai/Li...

15.01.2026 15:05 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

🚦No GPU required
πŸ’» Runs locally on a laptop (CPU-friendly)
πŸ₯‡ SOTA performance on your data: Fine-tune easily using standard Hugging Face tooling LoRA, PEFT, Trainer

15.01.2026 15:05 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image

πŸ€— LightOnOCR-1B is now in Hugging Face Transformers

With 1.2B downloads, Transformers Library is the go-to toolkit for developers building AI applications.

Any developer can now add state-of-the-art document reading to their app in one line of code.

#Transformers #OCR #GenAI

15.01.2026 15:05 πŸ‘ 2 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0