
The Mecellem Protocol: a new grammar of reasoning
How structured ontology turns scattered legal records into contestable, governed reasoning — the theory behind newmind's semantic layer.
Read the report →A periodic research publication from newmind AI — field studies, protocol papers, and the structural ideas behind governed, agentic legal intelligence.

How structured ontology turns scattered legal records into contestable, governed reasoning — the theory behind newmind's semantic layer.
Read the report →What changed when reasoning moved from chatbots to governed operating systems — adoption, architecture, and the year ahead.
Read the report →
ModernBERT encoders pre-trained from scratch on 112.7B Turkish-dominant tokens and continually adapted — reaching top-3 on the Turkish retrieval leaderboard at a fraction of the parameters.
Read the report →
A Turkish embedding model trained on native — not machine-translated — data with matryoshka representation learning, setting new marks on natural-language inference and semantic textual similarity.
Read the report →
A retrieval-tuned TurkEmbed variant fine-tuned on MS-MARCO-TR that outperforms TurkColBERT on Scifact-TR by 19–26%, raising the bar for Turkish information retrieval.
Read the report →
The first comprehensive benchmark of dense bi-encoders against late-interaction ColBERT-style retrievers for morphologically rich Turkish, across five BEIR-TR domains.
Read the report →
The first suite of hallucination-detection models for Turkish RAG, framing the problem as token-level classification across question answering, data-to-text, and summarization.
Read the report →
A systematic study of guided decoding — Outlines, XGrammar, and LM Format Enforcer — and how multi-turn prompting shapes structured, hallucination-resistant RAG output.
Read the report →
A robustness benchmark that measures how far language models bend the truth under authority and social pressure, with an eight-state behavioral taxonomy evaluated across 22 models.
Read the report →
A training-free method that uses the Ramer–Douglas–Peucker algorithm to pick which layers to adapt — beating full LoRA fine-tuning while touching far fewer layers.
Read the report →
A 15-million-node Turkish synonym graph with a three-way relation discriminator that finally separates synonyms from antonyms and tames semantic drift in large-scale clustering.
Read the report →
A hybrid pipeline — FastText clustering, Gemini classification, and curated dictionaries — that builds an 843K-pair Turkish semantic-relations corpus, a 10× scale-up for about $65.
Read the report →