Product / Operational Copilot

COGNEX
RAG.

A grounded AI copilot for edge, IoT, and autonomous systems that combines live telemetry, manuals, SOPs, incident history, chat, voice, citations, and MCP-powered actions.

SYSTEMS THAT
EXPLAIN THEMSELVES.

Cognex RAG turns fragmented operational data into one intelligent interface. Operators can ask natural-language or voice questions, retrieve cited context from documents and telemetry, and execute tool-backed workflows through MCP servers.

<2sVoice response target
<5msRedis cache hit
50M+Vector chunks target
CitedEvery answer

RETRIEVE, REASON,
AND ACT.

System Architecture

Cognex RAG architecture showing knowledge sources, ingestion, indexing, hybrid retrieval, RAG chain, MCP tools, and user interfaces

Retrieval Pipeline

Cognex RAG retrieval pipeline showing cache checks, query embedding, ClickHouseDB hybrid search, reranking, prompt assembly, LLM synthesis, citations, and cache updates

MCP Tool Layer

Cognex RAG MCP tools diagram showing LangChain routing live-data needs through MCP servers for drone telemetry, IoT device state, maintenance scheduling, and incident log access

KNOWLEDGE + DATA
+ ACTION.

01 / Sources

Multi-Source Inputs

Manuals, firmware notes, telemetry, playbooks, SOPs, incident history, cloud logs, drone events, and edge AI outputs.

02 / Indexing

LlamaIndex Pipeline

Semantic chunking, fixed telemetry summaries, metadata tagging, embedding generation, and low-information fragment filtering.

03 / Retrieval

Hybrid Search

ClickHouseDB vector search combines with BM25-style keyword scoring and Redis cache checks for fast, relevant retrieval.

04 / Reranking

Cross-Encoder Confidence

Top candidates are reranked before prompt assembly, and low-confidence chunks are marked or excluded from citations.

05 / Interaction

Chat + Voice

Field operators can ask typed or spoken questions and receive concise, cited answers through streaming interfaces.

06 / Action

MCP Tool Execution

MCP servers fetch live drone telemetry, IoT device state, maintenance schedules, and incident logs when documents are not enough.

ASK, VERIFY,
RESPOND.

01 / Fault Diagnosis

Cited Fault Answers

A field engineer asks about a fault code. Cognex retrieves the manual section, calls live telemetry through MCP, and returns a cited diagnostic action plan.

02 / Voice in the Field

Hands-Free Procedures

A technician asks for a battery replacement procedure by voice and receives a concise spoken answer from the relevant SOP.

03 / Fleet Knowledge

Maintenance History

Operations managers ask which assets are overdue for inspection, combining playbook rules with live scheduler data.

04 / Firmware Advisory

Version + Changelog Context

Engineers can ask what changed in a firmware release and whether the current fleet is running it.

05 / SOP Search

Semantic Procedure Retrieval

Operators can describe a problem in plain language without knowing the exact SOP name or fault code.

06 / Incident Surge

Redis Hot Cache

Repeated high-traffic incident queries hit cache after the first retrieval, reducing ClickHouseDB load during operational spikes.

GROUNDING
AT SCALE.

Critical

ClickHouseDB Concurrent Search

Large embedding tables can degrade under simultaneous queries. Cognex uses usearch ANN tuning, partition-aware routing, and Redis shielding during spikes.

Critical

Citation Hallucination

Every sentence must trace to a retrieved chunk ID before returning. Uncited sentences are flagged or dropped during post-generation verification.

Critical

MCP Latency in Voice UX

Retrieval and live MCP calls run speculatively in parallel so spoken answers stay responsive when live drone or IoT APIs are involved.

High

Telemetry Freshness

Recent telemetry is queried live through MCP, medium-term data becomes daily digest chunks, and historical data is fully embedded for trend analysis.

High

Hybrid Search Tuning

Fault codes and part numbers need keyword weight, while incident history benefits from semantic matching. Cognex tunes scoring by document type.

Medium

Voice STT Vocabulary

Domain glossary injection improves recognition for terms like YOLOv8, TensorRT, OpenVINO, fault codes, and firmware versions.

RAG WITH
LIVE TOOLS.

Primary Vector Store

ClickHouseDB

Stores manuals, playbooks, SOPs, firmware notes, incident history, and telemetry summaries with usearch ANN indexing.

Hot Vector Cache

Redis

TTL-based cache for repeated high-frequency queries using query-hash keys and serialized top-K chunk lists.

Document Indexing

LlamaIndex

Loads, chunks, tags, embeds, and filters operational documents before indexing.

RAG Orchestration

LangChain

Coordinates query routing, hybrid fusion, reranking, MCP decisions, prompt assembly, and citation post-processing.

Live Tools

MCP Servers

Standardized tool access for drone telemetry, IoT device state, maintenance scheduling, and incident log reading.

Embeddings

text-embedding-3-large

Semantic embeddings with Matryoshka dimension reduction for storage efficiency in ClickHouseDB.

Reranking

Cohere Rerank

Cross-encoder reranking over hybrid search candidates with score-based citation confidence gating.

Voice Interface

Whisper + TTS

Speech-to-text and spoken response synthesis with domain vocabulary support for technical field terms.

Backend

FastAPI + WebSocket

REST chat endpoints, streaming responses, voice progress events, Redis pub/sub invalidation, and Kubernetes scaling.