§ AI Wiki / Glossary
One-line definitions: the AI dictionary.
Floating-point operations per second — the classic metric for raw compute power.
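A minimal sketch of how FLOP counts and FLOP/s relate, using the standard convention that one multiply-add counts as 2 FLOPs; the matrix sizes and timing below are purely illustrative assumptions.

```python
def matmul_flops(m: int, k: int, n: int) -> int:
    """An (m x k) @ (k x n) matmul does m*n*k multiply-adds,
    conventionally counted as 2 FLOPs each."""
    return 2 * m * n * k

def flops_per_second(total_flops: int, seconds: float) -> float:
    return total_flops / seconds

# Illustrative numbers: a 4096x4096x4096 matmul finishing in 10 ms.
flops = matmul_flops(4096, 4096, 4096)        # ~137 GFLOPs of work
print(flops_per_second(flops, 0.010) / 1e12)  # ~13.7 TFLOP/s achieved
```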
An API feature that lets a model invoke a predefined function via structured JSON output.
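A sketch of the mechanics, assuming a hypothetical `get_weather` tool: the application describes the function in a JSON-Schema-style declaration, the model emits a structured call instead of free text, and the application parses and dispatches it.

```python
import json

# Hypothetical tool schema, in the JSON-Schema style most
# function-calling APIs use to advertise available functions.
GET_WEATHER = {
    "name": "get_weather",
    "description": "Look up current weather for a city.",
    "parameters": {
        "type": "object",
        "properties": {"city": {"type": "string"}},
        "required": ["city"],
    },
}

def dispatch(call_json: str) -> str:
    """Parse the model's structured function call and invoke the real function."""
    call = json.loads(call_json)
    if call["name"] == "get_weather":
        return f"Weather in {call['arguments']['city']}: sunny"  # stubbed result
    raise ValueError(f"unknown function: {call['name']}")

# Instead of prose, the model would emit something like:
model_output = '{"name": "get_weather", "arguments": {"city": "Oslo"}}'
print(dispatch(model_output))  # Weather in Oslo: sunny
```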
The time between issuing a request and receiving a result.
A processor that runs massive parallel computations — the workhorse of deep learning.
A RAG variant that extracts a knowledge graph from documents, letting retrieval follow relationships between entities instead of matching isolated passages.
An approach that combines keyword and semantic search to improve retrieval quality.
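One common way to merge the two result lists is Reciprocal Rank Fusion (RRF); a minimal sketch with made-up document ids, where each document scores the sum of 1/(k + rank) across the lists it appears in.

```python
def rrf(rankings, k: int = 60):
    """Reciprocal Rank Fusion: merge several ranked lists of doc ids.
    Each doc scores sum(1 / (k + rank)) over the lists it appears in."""
    scores = {}
    for ranking in rankings:
        for rank, doc in enumerate(ranking, start=1):
            scores[doc] = scores.get(doc, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)

keyword_hits  = ["d3", "d1", "d7"]   # e.g. a BM25 keyword ranking
semantic_hits = ["d1", "d9", "d3"]   # e.g. a vector-similarity ranking
print(rrf([keyword_hits, semantic_hits]))  # d1 wins: it ranks well in both
```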
A graph-based algorithm for fast approximate nearest-neighbor search over high-dimensional vectors.
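The core routine HNSW runs on each layer is a greedy best-first walk over a proximity graph. A simplified single-layer sketch (a real HNSW index has multiple layers and builds the graph itself; the toy graph and points here are assumptions):

```python
import heapq
import math

def greedy_search(graph, points, query, entry, ef: int = 3):
    """Greedy best-first search over a proximity graph: expand the nearest
    unvisited candidate, keep the ef best results seen so far, and stop
    when no candidate can improve on them."""
    d0 = math.dist(points[entry], query)
    visited = {entry}
    candidates = [(d0, entry)]   # min-heap of nodes to expand
    best = [(-d0, entry)]        # max-heap (negated) of current results
    while candidates:
        d, node = heapq.heappop(candidates)
        if d > -best[0][0]:
            break  # nearest unexpanded candidate is worse than worst result
        for nb in graph[node]:
            if nb in visited:
                continue
            visited.add(nb)
            nd = math.dist(points[nb], query)
            if len(best) < ef or nd < -best[0][0]:
                heapq.heappush(candidates, (nd, nb))
                heapq.heappush(best, (-nd, nb))
                if len(best) > ef:
                    heapq.heappop(best)  # evict the worst result
    return sorted((-d, n) for d, n in best)

# Toy 2-D points and a hand-built neighbor graph (assumed, not a real index).
points = {0: (0, 0), 1: (1, 0), 2: (2, 0), 3: (2, 1), 4: (5, 5)}
graph  = {0: [1], 1: [0, 2], 2: [1, 3, 4], 3: [2], 4: [2]}
print(greedy_search(graph, points, query=(2.1, 0.9), entry=0))
```

The walk reaches node 3, the true nearest neighbor, without ever computing the distance to most of a large index — that locality is what makes the search approximate but fast.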
Adapting a pre-trained model to a specific task using smaller, targeted data.
A decoding algorithm that keeps the K most-likely candidate sequences alive in parallel during generation.
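A minimal sketch over a toy "model" — a lookup table of next-symbol probabilities conditioned on the last symbol (the numbers are invented purely to exercise the algorithm):

```python
import math

# Toy next-symbol distribution conditioned on the previous symbol.
NEXT = {
    None: {"a": 0.6, "b": 0.4},
    "a":  {"a": 0.1, "b": 0.9},
    "b":  {"a": 0.5, "b": 0.5},
}

def beam_search(steps: int, k: int):
    """Keep the k highest log-probability sequences alive at every step."""
    beams = [((), 0.0)]  # (sequence, cumulative log-prob)
    for _ in range(steps):
        expanded = []
        for seq, lp in beams:
            last = seq[-1] if seq else None
            for sym, p in NEXT[last].items():
                expanded.append((seq + (sym,), lp + math.log(p)))
        beams = sorted(expanded, key=lambda x: x[1], reverse=True)[:k]
    return beams

best_seq, best_lp = beam_search(steps=3, k=2)[0]
print(best_seq, math.exp(best_lp))
```

Unlike greedy decoding, the second beam keeps a lower-probability prefix alive in case it leads to a better full sequence later.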
An ANN indexing technique that partitions the vector space into clusters to speed up search.
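A toy sketch of the inverted-file idea: assign each vector to its nearest centroid at build time, then scan only the `nprobe` closest clusters at query time. The centroids here are hand-picked assumptions; a real index would learn them with k-means.

```python
import math

def build_ivf(vectors, centroids):
    """Inverted file: one posting list of vector ids per centroid."""
    lists = [[] for _ in centroids]
    for i, v in enumerate(vectors):
        nearest = min(range(len(centroids)),
                      key=lambda c: math.dist(v, centroids[c]))
        lists[nearest].append(i)
    return lists

def ivf_search(query, vectors, centroids, lists, nprobe=1):
    """Scan only the nprobe clusters whose centroids are closest to the query."""
    probe = sorted(range(len(centroids)),
                   key=lambda c: math.dist(query, centroids[c]))[:nprobe]
    candidates = [i for c in probe for i in lists[c]]
    return min(candidates, key=lambda i: math.dist(query, vectors[i]))

vectors   = [(0, 0), (0, 1), (10, 10), (10, 11), (9, 10.5)]
centroids = [(0, 0.5), (10, 10.3)]
lists = build_ivf(vectors, centroids)
print(ivf_search((9.5, 10.2), vectors, centroids, lists, nprobe=1))
```

With `nprobe=1` the query touches only the second cluster's three vectors, never the two near the origin; raising `nprobe` trades speed for recall.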
An API setting that guarantees the model's output is valid JSON.
A reference that ties information in an LLM's answer back to its source document.
The in-session memory an agent keeps within its context window — recent turns and intermediate state.
A control layer that keeps an LLM or agent within sanctioned behavior boundaries.
The cache that stores previously computed key/value vectors so the model doesn't recompute them every step.
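The saving can be sketched by counting key/value projections during a fake generation loop (the `kv_project` stub and token names are assumptions; real K/V are per-layer matrices): without a cache the work per step grows with sequence length, with one it stays constant.

```python
calls = {"kv": 0}

def kv_project(token):
    """Stand-in for the key/value projection of one token (counted per call)."""
    calls["kv"] += 1
    return (hash(token), hash(token) // 2)  # fake K and V

def generate_no_cache(prompt, steps):
    seq = list(prompt)
    for _ in range(steps):
        kvs = [kv_project(t) for t in seq]  # recompute K/V for EVERY position
        seq.append(f"tok{len(seq)}")        # pretend attention picked a token
    return seq

def generate_with_cache(prompt, steps):
    seq, cache = list(prompt), []
    for _ in range(steps):
        while len(cache) < len(seq):        # project only uncached positions
            cache.append(kv_project(seq[len(cache)]))
        seq.append(f"tok{len(seq)}")
    return seq

calls["kv"] = 0
generate_no_cache(["a", "b"], steps=4)
quadratic = calls["kv"]            # 2 + 3 + 4 + 5 = 14 projections
calls["kv"] = 0
generate_with_cache(["a", "b"], steps=4)
linear = calls["kv"]               # 2 + 1 + 1 + 1 = 5 projections
print(quadratic, linear)
```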
A stateful, graph-based agent workflow framework from the LangChain team.
Georgi Gerganov's open-source C++ project that made running LLMs locally a practical reality.
A fine-tuning technique that trains only small low-rank matrices instead of every weight, dramatically cutting memory.
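A bare-bones sketch of the arithmetic: the frozen weight W is used as W' = W + (alpha/r)·B·A, where only the small B (d×r) and A (r×d) matrices train. Dimensions and values below are toy assumptions chosen so the numbers are easy to check.

```python
def matmul(X, Y):
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*Y)]
            for row in X]

d, r = 4, 1   # toy sizes: 4x4 frozen weight, rank-1 adapter
W = [[1.0 if i == j else 0.0 for j in range(d)] for i in range(d)]  # frozen
B = [[0.5], [0.0], [0.0], [0.0]]   # d x r, trainable (zero-init in practice)
A = [[0.0, 2.0, 0.0, 0.0]]         # r x d, trainable
alpha = 1.0

# Effective weight: W' = W + (alpha / r) * B @ A; only A and B get gradients.
delta = matmul(B, A)
W_eff = [[W[i][j] + (alpha / r) * delta[i][j] for j in range(d)]
         for i in range(d)]

full_params = d * d                # training every weight
lora_params = d * r + r * d        # training only A and B
print(full_params, lora_params)    # 16 vs 8 here; the gap explodes as d grows
```

At d = 4096 and r = 8 the same formula gives ~16.8M full parameters versus ~65K trainable ones, which is where the memory saving comes from.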
A training objective where the model learns to predict tokens that have been masked out of a sentence.
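A BERT-style sketch of building one masked training example: a random subset of tokens is replaced with `[MASK]`, and the originals become the prediction targets (the 15% rate and the seeding are assumptions for reproducibility; real pipelines also sometimes substitute random tokens).

```python
import random

MASK = "[MASK]"

def mask_tokens(tokens, mask_prob=0.15, rng=None):
    """Replace a random subset of tokens with [MASK]; keep the originals
    as labels so the model is trained only at the masked positions."""
    rng = rng or random.Random(0)
    masked, labels = [], []
    for tok in tokens:
        if rng.random() < mask_prob:
            masked.append(MASK)
            labels.append(tok)      # the model must predict this token
        else:
            masked.append(tok)
            labels.append(None)     # no loss at unmasked positions
    return masked, labels

masked, labels = mask_tokens("the cat sat on the mat".split(), mask_prob=0.3)
print(masked)
print(labels)
```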
How much of a model's theoretical peak FLOPs is actually delivered during real training — a key efficiency metric.
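The ratio itself is simple arithmetic; the numbers below are illustrative assumptions, not measurements (a common rule of thumb is roughly 6 × parameter-count training FLOPs per token, so 6e9 corresponds to a ~1B-parameter model).

```python
def mfu(tokens_per_s: float, flops_per_token: float, peak_flops: float) -> float:
    """Model FLOPs Utilization: useful FLOP/s achieved over hardware peak."""
    return tokens_per_s * flops_per_token / peak_flops

# Assumed numbers: 50k tokens/s, ~6e9 training FLOPs per token,
# hardware peak of 1e15 FLOP/s (1 PFLOP/s).
print(f"{mfu(50_000, 6e9, 1e15):.0%}")  # 30%
```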
Apple's open-source ML framework purpose-built for Apple Silicon, with a NumPy-like API.
Representing model weights with lower-precision numbers to save memory and gain speed.
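A minimal sketch of symmetric int8 quantization, one of the simplest schemes: scale all weights by the largest magnitude so they fit in [-127, 127], store 1-byte integers plus one float scale, and multiply back to approximate the originals (the weight values are arbitrary examples).

```python
def quantize_int8(weights):
    """Symmetric int8 quantization: one shared scale maps floats to [-127, 127]."""
    scale = max(abs(w) for w in weights) / 127
    q = [round(w / scale) for w in weights]
    return q, scale

def dequantize(q, scale):
    return [x * scale for x in q]

w = [0.12, -0.5, 0.33, 1.0]
q, scale = quantize_int8(w)
approx = dequantize(q, scale)
print(q)       # small integers: 1 byte each instead of 4 for float32
print(approx)  # close to the originals, within half a quantization step
```

The memory saving is the point: 4x smaller than float32 here, and in LLMs schemes like 4-bit quantization push the ratio further at some accuracy cost.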
A specialised AI accelerator integrated into phones and laptops to run neural workloads efficiently.
NVIDIA's Ampere-generation GPU launched in 2020, the workhorse of deep learning for years.