§ AI Wiki / Glossary
One-line definitions, the AI dictionary.
Context window: The maximum number of tokens a language model can process in a single forward pass.
Token count: The total number of tokens consumed in a single model call, counted against the model's context-window limit.
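For instance, you can estimate this count before making a call. The sketch below assumes OpenAI's tiktoken library and its built-in cl100k_base encoding; other models ship their own tokenizers.

```python
import tiktoken

# Count the tokens a prompt will consume before sending it to a model.
enc = tiktoken.get_encoding("cl100k_base")

prompt = "Explain the difference between a context window and token count."
n_tokens = len(enc.encode(prompt))
print(f"{n_tokens} tokens used against the context-window limit")
```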
Byte-pair encoding (BPE): A tokenization algorithm that builds a sub-word vocabulary by iteratively merging the most frequent character pairs.
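A toy illustration of that merge loop (the function name and tiny corpus are invented for this sketch; production BPE trains over a large corpus and typically operates on bytes):

```python
from collections import Counter

def bpe_merges(words, num_merges):
    """Toy BPE: repeatedly merge the most frequent adjacent symbol pair."""
    corpus = [list(w) for w in words]   # start with each word as characters
    merges = []
    for _ in range(num_merges):
        # Count every adjacent symbol pair across the corpus.
        pairs = Counter()
        for symbols in corpus:
            for a, b in zip(symbols, symbols[1:]):
                pairs[(a, b)] += 1
        if not pairs:
            break
        best = max(pairs, key=pairs.get)
        merges.append(best)
        # Replace each occurrence of the best pair with the merged symbol.
        new_corpus = []
        for symbols in corpus:
            out, i = [], 0
            while i < len(symbols):
                if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == best:
                    out.append(symbols[i] + symbols[i + 1])
                    i += 2
                else:
                    out.append(symbols[i])
                    i += 1
            new_corpus.append(out)
        corpus = new_corpus
    return merges, corpus

merges, corpus = bpe_merges(["lower", "lowest", "newer", "wider"], num_merges=4)
print(merges)   # learned merge rules, e.g. ('w', 'e'), ('e', 'r'), ...
print(corpus)   # words now segmented into learned sub-word units
```

Each learned merge becomes a vocabulary entry, which is why frequent fragments like "er" end up as single tokens.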
Cross-attention: An attention mechanism where one sequence attends to a different sequence, typically connecting encoder and decoder.
Multi-head attention: A version of attention where multiple parallel 'heads' learn different relationships at the same time.
Decoder: The Transformer component that generates the next token conditioned on what came before.
Attention: The mechanism that lets a model decide how much weight to give different parts of its input.
Encoder: The Transformer component that turns input into a meaningful internal representation.
Confabulation: A more clinically accurate term for LLM 'hallucination' — confidently filling gaps with plausible-sounding fiction.
Cosine similarity: A similarity measure based on the angle between two vectors, returning a value between -1 and 1.
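A minimal implementation straight from the definition (pure Python; the function name is ours):

```python
import math

def cosine_similarity(u, v):
    """Cosine of the angle between two vectors: dot(u, v) / (|u| * |v|)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

print(cosine_similarity([1.0, 2.0, 3.0], [2.0, 4.0, 6.0]))  # 1.0: same direction
print(cosine_similarity([1.0, 0.0], [0.0, 1.0]))            # 0.0: orthogonal
```

Because it depends only on the angle, two embeddings of very different magnitudes can still score as highly similar.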
Self-attention: A mechanism where each element in a sequence attends to every other element in the same sequence.
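The sketch below shows scaled dot-product self-attention in NumPy, with Q, K and V all projected from the same sequence; the shapes and random weights are invented for illustration. Taking K and V from a different sequence instead gives cross-attention, and running several such heads in parallel and concatenating their outputs gives multi-head attention.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def self_attention(x, w_q, w_k, w_v):
    """Scaled dot-product self-attention over one sequence.
    x: (seq_len, d_model); w_q/w_k/w_v: (d_model, d_head) projections.
    In self-attention Q, K and V all come from the same sequence x."""
    q, k, v = x @ w_q, x @ w_k, x @ w_v
    scores = q @ k.T / np.sqrt(k.shape[-1])  # how much each token attends to each other token
    weights = softmax(scores, axis=-1)       # each row sums to 1
    return weights @ v                       # weighted mix of value vectors

rng = np.random.default_rng(0)
x = rng.normal(size=(5, 16))                 # 5 tokens, d_model = 16
w = [rng.normal(size=(16, 8)) for _ in range(3)]
print(self_attention(x, *w).shape)           # (5, 8)
```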
SentencePiece: Google's language-agnostic tokenizer library that treats whitespace as just another character.
Temperature: The sampling parameter that controls how 'creative' or 'deterministic' a model's output is by rescaling the logits before sampling.
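Concretely, temperature divides the logits before the softmax. A minimal NumPy sketch (the function name and toy logits are illustrative):

```python
import numpy as np

def sample_with_temperature(logits, temperature, rng):
    # T < 1 sharpens the distribution (more deterministic);
    # T > 1 flattens it (more varied); T -> 0 approaches plain argmax.
    scaled = np.asarray(logits, dtype=float) / temperature
    probs = np.exp(scaled - scaled.max())    # numerically stable softmax
    probs /= probs.sum()
    return rng.choice(len(probs), p=probs)

rng = np.random.default_rng(0)
logits = [2.0, 1.0, 0.1]
print(sample_with_temperature(logits, 0.2, rng))  # almost always token 0
print(sample_with_temperature(logits, 2.0, rng))  # noticeably more varied
```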
Tokenization: The process of converting raw text into a sequence of model-readable tokens.
Top-K sampling: A sampling strategy that picks the next token from only the K most likely candidates.
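A NumPy sketch of the idea (function name and probabilities are illustrative):

```python
import numpy as np

def top_k_sample(probs, k, rng):
    """Keep only the k most likely tokens, renormalize, then sample."""
    probs = np.asarray(probs, dtype=float)
    top = np.argsort(probs)[-k:]          # indices of the k largest probabilities
    masked = np.zeros_like(probs)
    masked[top] = probs[top]
    masked /= masked.sum()                # renormalize over the survivors
    return rng.choice(len(probs), p=masked)

rng = np.random.default_rng(0)
probs = [0.5, 0.3, 0.1, 0.07, 0.03]
print(top_k_sample(probs, k=2, rng=rng))  # only ever returns token 0 or 1
```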
Top-P (nucleus) sampling: A sampling method that draws from the smallest set of candidates whose cumulative probability exceeds P.
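The same style of sketch for the nucleus variant; unlike Top-K, the candidate set grows or shrinks with how concentrated the distribution is:

```python
import numpy as np

def top_p_sample(probs, p, rng):
    """Nucleus sampling: keep the smallest prefix of the sorted distribution
    whose cumulative probability reaches p, renormalize, then sample."""
    probs = np.asarray(probs, dtype=float)
    order = np.argsort(probs)[::-1]               # most likely first
    cumulative = np.cumsum(probs[order])
    cutoff = np.searchsorted(cumulative, p) + 1   # size of the nucleus
    nucleus = order[:cutoff]
    masked = np.zeros_like(probs)
    masked[nucleus] = probs[nucleus]
    masked /= masked.sum()
    return rng.choice(len(probs), p=masked)

rng = np.random.default_rng(0)
probs = [0.5, 0.3, 0.1, 0.07, 0.03]
print(top_p_sample(probs, p=0.8, rng=rng))        # nucleus = tokens 0 and 1
```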
Long-context models: Next-generation LLMs that can process hundreds of thousands — sometimes millions — of tokens in a single context.
Vector: A list of numbers representing a point in high-dimensional space — direction and magnitude in one bundle.
WordPiece: Google's likelihood-driven sub-word algorithm, similar in spirit to BPE and used by BERT.