Category
page 1Machine learning
machine learning
scientific study of algorithms and statistical models that computer systems use to perform tasks without explicit instructions
generative artificial intelligence
artificial intelligence model capable of generating content in response to a prompt
Claude (language model)
Claude is a series of large language models developed by Anthropic and first released in 2023. Since Claude 3, each generation has typically been released in three sizes, from least to most capable: Haiku, Sonnet, and Opus. An additional model named Claude Mythos was released to some companies in 2026 but not to the public.
time series
set of data indexed in time order
prompt engineering
creation or optimization of a prompt to be given to an artificial intelligence model
pattern recognition
branch of machine learning
hallucination
confident unjustified claim by an AI
cross-validation
statistical model validation technique
anomaly detection
The identification of rare items, events or observations which raise suspicions by differing significantly from the expected or majority of the data
overfitting
thumb|300px|Figure 1. The green line represents an overfitted model and the black line represents a regularized model. While the green line best follows the training data, it is too dependent on that data and is likely to have a higher error rate on new unseen data, illustrated by black-outlined dots, compared to the black line.
thumb|300x300px|Figure 2. Noisy (roughly linear) data is fitted to a linear function and a polynomial function. Although the polynomial function is a perfect fit, the linear function can be expected to generalize better: If the two functions were used to ex
Hugging Face
American company
robotic process automation
form of business process automation technology
algorithmic bias
systematic and repeatable errors in a computer system that create unfair outcomes, such as privileging one arbitrary group of users over others
document classification
problem in library science, information science and computer science
confusion matrix
table layout for visualizing performance; also called an error matrix
self-supervised learning
class of machine learning techniques in which a task is solved based on pseudo-labels which help initialize weights the weight, then the actual task is performed with supervised or unsupervised learning
transfer learning
research problem in machine learning (ML) that focuses on storing knowledge gained while solving one problem and applying it to a different but related problem
bag-of-words model
model of text which uses a representation of text that is based on an unordered collection (a "bag") of words
generative model
model for randomly generating observable data in probability and statistics
fine-tuning
process of taking a pre-trained model and further training it on a smaller, specific dataset to adapt or improve its performance for a particular task or domain
curse of dimensionality
various phenomena that arise when analyzing and organizing data in high-dimensional spaces that do not occur in low-dimensional settings such as the three-dimensional physical space of everyday experience
attention
machine learning technique
feature
in machine learning, individual measurable property or characteristic of a phenomenon being observed
feature learning
a set of techniques that learn a feature: a transformation of raw data input to a representation that can be effectively exploited in machine learning tasks
quantum machine learning
Quantum Machine Learning combines quantum computing and ML to enhance algorithms, leveraging unique quantum properties, like superposition and entanglement , for efficient problem-solving.
feature engineering
process that creates features for machine learning by transforming or combining existing features
MLOps
thumb|MLOps is the set of practices at the intersection of Machine Learning, DevOps and Data Engineering
MLOps or ML Ops is a paradigm that aims to deploy and maintain machine learning models in production reliably and efficiently. It bridges the gap between machine learning development and production operations, ensuring that models are robust, scalable, and aligned with business goals. The word is a compound of "machine learning" and the continuous delivery practice (CI/CD) of DevOps in the software field. Machine learning models are tested and developed in isolated experimental systems. Whe
adversarial machine learning
machine learning technique that attempts to prevent models being fooled by supplying deceptive input
bias–variance tradeoff
property of a set of predictive models whereby models with a lower bias in parameter estimation have a higher variance of the parameter estimates across samples, and vice versa
kernel density estimation
estimator
semi-supervised learning
class of machine learning techniques combining a small amount of labeled data with a large amount of unlabeled data during training
data pre-processing
manipulation of data before it is analyzed
binary classification
the task of classifying the elements of a given set into two groups (predicting which group each one belongs to) on the basis of a classification rule
federated learning
method for artificial intelligence
hyperparameter
in machine learning, a parameter whose value is used to control the learning process
conditional random field
class of statistical modeling method applied in pattern recognition and machine learning and used for structured prediction
formal concept analysis
a rigorous method of deriving an ontology from a collection of objects and their properties
hyperparameter optimization
choosing a set of optimal hyperparameters for a learning algorithm
empirical risk minimization
in computer science, a way to determine theoretical bounds in machine learning
learning rate
tuning parameter (hyperparameter) in optimization

statistical learning theory
theoretical framework for machine learning

automated machine learning
process of automating the end-to-end process of machine learning

data augmentation
creation of more data based on previously collected data to enhance the performance of a statistical model

reasoning language model
language models designed for reasoning tasks
lazy learning
type of machine learning method
vector database
database designed to store vector embeddings and perform similarity search in high-dimensional spaces
inductive programming
learning programs from data
Bayesian optimization
optimization technique for undifferentiable, black-box functions
active learning
machine learning strategy in which a learning algorithm interactively queries for new labels
linear separability
geometric property of a pair of sets of points in Euclidean geometry
flow-based generative model
Used in machine learning
astrostatistics
Astrostatistics is a discipline which spans astrophysics, statistical analysis and data mining. It is used to process the vast amount of data produced by automated scanning of the cosmos, to characterize complex datasets, and to link astronomical data to astrophysical theory. Many branches of statistics are involved in astronomical analysis including nonparametrics, multivariate regression and multivariate classification, time series analysis, and especially Bayesian inference. The field is closely related to astroinformatics.
inductive bias
in a machine-learning algorithm, the set of assumptions that the learner uses to predict outputs of given inputs that it has not encountered
Phi coefficient
type of coefficient
learning to rank
application of machine learning
JAX
Python library
multi-armed bandit
reinforcement learning problem exemplifying the exploration–exploitation tradeoff

Cognitive robotics
robot with processing architecture that will allow it to learn
fairness in machine learning
trait of an algorithm, whose results are independent of given variables, e.g. gender, ethnicity, sexual orientation, disability
machine learning in physics
applications of machine learning to quantum physics