Welcome to my blog!

Comprehensive Understanding of Mistral Model

Mixture of Experts

Mistral

LLMs

The attention mechanism is a key component of Transformer models. It allows the model to focus on different parts of the input sequence and derive the relationship between…
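To make the idea of "focusing on different parts of the input" concrete, here is a minimal sketch of scaled dot-product self-attention in plain Python. It is a simplification and not the post's own code: it assumes each token vector is used directly as query, key, and value, whereas a real Transformer first applies learned projection matrices. The names `softmax` and `self_attention` are illustrative only.

```python
import math

def softmax(scores):
    # Subtract the max for numerical stability before exponentiating.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def self_attention(vectors):
    # Assumption: queries, keys, and values are the raw input vectors
    # (no learned projections), to keep the sketch short.
    d = len(vectors[0])
    outputs, all_weights = [], []
    for q in vectors:
        # Similarity of this token to every token, scaled by sqrt(d).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d)
                  for k in vectors]
        weights = softmax(scores)
        # Output is the attention-weighted average of all value vectors.
        out = [sum(w * v[i] for w, v in zip(weights, vectors))
               for i in range(d)]
        outputs.append(out)
        all_weights.append(weights)
    return outputs, all_weights

tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
outputs, weights = self_attention(tokens)
```

Each row of `weights` sums to 1 and says how much one token attends to every token in the sequence, itself included.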

Self-Attention & Transformer

machine learning

word vectors

word embeddings

transformers

deep learning

A self-attention model requires the following:

Word Vectors

machine learning

word vectors

word embeddings

transformers

deep learning

Word vectors are also called word embeddings or neural word representations because words are represented as points in a high-dimensional vector space and they…

Data Fundamentals

machine learning

data preparation

Outliers are examples that look dissimilar to the majority of examples in the dataset. Dissimilarity is measured by some distance metric, such as Euclidean distance. Deleti…
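As a rough illustration of measuring dissimilarity with Euclidean distance, the sketch below flags points that lie unusually far from the dataset's centroid. The cutoff rule (three times the median distance) and the names `euclidean` and `find_outliers` are assumptions made for this example, not something the post prescribes.

```python
import math
from statistics import median

def euclidean(a, b):
    # Straight-line distance between two points of equal dimension.
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def find_outliers(points, factor=3.0):
    # Centroid: the per-dimension mean of all points.
    dim = len(points[0])
    centroid = [sum(p[i] for p in points) / len(points) for i in range(dim)]
    distances = [euclidean(p, centroid) for p in points]
    # Assumed rule of thumb: flag anything farther than
    # `factor` times the median distance to the centroid.
    cutoff = factor * median(distances)
    return [p for p, d in zip(points, distances) if d > cutoff]

points = [(1.0, 1.0), (1.2, 0.9), (0.8, 1.1), (1.1, 1.0), (9.0, 9.0)]
outliers = find_outliers(points)
```

With this toy data, only the point `(9.0, 9.0)` is flagged. The median is used for the cutoff because the outlier itself would inflate a mean-based threshold.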