Welcome to my blog!
Mandil Karki
Categories
All
(4)
LLMs
(1)
Mistral
(1)
Mixture of Experts
(1)
data preparation
(1)
deep learning
(2)
machine learning
(3)
transformers
(2)
word embeddings
(2)
word vectors
(2)
Comprehensive Understanding of Mistral Model
Mixture of Experts
Mistral
LLMs
Attention mechanism is a key component in Transformer models. It allows the model to focus on different parts of the input sequence and derive the relationship between…
Mar 11, 2024
Mandil Karki
Self-Attention & Transformer
machine learning
word vectors
word embeddings
transformers
deep learning
The necessities for a self-attention model are as follows:
Oct 23, 2022
Mandil Karki
Word Vectors
machine learning
word vectors
word embeddings
transformers
deep learning
Word vectors are also called
word embeddings
or neural word representations because these whole bunch of words are represented in a high dimensional vector space and they…
Oct 15, 2022
Mandil Karki
Data Fundamentals
machine learning
data preparation
Outliers are examples that look dissimilar to the majority of examples from the dataset. Dissimilarity is measured by some distance metric, such as
Euclidean distance.
Deleti…
Mar 15, 2022
Mandil Karki
No matching items