GPT
tags :
AI and Acronym #
for Generative Pre-trained Transformer
Transformer #
A transformer model is a neural network that learns context and meaning by tracking relationships in sequential data, like the words in this sentence. ref
A transformer is a deep learning architecture developed by Google and based on the multi-head attention mechanism, proposed in a 2017 paper “Attention Is All You Need”.[1] Text is converted to numerical representations called tokens, and each token is converted into a vector via looking up from a word embedding table.[