FiveTech Support Forums

FiveWin / Harbour / xBase community
Posts: 44158
Joined: Thu Oct 06, 2005 05:47 PM
Transformers
Posted: Tue Sep 23, 2025 07:08 AM
flowchart TB
    A[Attention is All You Need] --> B[Introduction]
    A --> C[Background]
    A --> D[Model Architecture]
    click D "https://forums.fivetechsupport.com/viewtopic.php?p=281677#p281677" "click"
    A --> E[Why Self-Attention]
    click E "https://forums.fivetechsupport.com/viewtopic.php?p=281680#p281680" "click"
    A --> F[Training]
    click F "https://forums.fivetechsupport.com/viewtopic.php?p=281681#p281681" "click"
    A --> G[Results]
    click G "https://forums.fivetechsupport.com/viewtopic.php?p=281682#p281682" "click"
    A --> H[Conclusions]
    click H "https://forums.fivetechsupport.com/viewtopic.php?p=281683#p281683" "click"
regards, saludos

Antonio Linares
www.fivetechsoft.com
Re: Transformers
Posted: Tue Sep 23, 2025 07:09 AM
flowchart LR
    B[Introduction] --> B1[Motivation]
    B --> B2[RNN Sequential Nature]
    B --> B3[Attention Mechanisms]
    B --> B4[Transformer Overview]

    C[Background] --> C1[Attention Function]
    C --> C2[Self-Attention]
    C --> C3[Multi-Head Attention]
regards, saludos

Antonio Linares
www.fivetechsoft.com
Re: Transformers
Posted: Tue Sep 23, 2025 07:10 AM
flowchart TB
    D[Model Architecture] --> D1[Encoder-Decoder]
    D --> D2[Encoder Stack]
    D --> D3[Decoder Stack]
    D --> D4[Attention Mechanisms]
    D --> D5[Position Encodings]
    D --> D6[Embeddings]
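The position encodings in the outline above are the paper's sinusoidal ones: PE(pos, 2i) = sin(pos / 10000^(2i/d_model)) and PE(pos, 2i+1) = cos(pos / 10000^(2i/d_model)). A minimal NumPy sketch (function name and shapes are illustrative, not from the paper's code):

```python
import numpy as np

def positional_encoding(seq_len, d_model):
    """Sinusoidal position encodings:
    PE(pos, 2i)   = sin(pos / 10000^(2i/d_model))
    PE(pos, 2i+1) = cos(pos / 10000^(2i/d_model))
    """
    pos = np.arange(seq_len)[:, None]        # (seq_len, 1)
    i = np.arange(d_model // 2)[None, :]     # (1, d_model/2)
    angles = pos / np.power(10000, 2 * i / d_model)
    pe = np.zeros((seq_len, d_model))
    pe[:, 0::2] = np.sin(angles)             # even dimensions
    pe[:, 1::2] = np.cos(angles)             # odd dimensions
    return pe

pe = positional_encoding(50, 512)
print(pe.shape)  # (50, 512)
```

Because the encodings are fixed functions of position, they add no learned parameters and extrapolate to sequence lengths not seen in training.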
regards, saludos

Antonio Linares
www.fivetechsoft.com
Re: Transformers
Posted: Tue Sep 23, 2025 07:11 AM
flowchart LR
    D2[Encoder Stack] --> D2a[6 Layers]
    D2 --> D2b[Multi-Head Self-Attention]
    D2 --> D2c[Feed Forward]
    D2 --> D2d[Residual Connections]
    D2 --> D2e[Layer Normalization]

    D3[Decoder Stack] --> D3a[6 Layers]
    D3 --> D3b[Masked Multi-Head Attention]
    D3 --> D3c[Encoder-Decoder Attention]
    D3 --> D3d[Feed Forward]
    D3 --> D3e[Residual + LayerNorm]
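Every sublayer in both stacks follows the same pattern: LayerNorm(x + Sublayer(x)), i.e. a residual connection around the sublayer followed by layer normalization, and the feed-forward sublayer is the position-wise FFN(x) = max(0, xW1 + b1)W2 + b2. A simplified NumPy sketch of that wiring (dropout and the learned LayerNorm gain/bias are omitted; weight names are illustrative):

```python
import numpy as np

def layer_norm(x, eps=1e-6):
    # Normalize over the feature (last) dimension; gain/bias omitted for brevity
    mu = x.mean(-1, keepdims=True)
    sigma = x.std(-1, keepdims=True)
    return (x - mu) / (sigma + eps)

def sublayer(x, fn):
    # Residual connection around fn, then layer normalization:
    # LayerNorm(x + fn(x)), the pattern used by every encoder/decoder sublayer
    return layer_norm(x + fn(x))

def feed_forward(x, w1, b1, w2, b2):
    # Position-wise FFN: max(0, x W1 + b1) W2 + b2, applied at each position
    return np.maximum(0, x @ w1 + b1) @ w2 + b2

rng = np.random.default_rng(1)
x = rng.standard_normal((3, 8))                     # 3 positions, d_model = 8
w1 = rng.standard_normal((8, 32)); b1 = np.zeros(32)
w2 = rng.standard_normal((32, 8)); b2 = np.zeros(8)
y = sublayer(x, lambda t: feed_forward(t, w1, b1, w2, b2))
print(y.shape)  # (3, 8)
```

The residual path is what lets six of these layers stack without the signal degrading, since each layer only has to learn a correction to its input.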
regards, saludos

Antonio Linares
www.fivetechsoft.com
Re: Transformers
Posted: Tue Sep 23, 2025 07:11 AM
</s>flowchart TB
    D4[Attention Mechanisms] --&gt; D4a[Scaled Dot-Product]
    D4 --&gt; D4b[Multi-Head Attention]
    D4 --&gt; D4c[Self-Attention]
    D4 --&gt; D4d[Applications in Model]
    
    D4a --&gt; D4a1[Query, Key, Value]
    D4a --&gt; D4a2[Attention Weights]
    D4a --&gt; D4a3[Scaling Factor]
<e>
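The scaled dot-product branch above is the paper's core formula: Attention(Q, K, V) = softmax(QK^T / sqrt(d_k)) V, where the sqrt(d_k) scaling keeps the dot products from pushing the softmax into regions with tiny gradients. A minimal single-head NumPy sketch:

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V"""
    d_k = Q.shape[-1]
    scores = Q @ K.swapaxes(-2, -1) / np.sqrt(d_k)     # scaling factor sqrt(d_k)
    # Numerically stable softmax over the key dimension -> attention weights
    weights = np.exp(scores - scores.max(-1, keepdims=True))
    weights /= weights.sum(-1, keepdims=True)
    return weights @ V, weights

rng = np.random.default_rng(0)
Q = rng.standard_normal((4, 64))   # 4 query positions, d_k = 64
K = rng.standard_normal((4, 64))
V = rng.standard_normal((4, 64))
out, w = scaled_dot_product_attention(Q, K, V)
print(out.shape)  # (4, 64); each row of w sums to 1
```

Multi-head attention runs h of these in parallel on learned projections of Q, K and V and concatenates the results; the decoder's masked variant additionally sets future-position scores to -inf before the softmax.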
regards, saludos

Antonio Linares
www.fivetechsoft.com
Re: Transformers
Posted: Tue Sep 23, 2025 07:12 AM
flowchart LR
    E[Why Self-Attention] --> E1[Computational Complexity]
    E --> E2[Parallelization]
    E --> E3[Path Length]
    E --> E4[Comparison with RNN]
    E --> E5[Comparison with CNN]
    E --> E6[Interpretability]
regards, saludos

Antonio Linares
www.fivetechsoft.com
Re: Transformers
Posted: Tue Sep 23, 2025 07:13 AM
flowchart TB
    F[Training] --> F1[Training Data]
    F --> F2[Batching]
    F --> F3[Hardware Setup]
    F --> F4[Optimizer]
    F --> F5[Regularization]
    F --> F6[Learning Rate Schedule]

    F4 --> F4a[Adam Optimizer]
    F4 --> F4b[Beta Parameters]
    F4 --> F4c[Learning Rate]
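The learning-rate schedule in the outline above is the paper's warmup-then-decay formula, used with Adam (β1 = 0.9, β2 = 0.98, ε = 1e-9): lrate = d_model^-0.5 · min(step^-0.5, step · warmup_steps^-1.5), with warmup_steps = 4000. A one-line sketch:

```python
def transformer_lr(step, d_model=512, warmup=4000):
    # lrate = d_model^-0.5 * min(step^-0.5, step * warmup^-1.5)
    # Linear warmup for the first `warmup` steps, then inverse-sqrt decay
    return d_model ** -0.5 * min(step ** -0.5, step * warmup ** -1.5)

print(transformer_lr(100) < transformer_lr(4000))    # still warming up
print(transformer_lr(100000) < transformer_lr(4000)) # decaying after the peak
```

The rate rises linearly to its peak at step == warmup and then decays as 1/sqrt(step), which is why the two branches of the min() cross exactly at that step.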
regards, saludos

Antonio Linares
www.fivetechsoft.com
Re: Transformers
Posted: Tue Sep 23, 2025 07:14 AM
flowchart LR
    G[Results] --> G1[Translation Quality]
    G --> G2[Training Time]
    G --> G3[BLEU Scores]
    G --> G4[Comparison with Previous]
    G --> G5[English-German]
    G --> G6[English-French]

    G1 --> G1a[WMT 2014]
    G1 --> G1b[State-of-the-art]
regards, saludos

Antonio Linares
www.fivetechsoft.com
Re: Transformers
Posted: Tue Sep 23, 2025 07:15 AM
flowchart TB
    H[Conclusions] --> H1[Main Contributions]
    H --> H2[Future Directions]
    H --> H3[Limitations]
    H --> H4[Impact]

    H1 --> H1a[First Sequence Model]
    H1 --> H1b[Based Entirely on Attention]
    H1 --> H1c[Superior Performance]
regards, saludos

Antonio Linares
www.fivetechsoft.com
