Notitia Restante 🌱

Folder: ML/LLM

30 items under this folder.

  • May 06, 2025
    Enterprise RAG patterns

  • Jan 28, 2025
    Briefly about transformer’s evolution or why is softmax cool

  • Jan 14, 2025
    Advanced RAG techniques

  • Jan 12, 2025
    prompting

  • Jan 08, 2025
    how to evaluate LLM chatbots

  • Dec 30, 2024
    what can go wrong with LLMs

  • Dec 04, 2024
    query expansion

  • Nov 18, 2024
    Inference Scaling for Long-Context Retrieval Augmented Generation

  • Nov 18, 2024
    Lost in the Middle effect

  • Nov 14, 2024
    Evolution of embeddings

  • Nov 13, 2024
    prefix caching

  • Nov 13, 2024
    speculative decoding

  • Nov 12, 2024
    continuous batching

  • Nov 12, 2024
    inference optimization

  • Nov 09, 2024
    GPU characteristics

  • Nov 03, 2024
    LLM inference

  • Nov 03, 2024
    decoding strategy

  • Oct 25, 2024
    scaling laws

  • Sep 07, 2024
    quantization

  • Sep 04, 2024
    ROUGE

  • Sep 04, 2024
    Reinforcement Learning from Human Feedback

  • Sep 04, 2024
    reward model

  • Aug 31, 2024
    direct preference optimization

  • Aug 22, 2024
    positional encoding

  • Aug 20, 2024
    paper review - Llama 3 Herd of Models

  • Aug 15, 2024
    byte pair encoding

  • Aug 15, 2024
    perplexity

  • Aug 15, 2024
    tokenization

  • Feb 28, 2024
    Retrieval-Augmented Generation

  • Jan 20, 2024
    LLM

