LLMs for Ingesting Web Pages
Web element detection
OpenDILabCommunity/webpage_element_detection · Hugging Face
Layout parser - doc ingester
https://layout-parser.github.io/
YOLO Object Detection
https://www.v7labs.com/blog/yolo-object-detection
https://www.v7labs.com/blog/yolo-object-detection
multimodal LLMs
HPT 1.5 Air: A New Open-Sourced 8B Multimodal LLM with Llama 3
CSS and HTML LLM
Function Calling Optimized LLMs
Hermes-2 Mistral 7B - good for "Function Calling" (so LLM delegates creating/modifying an app, to the client)
MoE (Mixture of Experts) LLM
- faster inference
- can be more difficult to train
DBRX - a 'mixture of experts' LLM
MoE in depth article
Building and Fine-tuning Models
Deploying and Fine-tuning Deepseek
Fine-Tuning - NEL (Never Ending Learning)
Building a NN from scratch
Data and Data Science
Data science and data trends (via ML) Python libraries
Self-hosting LLMs
AWS Inferentia2 (inf2 AWS instances - cheaper than g5)
Faster Inference via TGI framework
Faster Inference via vLLM (even faster than TGI) framework
https://github.com/vllm-project/vllm
LLM GitHub accelerator projects
Prompt engineering guides
- The Prompt Report - https://arxiv.org/pdf/2406.06608
- https://www.promptingguide.ai
- Anthropic - [Answer Key] Anthropic's Prompt Engineering Interactive Tutorial [PUBLIC ACCESS] - Google Spreadsheets
AWS Bedrock Tips
- batch inference
AWS Inferentia - speculative decoding
AWS Prompt Routing and prompt caching
- can auto pick an LLM?
Comparing Anthropic Claude LLMs
Claude 3.5 Sonnet (new) vs Claude 3.7 Sonnet - Detailed Performance & Feature Comparison
Other LLMs
SOTA LLM by Richard
Transformer Models and Encoders
distilbert - can be trained to classify intent or detect a toxic user prompt
- faster version of BERT
LLMs and Security
Strengthening Your LLM Application Security: Prompt hacking and testing
Design Patterns against Prompt Injection
Monitoring LLMs and Costs
Token counts - for cost estimates
No comments:
Post a Comment