Wednesday, March 20, 2024

AI: LLMs, platforms (hardware) and projects

(AI in Business) - Applied Artificial Intelligence

https://www.bookdepository.com/Applied-Artificial-Intelligence-Where-AI-Can-Be-Used-Business-Francesco-Corea/9783319772516?ref=grid-view&qid=1592465308969&sr=1-1 


Web element detection

OpenDILabCommunity/webpage_element_detection · Hugging Face


Layout parser - doc ingester

https://layout-parser.github.io/




Data generation - Langchain


DistilBERT: Faster version of BERT


DBRX - a 'mixture of experts' LLM


Hermes-2 Mistral 7B - good for "Function Calling" (so LLM delegates creating/modifying an app, to the client)


multimodal LLMs
HPT 1.5 Air: A New Open-Sourced 8B Multimodal LLM with Llama 3


Datasets - Roboflow (like HuggingFace for data)

AWS Inferentia2 (inf2 AWS instances - cheaper than g5)

Prompt hacking and testing

Token counts - for cost estimates

LLM powered autonomous agents

Building a NN from scratch

Data science and data trends (via ML) Python libraries

evaluating LLMs
prometheus-2

LLM GitHub accelerator projects 



distilbert - can train it to classify intent or toxic

Prompt engineering guides

Tiny Multi-modal LLM - regions + OCR

AWS Bedrock Tips
- batch inference

AWS Inferentia - speculative decoding

AWS Prompt Routing and prompt caching
- can auto pick an LLM?

Wednesday, March 6, 2024

AI Papers, Books and Datasets

Learning Transferable Visual Models From Natural Language Supervision = 
https://arxiv.org/abs/2103.00020

Free Datasets

UI Modelling - annotated UIs

Datasets - Roboflow (like HuggingFace for data)

MoE (Mixture of Experts)

An Introduction to Vision-Language Modeling

Vision AI - VLMs and CNNs
- Vision Language Models
- Convolutional Neural Networks

Multi-modal LLMs

Movies: rated [for collaborative filtering]

The Society of Mind
Book by Marvin Minsky

AI Engineering[Book] [O'Reilly] [Zeki recommends]
same author: Designing Machine Learning Systems- DMLS focuses on building applications on top of traditional ML models, which involves more tabular data annotations, feature engineering, and model training