Lecture: Challenges and Real-World Applications of Machine Learning and Large Language Models
Jul 1, 2024
Introduction
Presenters: Nico (Research Scientist, Founder of CometML) and Dr. Doug Blank (Generative Media Expert, Founder of CometML)
Background on Comet: MLOps software to support machine learning (ML) model building and deployment
Focus: Real-world applications, challenges, and case studies related to ML and Large Language Models (LLMs)
Case Studies in Industry
Case Study 1: User Verification Model on a Dating App
Problem: Verify user-uploaded photos to detect non-genuine profiles
Dataset: Manually labeled images from the app
Model: Neural network classifier
Issue: Degradation in model performance over time (10%-15% decrease)
Initial hypothesis: Data drift
Retraining the model did not resolve the issue
Actual problem: New iPhone camera with higher resolution images
Solution: Added additional layers to the neural network to handle higher-resolution images
Lessons Learned
Regular system reviews and monitoring are crucial
External factors (e.g., new devices or camera hardware) can significantly impact model performance (see the drift-check sketch after this list)
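A minimal sketch, not from the talk, of the kind of monitoring these lessons call for: compare the resolution of recently uploaded photos against what the model saw at training time, so an external change such as a new phone camera is flagged before performance quietly degrades. The baseline and threshold values are illustrative assumptions.

```python
# Hypothetical input-drift check: alert when recent uploads are much larger
# than the images the classifier was trained on (e.g., after a new iPhone
# camera ships). Baseline and threshold are assumed values, not from the talk.
from statistics import mean
from PIL import Image

TRAIN_BASELINE_MEGAPIXELS = 1.0   # assumed average resolution of training images
DRIFT_RATIO_THRESHOLD = 1.5       # alert if recent uploads are 50% larger on average

def megapixels(path: str) -> float:
    """Resolution of an image file in megapixels."""
    with Image.open(path) as img:
        width, height = img.size
    return (width * height) / 1e6

def resolution_drift_detected(recent_upload_paths: list[str]) -> bool:
    """True if recent uploads are much larger than the training baseline."""
    recent_avg = mean(megapixels(p) for p in recent_upload_paths)
    return recent_avg / TRAIN_BASELINE_MEGAPIXELS > DRIFT_RATIO_THRESHOLD
```

In practice such a check would run on a daily sample of uploads and trigger an alert or a model review when it fires.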
Case Study 2: E-commerce Recommendation System
Problem: Listing and ad recommendations that maximize the likelihood of a click and a subsequent purchase
Dataset: Embeddings of search queries, item favorites, views, add-to-carts, and purchases
Model: Embedding model for retrieval and ranking
Issue: Difficulty surpassing the performance of the production model
Hypothesis: Seasonal trends, different local optimal points in training, consumer segmentation, and data distribution shifts
Actual problem: The new model was trained on embeddings produced by the production model, so it effectively learned the production model's own behavior (a feedback loop)
Solution: Built multiple retrieval agents in production to diversify the embedding data used for new training (sketched below)
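A minimal sketch of the "multiple retrieval agents" idea under assumed, hypothetical retrievers: candidates come from several independent sources, so the interactions logged for the next training run are not limited to what the current production embedding model already surfaces. Retriever names and the toy item ids are illustrative.

```python
# Gather candidates from several independent retrievers (hypothetical stand-ins)
# to diversify the data logged for future training runs.
from typing import Callable

Retriever = Callable[[str, int], list[str]]  # (query, k) -> ranked item ids

def production_embedding_retriever(query: str, k: int) -> list[str]:
    # Stand-in for the production model's nearest-neighbour lookup.
    return [f"emb_item_{i}" for i in range(k)]

def lexical_retriever(query: str, k: int) -> list[str]:
    # Stand-in for a keyword/BM25 retriever, independent of the embeddings.
    return [f"lex_item_{i}" for i in range(k)]

def popularity_retriever(query: str, k: int) -> list[str]:
    # Stand-in for a popularity- or category-based retriever.
    return [f"pop_item_{i}" for i in range(k)]

def gather_candidates(query: str, retrievers: list[Retriever], k: int = 20) -> list[str]:
    """Interleave candidates from each retriever (de-duplicated) so logged
    interactions cover a more diverse item set than any single retriever."""
    seen, merged = set(), []
    ranked_lists = [r(query, k) for r in retrievers]
    for rank in range(k):
        for ranked in ranked_lists:
            if rank < len(ranked) and ranked[rank] not in seen:
                seen.add(ranked[rank])
                merged.append(ranked[rank])
    return merged[:k]

candidates = gather_candidates(
    "running shoes",
    [production_embedding_retriever, lexical_retriever, popularity_retriever],
)
```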
Working with Large Language Models (LLMs)
Introduction to LLMs
Transformer-based models such as GPT (Generative Pre-trained Transformer)
Core idea: Encoding large blocks of text and generating output iteratively, one token at a time (see the sketch below)
Issues: Hallucinations, inaccuracies, and unrealistically varied outputs
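An illustrative-only sketch of that iterative generation loop: an autoregressive model repeatedly predicts a distribution over the next token and samples from it. The toy `next_token_distribution` stands in for a real LLM and is purely an assumption for illustration.

```python
# Toy autoregressive generation loop: predict, sample, append, repeat.
import random

def next_token_distribution(context: list[str]) -> dict[str, float]:
    # Stand-in for a trained model's next-token probabilities given the context.
    return {"the": 0.4, "cat": 0.3, "sat": 0.2, "<eos>": 0.1}

def generate(prompt: list[str], max_tokens: int = 10) -> list[str]:
    tokens = list(prompt)
    for _ in range(max_tokens):
        dist = next_token_distribution(tokens)
        # Probabilistic selection: sample the next token instead of always taking
        # the most likely one, which is one source of output variability.
        next_token = random.choices(list(dist), weights=list(dist.values()))[0]
        if next_token == "<eos>":
            break
        tokens.append(next_token)
    return tokens

print(" ".join(generate(["once"])))
```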
Common LLM Failures and Diagnostics
Example 1: Incorrect decoding results (ROT13 cipher decoding issue; see the reference check after this list)
Example 2: Inconsistent feedback in essay scoring
Example 3: Hallucinated information (e.g., fictional identities)
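For the ROT13 example, one simple diagnostic (illustrative, not from the lecture) is to compare the LLM's decoding against a ground-truth implementation; Python ships one in the `codecs` module. The sample strings below are made up.

```python
# Reference check: verify an LLM's ROT13 decoding against Python's built-in codec.
import codecs

def rot13(text: str) -> str:
    """Ground-truth ROT13 transform; ROT13 is its own inverse."""
    return codecs.encode(text, "rot_13")

ciphertext = rot13("Language models sometimes misdecode simple ciphers.")
llm_answer = "Language models sometimes misdecode simple ciphers"  # hypothetical model output

print("exact match:", llm_answer == rot13(ciphertext))
```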
Key Insights
LLM outputs are highly dependent on prompt engineering and probabilistic selection
Sensitive to low-frequency data and context
Potential solutions involve adjusting model parameters (e.g., temperature) and monitoring output token probabilities (see the sketch below)
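A rough sketch of that idea, assuming the OpenAI Python SDK (openai >= 1.x): request per-token log-probabilities, turn them into a crude confidence score, and flag low-confidence generations for review. The model name, temperature, and threshold are illustrative assumptions, not values from the lecture.

```python
# Generate a completion and compute a simple average token probability
# as a low-confidence signal. Assumes OPENAI_API_KEY is set in the environment.
import math
from openai import OpenAI

client = OpenAI()

def generate_with_confidence(prompt: str, min_avg_prob: float = 0.7):
    resp = client.chat.completions.create(
        model="gpt-4o-mini",            # illustrative model choice
        messages=[{"role": "user", "content": prompt}],
        temperature=0.2,                # lower temperature -> less variable output
        logprobs=True,                  # return per-token log-probabilities
    )
    choice = resp.choices[0]
    token_probs = [math.exp(t.logprob) for t in choice.logprobs.content]
    avg_prob = sum(token_probs) / len(token_probs)
    low_confidence = avg_prob < min_avg_prob  # crude signal to route for human review
    return choice.message.content, avg_prob, low_confidence
```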
Ethical Considerations and Practical Use Cases
Ethical implications: Misinformation, biased outputs, and negative social impacts
Practical applications: Coding assistance, content summarization, and creative inspiration
Issues in implementing LLMs for critical decisions or external customer interactions
Effective Production Strategies
Modular framework for tracking experiments and metrics
Use alerts, logging, and continuous monitoring (see the tracking sketch after this list)
Regular updates and maintenance to cope with changing real-world applications
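A minimal experiment-tracking sketch in the spirit of these strategies, using Comet's Python SDK (the presenters' tool). The project name, parameters, and metric values are illustrative; the API key is assumed to be configured in the environment.

```python
# Log a run's configuration and metrics so dashboards and alerts can track them.
from comet_ml import Experiment

experiment = Experiment(project_name="user-verification")  # hypothetical project name

# Record the configuration used for this training run for later comparison.
experiment.log_parameters({"learning_rate": 1e-3, "input_resolution": 1024})

# Record metrics over time so degradation can be caught by alerts.
for step, accuracy in enumerate([0.82, 0.86, 0.88]):
    experiment.log_metric("val_accuracy", accuracy, step=step)

experiment.end()
```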
Q&A Highlights
Grade-reversal hypothesis: observing the impact of grading order on the model's output
Methods to reduce hallucinations in LLMs via user feedback and fine-tuning
Challenges in monitoring model consistency and performance
Strategies in adapting ML models to external changes, like improved camera technology
Conclusion
Importance of MLOps and continuous model monitoring
Unpredictable nature of real-world applications prompts the need for regular review
Ethical deployment and adaptability are key to successful ML and LLM application
Final Words from Presenters
Importance of using comprehensive tools for ML model management (mention of CometML)
Encouragement to explore various tools and establish best practices