AI Wine Sommelier 🤵🏻‍♂️🍷

An AI-powered wine recommendation system that helps customers find the perfect bottle based on taste preferences, grape varietals, food and cheese pairings, or mood.

Find Your Wine

Personalized Recommendations

Sommelier's Notes

Quick Examples, Configuration & More

Quick Start 🚀

Head over to: https://ai-wine-som.streamlit.app/ and get your personalized wine recommendations.

Features ✨

Fast Loading: Optimized embeddings cache system
Smart Sampling: Demo mode with 2000 wines for faster performance
Robust Error Handling: Graceful degradation and fallbacks
Production Ready: Streamlined for Streamlit Cloud deployment
Enhanced UX: Clean interface with quick examples and filters

Performance Optimizations

⚡ Embedding Caching: Automatic save/load of computed embeddings
🧠 Memory Efficient: Optimized batch processing (16 samples default)
🎯 Smart Sampling: Use sample mode for demos and testing
💾 CPU Optimized: Forced CPU processing for deployment stability
🔍 Enhanced Search: Increased neighbor candidates for better results

Configuration Options

In Sidebar:

Sample Mode: Enable for faster demo with 2K wines
Batch Size: Lower = less memory usage
Embeddings Cache: Automatic caching for subsequent runs

Environment Variables:

GOOGLE_API_KEY: Enable AI-powered explanations
GEMINI_API_KEY: Alternative key name
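As a minimal sketch of how the two key names could be resolved, the helper below checks `GOOGLE_API_KEY` first and then `GEMINI_API_KEY`. The function name `resolve_api_key` and the precedence order are assumptions made for illustration; the app's actual lookup may differ.

```python
import os


def resolve_api_key(env=None):
    """Return the first non-empty key found, or None if neither is set.

    Hypothetical helper: checks GOOGLE_API_KEY, then GEMINI_API_KEY.
    `env` defaults to os.environ and can be any mapping (handy for tests).
    """
    env = os.environ if env is None else env
    for name in ("GOOGLE_API_KEY", "GEMINI_API_KEY"):
        value = env.get(name, "").strip()
        if value:
            return value
    return None


# AI explanations are optional, so a missing key just disables them.
api_key = resolve_api_key()
ai_enabled = api_key is not None
```

With neither variable set, `ai_enabled` is `False` and the app would be expected to use its template-based explanations instead.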

Production Features

✅ SSL Handling: Robust model downloading for deployment environments
✅ Error Recovery: Graceful degradation when services fail
✅ Memory Management: Optimized for Streamlit Cloud resource limits
✅ User Experience: Clean interface with helpful examples
✅ Caching Strategy: Smart embedding persistence

Quick Examples

Try these in the app:

"Bold red for BBQ under $25"
"Crisp white for seafood"
"Elegant wine for special dinner"
"Sweet wine for dessert"

Technical Stack

Frontend: Streamlit with custom config
ML Backend: SentenceTransformers (all-MiniLM-L6-v2)
Search: Scikit-learn NearestNeighbors with cosine similarity
AI: Google Gemini for explanations (optional)
Data: 130K+ wine reviews with smart sampling

Technical Overview

The AI Wine Sommelier combines three pieces to recommend wines from natural-language descriptions and preferences: sentence embeddings for semantic matching, nearest-neighbor search for retrieval, and an optional large language model for written explanations. Everything is served through a Streamlit interface.

Core AI Technologies

Semantic Text Embeddings with SentenceTransformer

Model Used: all-MiniLM-L6-v2 from the Sentence-Transformers library
Technology: This transformer-based model converts wine descriptions and user queries into dense vector embeddings (384 dimensions) that capture semantic meaning beyond simple keywords
Advantage: Enables understanding of context, synonyms, and related concepts in wine descriptions
Implementation: Direct integration with the Sentence-Transformers library, with the model downloaded from the Hugging Face Hub

Content-Based Recommendation Engine

Algorithm: Nearest Neighbors search with cosine similarity metric
Implementation: scikit-learn's NearestNeighbors with cosine distance for efficient vector similarity computation
Process: User queries are embedded in the same vector space as wine descriptions, allowing the system to find semantically similar wines regardless of exact keyword matches
Filtering: Additional dimensional filtering for price range, grape variety, and other attributes
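The retrieval step described above can be sketched in plain Python. This is an illustration under stated assumptions, not the project's code: the app uses scikit-learn's NearestNeighbors over real embeddings, while the sketch below uses hand-written toy 3-dimensional vectors, a hypothetical `recommend` function, and applies the price filter before ranking (the app may instead over-fetch neighbors and filter afterwards).

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def recommend(query_vec, wines, k=3, max_price=None):
    """Return the k wines most similar to query_vec, optionally capped by price."""
    candidates = [
        w for w in wines if max_price is None or w["price"] <= max_price
    ]
    ranked = sorted(
        candidates,
        key=lambda w: cosine_similarity(query_vec, w["embedding"]),
        reverse=True,
    )
    return ranked[:k]


# Toy data: made-up titles, prices, and embeddings for illustration only.
WINES = [
    {"title": "Toy Malbec", "price": 18.0, "embedding": [0.9, 0.1, 0.0]},
    {"title": "Toy Riesling", "price": 15.0, "embedding": [0.0, 0.2, 0.9]},
    {"title": "Toy Cabernet", "price": 60.0, "embedding": [1.0, 0.0, 0.0]},
]

top = recommend([1.0, 0.0, 0.0], WINES, k=2, max_price=25)
print([w["title"] for w in top])
```

Because the query and the wines live in the same vector space, the closest vector wins even when the query shares no keywords with the wine description. The price cap simply removes the Cabernet before ranking.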

Natural Language Generation with Google Gemini

Model: Google Gemini 1.5 Flash
Application: Generates natural, sommelier-style explanations for wine recommendations
Context-Awareness: Incorporates user requests, wine characteristics, and tasting notes to craft personalized explanations
Fallback System: Template-based explanations when Gemini API is unavailable
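The fallback behavior can be sketched as a wrapper that tries a text generator and drops back to a template on any failure. The names `explain` and `template_explanation`, the wine dictionary fields, and the injected `generate` callable are all hypothetical; the actual Gemini call is deliberately left out so the sketch does not guess at that API.

```python
def template_explanation(query, wine):
    """Build a plain explanation from the wine's own metadata."""
    return (
        f"{wine['title']} ({wine['variety']}) fits your request for "
        f"\"{query}\": {wine['description']}"
    )


def explain(query, wine, generate=None):
    """Return an explanation, preferring `generate` and falling back to a template.

    `generate` stands in for an LLM call: any callable taking (query, wine)
    and returning text. If it is missing, raises, or returns nothing useful,
    the template is used instead.
    """
    if generate is not None:
        try:
            text = generate(query, wine)
            if text and text.strip():
                return text.strip()
        except Exception:
            pass  # network error, quota, invalid key: fall through to the template
    return template_explanation(query, wine)
```

Passing the generator in as an argument keeps the fallback path testable without network access, and the user always receives some explanation.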

Technical Architecture

The application employs a hybrid AI architecture combining multiple models:

Embedding Layer: Transforms raw text descriptions into numerical vectors

- Uses the Sentence-BERT architecture for contextual understanding
- Dimensionality: 384 (the output size of all-MiniLM-L6-v2)
- Encodes both wine descriptions and user queries with the same model

Retrieval Layer: Implements efficient similarity search

- Nearest-neighbor index over the embedding vectors for fast retrieval
- Support for filtering criteria such as price and variety
- Maintains original metadata alongside vectors

Explanation Generation Layer:

- Connects to Google's Gemini API
- Prompt engineering to ensure concise, relevant explanations
- Structured output formatting

Caching System:

- Streamlit's caching mechanism for model persistence
- Embeddings storage/retrieval for performance optimization
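The embeddings storage and retrieval idea can be sketched as a load-or-compute helper. Everything here is an assumption for illustration: the function names, the JSON file format (the project itself saves `.npz` files; JSON keeps the sketch dependency-free), and the fingerprint check that invalidates the cache when the texts or model name change.

```python
import hashlib
import json
import os


def fingerprint(texts, model_name):
    """Hash the model name and every text so a changed dataset yields a new key."""
    h = hashlib.sha256(model_name.encode("utf-8"))
    for t in texts:
        h.update(b"\x00")  # separator so ["ab", "c"] and ["a", "bc"] differ
        h.update(t.encode("utf-8"))
    return h.hexdigest()


def load_or_compute(texts, model_name, embed_fn, cache_path):
    """Return (embeddings, cache_hit). Recomputes when the cache is missing or stale."""
    key = fingerprint(texts, model_name)
    if os.path.exists(cache_path):
        try:
            with open(cache_path, "r", encoding="utf-8") as f:
                cached = json.load(f)
            if (
                isinstance(cached, dict)
                and cached.get("key") == key
                and "embeddings" in cached
            ):
                return cached["embeddings"], True
        except (OSError, ValueError):
            pass  # unreadable or corrupt cache file: recompute below
    embeddings = embed_fn(texts)  # embed_fn must return JSON-serializable lists
    with open(cache_path, "w", encoding="utf-8") as f:
        json.dump({"key": key, "embeddings": embeddings}, f)
    return embeddings, False
```

The first run pays the embedding cost and writes the file; later runs with the same texts and model read it back, which is what makes subsequent app starts fast.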

AI Development Considerations

The application implements several AI best practices:

Robustness: The system includes template-based explanation fallbacks when external AI services are unavailable
Efficiency: Vector caching and batched processing reduce computational overhead
Explainability: The system doesn't just recommend wines but explains why they match the user's request
Adaptability: The modular design allows for easy model swapping or upgrading as better AI technologies become available

Future AI Enhancement Potential

The architecture supports several avenues for AI advancement:

Fine-tuning the embedding model on wine-specific language
Adding multi-modal capabilities to incorporate wine label images
Implementing personalized recommendations based on user preference history
Incorporating domain-specific wine knowledge graphs

This application demonstrates how multiple AI technologies can be integrated to create a practical, user-friendly application that brings expert-level wine knowledge to everyone through natural language interaction.

Performance Optimization Guide

The embedding process can be resource-intensive, especially with large datasets. Here are tips for optimizing performance:

For Faster Development/Testing

Use Data Sampling: Enable the "Use data sample" option in the sidebar to work with a smaller subset of wines.
Adjust Sample Size: Use the slider to find a balance between coverage and speed (1000-2000 wines is usually sufficient for testing).

Pre-compute Embeddings: Generate embeddings offline and save them to a file:

```
from src.recommender import Recommender
from src.utils import load_wine_dataset

# Load data
df = load_wine_dataset("data/wine_reviews.csv")

# Create and fit recommender
rec = Recommender()
rec.fit(df)

# Save embeddings for faster loading
rec.save_embeddings("data/embeddings.npz")
```

For Memory Optimization

Adjust Batch Size: Lower the batch size slider in the sidebar if you encounter memory issues.
Recommended Settings:
- 8-16: For very limited memory environments (e.g., shared hosting)
- 32: Good balance for most deployments
- 64-128: For environments with ample memory
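To see why a lower batch size reduces memory use, consider this minimal sketch of batched encoding. The helpers `iter_batches` and `embed_in_batches` are hypothetical, and `embed_fn` stands in for the real encoder; only one batch of texts is handed to the encoder at a time, so peak working memory scales with `batch_size` rather than with the dataset size.

```python
def iter_batches(items, batch_size=16):
    """Yield consecutive slices of `items`, each with at most `batch_size` elements."""
    if batch_size < 1:
        raise ValueError("batch_size must be >= 1")
    for start in range(0, len(items), batch_size):
        yield items[start:start + batch_size]


def embed_in_batches(texts, embed_fn, batch_size=16):
    """Encode `texts` one batch at a time and collect the results in order."""
    vectors = []
    for batch in iter_batches(texts, batch_size):
        vectors.extend(embed_fn(batch))  # encoder only ever sees one batch
    return vectors
```

In practice this loop rarely needs to be written by hand: `SentenceTransformer.encode` accepts a `batch_size` argument and batches internally. The trade-off is the same either way, with smaller batches lowering peak memory at the cost of more encoder calls.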

Local Development 👨🏻‍💻

Fork this repository to your GitHub account
Deploy on Streamlit Cloud:
- Go to share.streamlit.io
- Connect your GitHub account
- Select this repository
- Set main file: app/app.py
- Deploy!
LLM and Environment Setup (Optional):
- Add GOOGLE_API_KEY or GEMINI_API_KEY for AI explanations
- Get key from Google AI Studio
- Export it in your shell:
```
export GOOGLE_API_KEY="your_key_here"
```

Troubleshooting

If the app crashes during embedding computation:

Try using a smaller data sample
Reduce the batch size
Check for SSL certificate issues if deployed on Streamlit Cloud
Pre-compute embeddings locally and upload them to your deployment
