RTL Combinational Depth Predictor

An AI-based tool to predict the combinational logic depth of signals in RTL designs without running full synthesis, helping to identify potential timing violations early in the design process.

Problem Statement

Timing analysis is a crucial step in the design of any complex IP/SoC. However, timing analysis reports are generated after synthesis is complete, which is a very time-consuming process. This leads to overall delays in project execution time as timing violations can require architectural refactoring.

This tool uses machine learning to predict the combinational logic depth of signals in behavioral RTL, which can greatly speed up the timing analysis process.

Setup Instructions

Prerequisites

Python 3.8 or higher
Git
(Optional) Icarus Verilog for RTL parsing

Installation

Clone the repository:

git clone https://github.com/yourusername/rtl-depth-predictor.git
cd rtl-depth-predictor

Create and activate the virtual environment:

# On Unix/macOS
python3 -m venv venv
source venv/bin/activate

# On Windows
python -m venv venv
venv\Scripts\activate

Install the required dependencies:

pip install -r requirements.txt

Create necessary directories:

mkdir -p models plots

Usage

Running the Complete Pipeline

The easiest way to run the pipeline is using the run_pipeline.py script:

# On Unix/macOS
# Train the model and compare different model types
python3 run_pipeline.py --train --compare_models

# On Windows
# Train the model and compare different model types
python run_pipeline.py --train --compare_models

For predicting depth or running tests:

# On Unix/macOS
# Predict depth for a signal in an RTL file
python3 run_pipeline.py --predict --rtl_file path/to/your/rtl_file.v --signal signal_name

# Run tests
python3 run_pipeline.py --test

# On Windows
# Predict depth for a signal in an RTL file
python run_pipeline.py --predict --rtl_file path/to/your/rtl_file.v --signal signal_name

# Run tests
python run_pipeline.py --test

Training the Model

# On Unix/macOS
# Basic training
python3 src/train_model.py --data_path data/training_data.csv --test_data_path data/test_data.csv --model_output models/depth_predictor.joblib

# On Windows
# Basic training
python src/train_model.py --data_path data/training_data.csv --test_data_path data/test_data.csv --model_output models/depth_predictor.joblib

For comparing different model types:

# On Unix/macOS
python3 src/train_model.py --data_path data/training_data.csv --test_data_path data/test_data.csv --model_output models/depth_predictor.joblib --compare_models --plot_results

# On Windows
python src/train_model.py --data_path data/training_data.csv --test_data_path data/test_data.csv --model_output models/depth_predictor.joblib --compare_models --plot_results

Predicting Combinational Depth

# On Unix/macOS
python3 src/predict_depth.py --rtl_file path/to/your/rtl_file.v --signal signal_name --model_path models/depth_predictor.joblib

# On Windows
python src/predict_depth.py --rtl_file path/to/your/rtl_file.v --signal signal_name --model_path models/depth_predictor.joblib

Evaluating the Model

# On Unix/macOS
# Evaluate on test data
python3 evaluate_model.py

# Generate detailed visualizations
python3 visualize_results.py

# On Windows
# Evaluate on test data
python evaluate_model.py

# Generate detailed visualizations
python visualize_results.py

Running Tests

# On Unix/macOS
python3 -m unittest discover tests

# On Windows
python -m unittest discover tests

Running the Complete Pipeline with Shell Script

On Unix/macOS:

bash run_all.sh

On Windows (using Git Bash or WSL):

bash run_all.sh

Project Structure

rtl_depth_predictor/
├── data/                  # Training and test datasets
│   ├── training_data.csv  # Training dataset
│   ├── test_data.csv      # Test dataset
│   ├── sample_rtl_1.v     # Sample RTL file (counter)
│   ├── sample_rtl_2.v     # Sample RTL file (alu)
│   └── sample_rtl_3.v     # Sample RTL file (fifo_controller)
├── models/                # Trained model files
│   └── depth_predictor.joblib  # Saved model
├── notebooks/             # Jupyter notebooks for exploration
│   └── model_exploration.ipynb
├── src/                   # Source code
│   ├── feature_extraction.py  # RTL feature extraction
│   ├── model.py           # ML model definition
│   ├── train_model.py     # Model training script
│   └── predict_depth.py   # Prediction script
├── tests/                 # Test files
│   └── test_model.py      # Model tests
├── evaluate_model.py      # Model evaluation script
├── visualize_results.py   # Visualization script
├── run_pipeline.py        # Pipeline execution script
├── run_all.sh             # Shell script to run complete pipeline
├── requirements.txt       # Project dependencies
├── .gitignore             # Git ignore file
└── README.md              # Project documentation

Model Performance

The Random Forest Regressor model performs best on our enhanced dataset with the following metrics:

MSE: 0.7233 (improved from 0.8135)
MAE: 0.4910 (improved from 0.6238)
R²: 0.8375 (improved from 0.5119)
Within ±1 depth level: 86.67% (improved from 77.78%)

Enhanced Dataset

We significantly improved the model's performance by enhancing the dataset:

Enhanced Training Data:
- Added 60 new examples covering 5 additional module types (FPU, DSP, Video Processor, Crypto Engine, PCIe Controller, and Cache Controller)
- Included more complex circuits with higher combinational depths (up to 12)
- Added more diverse signal types with various combinations of operators
Enhanced Test Data:
- Added 12 new test examples from the new module types
- Included signals with higher combinational depths to test the model's ability to predict more complex circuits

The significant improvement in model performance after enhancing the dataset demonstrates the importance of diverse and representative training data in machine learning applications for hardware design.

Approach

Data Collection: Generate synthetic RTL designs and extract their combinational depth using synthesis tools.
Feature Engineering: Extract relevant features from RTL code that influence combinational depth.
Model Selection: Compare different ML algorithms to find the best predictor.
Training: Train the selected model on the dataset.
Evaluation: Evaluate the model's accuracy and performance.
Dataset Enhancement: Expand the dataset with more diverse RTL designs and module types.
Model Refinement: Retrain and optimize the model with the enhanced dataset.

License

MIT

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

Repository files navigation

RTL Combinational Depth Predictor

Problem Statement

Setup Instructions

Prerequisites

Installation

Usage

Running the Complete Pipeline

Training the Model

Predicting Combinational Depth

Evaluating the Model

Running Tests

Running the Complete Pipeline with Shell Script

Project Structure

Model Performance

Enhanced Dataset

Approach

License

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
data		data
notebooks		notebooks
src		src
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
evaluate_model.py		evaluate_model.py
parser.out		parser.out
parsetab.py		parsetab.py
requirements.txt		requirements.txt
run_all.sh		run_all.sh
run_pipeline.py		run_pipeline.py
solution_document.md		solution_document.md
visualize_results.py		visualize_results.py

Uh oh!

License

Uh oh!

Hijanhv/Google-Girl-Hackathon_2025

Folders and files

Latest commit

History

Repository files navigation

RTL Combinational Depth Predictor

Problem Statement

Setup Instructions

Prerequisites

Installation

Usage

Running the Complete Pipeline

Training the Model

Predicting Combinational Depth

Evaluating the Model

Running Tests

Running the Complete Pipeline with Shell Script

Project Structure

Model Performance

Enhanced Dataset

Approach

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages