This repository contains analysis of IMDB data from multiple sources and analysis of movies/cast/box office revenues, movie brands and franchises.
Updated Jun 1, 2020 - TSQL
Reedelk Runtime Platform Community Edition
Hospital Database Management System (DBMS) is a comprehensive SQL project designed to streamline and optimize the management of hospital operations. This project aims to provide an efficient and user-friendly solution for storing, retrieving, and manipulating various types of healthcare-related data.
Information Integration Architecture (IIA CSE656) course project at IIIT-Delhi: end-to-end ETL pipelines, global-schema mapping, federated SQL querying, and AI-driven analytics for restaurant & vendor data. Built with Python, React, and LLM-powered natural-language interfaces.
AIDevs project files
Integrating multimodal data through heterogeneous ensembles
Uses Rapid API to fetch IMDb data, filter it, and upload it into separate tables in a MySQL database, all in one click using Talend.
A project to enhance ontology matching accuracy using Large Language Models (LLMs) like S-BERT.
This repository documents a project detailing a step-by-step approach to scraping data, integrating data from various sources, and analyzing the combined data. It also shows how APIs can be harnessed for data engineering operations. In this project, the Foursquare API was used for the location data.
Combining multimodality histopathology images for integrated cancer research
Describe the different entities that form a modern data ecosystem. Describe and differentiate between the roles and responsibilities of Data Engineers, Data Scientists, Data Analysts, Business Analysts, and Business Intelligence Analysts. Explain what Data Engineering is. List the tasks that need to be performed in a typical data engineering life…
Farr, M. T., D. S. Green, K. E. Holekamp, and E. F. Zipkin. 2020. Integrating distance sampling and presence-only data to estimate species abundance. Ecology 00(00):e03204. 10.1002/ecy.3204
Project involves merging customer reviews from Fudgemart and FudgeFlix to create a unified data warehouse using Kimball's approach. Utilizing Power BI, it aims to extract actionable insights for Fudge Inc., guiding strategic decisions, product enhancements, and market expansion based on comprehensive business intelligence.
Integrates data from "Orderline.csv" and "Product.csv" using Talend, filtering by price and performing inner and left joins to extract insights and support data warehouse integration with Microsoft SQL Server.
A lab for DataAnalytics | DataEngineering | AnalyticsEngineering | DataScience | DataVisualization | BusinessIntelligence
Building a modern data warehouse using PostgreSQL, covering the full data pipeline from raw data ingestion to analytics. This includes designing robust data models, implementing ETL processes, and organizing data into bronze, silver, and gold layers to support efficient analysis and reporting.
Project for the Microsoft Innovation Challenge Hackathon, using public data to improve knowledge management in global health. We facilitate inter-institutional collaboration and evidence-based decision-making among agencies, companies, and organizations.
Hormone Therapy Decision Support System for Breast Cancer
Data integration and other data related programs
MeshJoin-Streaming-ETL-Data-Warehouse integrates real-time transactional data with master data using the Mesh Join algorithm. It processes and enriches data, then loads it into a data warehouse for analysis, leveraging efficient ETL processes and OLAP-ready SQL queries.