Skip to content

πŸͺ 4- Social Buss: NLP - Class 1 : This repository provides resources and practical implementations for Natural Language Processing (NLP) focused on social media data analysis. It includes tutorials and demos on NLP preprocessing techniques such as regex, tokenization, lemmatization, stemming, count vectorization, and stopword removal.

License

Notifications You must be signed in to change notification settings

Mindful-AI-Assistants/4-social-buzz-ai--Natural_Language_Processing-NL-Class_1


[πŸ‡§πŸ‡· PortuguΓͺs] [πŸ‡ΊπŸ‡Έ English]


4- Social Buzz AI - Natural Language Processing (NLP) - Class 1


A repository for research, implementation, and best practices with Gradient Boosting methods (GBM, XGBoost, LightGBM), H2O AutoML, and robust strategies for modeling extreme class imbalance ("Low Default") in data science for finance and risk.





Course: Humanistic AI & Data Science (4th Semester)
Institution: PUC-SP
Professor: ✨ Rooney Ribeiro Albuquerque Coelho



Sponsor Mindful AI Assistants



Tip

This repository 2-social-buzz-ai-GBoost-and-LowDefault-Modeling is part of the main project 1-social-buzz-ai-main. To explore all related materials, analyses, and notebooks, visit the main repository

  • 1-social-buzz-ai-main Part of the Humanistic AI Research & Data Modeling Series β€” where data meets human insight.



Important

⚠️ Heads Up




Tip

  • Access Workbook - (Class 1)

  • Access: πŸ‡¬πŸ‡§ 1- NLP_Pre_Processing_ENGLISH

  • Access: πŸ‡§πŸ‡· 1-Code NLP_Pre_Processing_Portuguese

  • Access: NLP - Class 2 Repo




πŸŽ₯ DEMO - NLP Cascade Pre-Processing Pipeline




1-NLP_PreProcessing_Regex.mov




Β Β 

2-NLP_PreProcessing_Tokenizer.using.NLTK.mov




3-NLP_PreProcessing_.Lemma.mov




4-NLP_PreProcessing_Radicalisation.mov




5-NLP_PreProcessing_Count.Vectorizer.Bag-of-Words._.ConvertsTex_.into_Featur_.CountMatrix.mov




6-NLP_PreProcessing_Stopword.Removal.Remove.Common.Words.Stopwords.1.mov

















πŸ›ΈΰΉ‹ My Contacts Hub




────────────── βŠΉπŸ”­ΰΉ‹ ──────────────

➣➒➀ Back to Top

Copyright 2025 Mindful-AI-Assistants. Code released under the MIT license.

About

πŸͺ 4- Social Buss: NLP - Class 1 : This repository provides resources and practical implementations for Natural Language Processing (NLP) focused on social media data analysis. It includes tutorials and demos on NLP preprocessing techniques such as regex, tokenization, lemmatization, stemming, count vectorization, and stopword removal.

Topics

Resources

License

Code of conduct

Contributing

Security policy

Stars

Watchers

Forks

Sponsor this project