Loan Default Prediction
Utilized supervised machine learning models, including Logistic Regression, Random Forest and GBM, to predict mortgage loan defaults. Conducted thorough exploratory data analysis (EDA) and feature engineering, improving model performance by enhancing key metrics such as AUC by 20% through hyperparameter tuning and sampling techniques.
Fan App for Ed Sheeran's World Tour
Developed an interactive web application using R Shiny to simulate Ed Sheeran’s World Tour data. Designed and implemented a user-friendly interface with real-time visualizations and insights on tour statistics, venue information, and fan engagement metrics, leveraging ETL processes and data normalization for efficient data management.
Healthcare Provider Fruad Detection
Aenean ornare velit lacus, ac varius enim lorem ullamcorper dolore. Proin aliquam facilisis ante interdum. Sed nulla amet lorem feugiat tempus aliquam.
NYC Public Budget Expense Analysis
Developed a Flask web app integrated with both MongoDB and PostgreSQL to analyze NYC government agencies' budget expenses. The app enables users to explore over 2M+ records, visualizing trends and insights in public budget allocations. This project supports rational cost-saving strategies and optimizes resource allocation for city planning and development.
NYC Motor Vehicle Accidents Rate Analysis
Analyzed NYC motor vehicle accident cases in 2022. We used Tableau to map collisions and cluster data based on accident trends. The project examined patterns by day, time, and accident type, providing actionable insights for resource deployment and strategic decision-making for reducing accidents.
Sportify Music Rating Prediction
Used R to predict user ratings of music tracks on Sportify. Applied machine learning models and statistical analysis to identify factors influencing user preferences and music ratings, enhancing the accuracy of prediction models for personalized music recommendations.