rosalinatorres888/README.md

Hi, I'm Rosalina Torres 👋

MS Data Analytics Engineering student @ Northeastern University (expected April 2026) specializing in ML/AI systems and production data pipelines. I build intelligent, scalable systems that solve real problems, and my deployed ML applications currently serve 1000+ users in production.


🏗️ Portfolio Architecture

My projects form an interconnected ML ecosystem, not isolated demos:

Portfolio Architecture

This diagram illustrates how my 15+ projects connect: from data foundation (UCI HAR 10k+ samples, Federal Economic Data) through AI-powered strategic pillars (MotionInsight, Game Theory, Career Intelligence) to production-deployed applications (Streamlit dashboards serving real users), all orchestrated by cost-effective local LLM infrastructure.

Architecture Highlights:

  • Data Foundation: UCI HAR Dataset (10k+ samples), Federal Economic Data, Portfolio & Resume Data
  • Strategic AI Pillars: MotionInsight (Entropy-Complexity), Game Theory (Nash Equilibrium), Career Intelligence (Archetype Matching)
  • Deployed Applications: Production Streamlit/React dashboards serving 1000+ users
  • Infrastructure: Ollama/Local LLMs (Llama 3, Mistral) reducing API costs by 90%
  • Cross-Project Synergy: Agent orchestration enabling reusable ML components
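
The Game Theory pillar above is described only by name, so the following is a minimal illustrative sketch rather than the project's actual code. It assumes a two-player game given as two payoff matrices and finds pure-strategy Nash equilibria by brute force. The function name `pure_nash_equilibria` and the Prisoner's Dilemma payoffs are hypothetical, chosen only for the example.

```python
from itertools import product


def pure_nash_equilibria(payoff_a, payoff_b):
    """Return all pure-strategy Nash equilibria of a two-player game.

    payoff_a[i][j] and payoff_b[i][j] are the payoffs to the row and
    column player when the row player picks i and the column player picks j.
    """
    n_rows, n_cols = len(payoff_a), len(payoff_a[0])
    equilibria = []
    for i, j in product(range(n_rows), range(n_cols)):
        # Row player cannot gain by switching rows while the column stays at j.
        row_best = all(payoff_a[i][j] >= payoff_a[k][j] for k in range(n_rows))
        # Column player cannot gain by switching columns while the row stays at i.
        col_best = all(payoff_b[i][j] >= payoff_b[i][k] for k in range(n_cols))
        if row_best and col_best:
            equilibria.append((i, j))
    return equilibria


# Hypothetical example: Prisoner's Dilemma, 0 = cooperate, 1 = defect.
PD_ROW = [[-1, -3], [0, -2]]
PD_COL = [[-1, 0], [-3, -2]]
print(pure_nash_equilibria(PD_ROW, PD_COL))
```

Brute force is enough for small games; games with no pure-strategy equilibrium (such as matching pennies) return an empty list and would need a mixed-strategy solver instead.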

⭐ Production Impact

Real systems serving real users:

| System | Users | Uptime | Impact |
| --- | --- | --- | --- |
| 🏥 Boston Heatwave Monitor | 1000+ | 99.9% | Public health early warning system |
| ⚡ Crypto ML Pipeline | In Dev | - | 85% prediction accuracy on live data |
| 🎯 Career Intelligence | Active | - | 70% workflow automation, 45% response rate boost |

🚀 View Live Dashboard ← Production ML system with 1000+ active users

🚀 Currently

  • 🎓 MS Data Analytics Engineering @ Northeastern (GPA: 4.0)
  • 🤖 Specializing in ML/AI, Semantic Matching, Production Pipelines
  • 🔍 Seeking Data Engineering Internships for January 2026
  • 💡 Building autonomous AI career assistant (ARIA)

Available immediately for ML/AI Engineering internships and full-time positions

  • 📍 Open to relocation | Remote-friendly
  • 💼 Authorized to work in the US
  • 📅 Can start: Immediately

💼 Experience

AI Data Trainer (Bilingual) @ Alignerr (by Labelbox) (2023 - Present)

  • Working with generative AI and large language models for data labeling and model evaluation
  • Technology Focus: Specialized in LLM evaluation for factual accuracy and ethical integrity
  • Platforms: Generative AI alignment tools and human-in-the-loop ML systems

Regional Manager, Channel & Enterprise Sales (LATAM) @ Collibra (2018 - 2021)

  • Led data intelligence solution sales across LATAM region
  • Technology Focus: Enterprise-wide data governance, data catalog, and data intelligence
  • Platform Expertise: Data lineage, metadata management, and AI governance frameworks

Regional Sales Manager, Data Protection & Disaster Recovery @ Zerto (2015 - 2019)

  • Consistently exceeded quotas (up to 257%), earning Global Sales of the Year honors
  • Technology Focus: IT resilience platforms for cloud data protection and disaster recovery
  • Platform Expertise: Enterprise-grade continuous data protection and cloud mobility

Business Development Executive, Cloud, Middleware & Database @ Oracle Corp (Earlier Career)

  • Exceeded quarterly targets by 135%, earning Top Gun and Fast Start awards
  • Technology Focus: Database management systems, cloud infrastructure, and middleware
  • Platform Expertise: Oracle Cloud Infrastructure, Database Management Systems, Middleware Solutions

🛠️ Tech Stack

Languages & Frameworks

Python R SQL JavaScript

ML/AI Frameworks

TensorFlow PyTorch Scikit-learn

Data & MLOps

Pandas NumPy Apache Airflow PostgreSQL

Cloud & Deployment

AWS Docker Streamlit Git


📊 GitHub Stats


🎯 Featured Projects

| Project | Impact | Tech Stack | Links |
| --- | --- | --- | --- |
| Human Activity Monitoring | 🔥 1000+ Active Users | Python, Streamlit, Ensemble ML | Live Demo · Code |
| Crypto ML Pipeline | ⚡ 85% Prediction Accuracy | TensorFlow, Airflow, PostgreSQL | Live Demo · Code |
| Democracy Clustering | 📊 195 Countries · 7 Clusters | R, K-means, 0.89 Silhouette | Code |
| Network Intelligence | 🕸️ 0.73 Correlation Discovery | NetworkX, NLP, Graph Analysis | Code |
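
The Democracy Clustering entry reports a silhouette score. As a reminder of what that metric measures, here is a minimal pure-Python sketch of the mean silhouette coefficient. It is not the project's R code: the function name `silhouette_score`, the Euclidean distance choice, and the toy `POINTS`/`LABELS` data are assumptions made for illustration.

```python
from math import dist


def silhouette_score(points, labels):
    """Mean silhouette coefficient over all points (O(n^2), Euclidean)."""
    clusters = {}
    for p, lab in zip(points, labels):
        clusters.setdefault(lab, []).append(p)
    if len(clusters) < 2:
        raise ValueError("silhouette needs at least two clusters")

    scores = []
    for p, lab in zip(points, labels):
        own = clusters[lab]
        if len(own) == 1:
            scores.append(0.0)  # common convention: singleton clusters score 0
            continue
        # a: mean distance to the other members of the point's own cluster
        a = sum(dist(p, q) for q in own) / (len(own) - 1)
        # b: smallest mean distance to the members of any other cluster
        b = min(
            sum(dist(p, q) for q in members) / len(members)
            for other, members in clusters.items()
            if other != lab
        )
        denom = max(a, b)
        scores.append((b - a) / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Hypothetical toy data: two tight, well-separated clusters.
POINTS = [(0, 0), (0, 1), (10, 0), (10, 1)]
LABELS = [0, 0, 1, 1]
print(round(silhouette_score(POINTS, LABELS), 3))
```

Scores near 1 indicate compact, well-separated clusters; scores near 0 indicate overlapping clusters.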

🎓 Education

Northeastern University
M.S. Data Analytics Engineering | Boston, MA | Expected 2026

  • 4.0 GPA
  • Focus: Machine Learning & Artificial Intelligence

Bridgewater State University
B.S. Economics | Boston, MA

University of Limerick
Study Abroad: European Union Economics & Monetary Policy Analysis | Ireland


📜 Certifications

  • AWS Cloud Practitioner Certified
  • Google Data Analytics Professional
  • Generative AI Specialization Learning Path

🧠 Technical Skills

Machine Learning

  • Models Built: Neural Networks, Random Forests, XGBoost, LSTM, BERT
  • Frameworks: TensorFlow, PyTorch, Keras, Scikit-learn
  • MLOps: Model deployment, A/B testing, monitoring
  • Current Focus: LLMs, Generative AI, Production ML Systems

Core Competencies

  • Data-driven decision making
  • Complex problem solving
  • Strategic planning & execution
  • Cross-functional collaboration
  • Technical concept explanation
  • AI governance & ethics
  • Bilingual communication (English, Spanish)

📫 Connect With Me

LinkedIn Portfolio Email


💡 Open to collaboration on ML/AI projects and internship opportunities!

Last updated: December 2024

Pinned

  1. Network-Word-Frequency-Analysis-for-Data-Mining (Public)

    A collection of articles for data analysis and research purposes.

    Jupyter Notebook

  2. democracy-clustering-analysis (Public)

    195 countries analyzed

    HTML

  3. human-activity-entropy (Public)

    1000+ active users

    Jupyter Notebook

  4. rosalinatorres888 (Public)