Skip to content
View BenjaminADecosta's full-sized avatar

Block or report BenjaminADecosta

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
BenjaminADecosta/README.md

Hi, I’m Benjamin deCosta 👋

🎓 Honours B.Sc. Computer Science & Mathematics (Statistics Minor) @ University of Toronto
📍 Toronto, Canada
💡 Interests: Machine Learning, Data Engineering, Risk & Finance, Systems


🚀 About Me

I’m a CS & Math student with a strong focus on machine learning, data systems, and risk-aware modeling. I enjoy building production-style pipelines end-to-end — from raw data ingestion and feature engineering to evaluation, deployment artifacts, and documentation.

My work spans reinforcement learning for trading, credit risk modeling, large-scale time-series forecasting, and software/code security auditing. I care deeply about reproducibility, clean architecture, and turning complex models into systems that can actually be used.


🧠 Research

Balancing Profit and Risk: Hybrid RL for Algorithmic Trading
Apr 2025 – Sep 2025

  • Built a modular RL trading framework with risk-aware reward functions (volatility, drawdown, Sharpe-hybrid).
  • Implemented Q-Learning and DQN agents with stabilizing techniques.
  • Achieved +27% return improvement, Sharpe ≈ 1.93, and reduced max drawdown from 30% → 15% across 53 equities/ETFs.

🛠️ Featured Projects

🔹 TechnationAI

Python · FastAPI · LangChain · Backend Systems

  • Built TechnationAI, an AI assistant project focused on clean APIs and modular backend design.
  • Designed components for tool usage, prompt orchestration, and extensibility for future features.

🔗 https://github.com/BenjaminADecosta/TechnationAI


🔹 AWS Credit Risk Pipeline

Python · PySpark · AWS EMR/S3 · scikit-learn

  • Built a production-style ETL pipeline using PySpark on AWS EMR to transform raw credit data into curated Parquet feature datasets stored in S3.
  • Designed a config-driven architecture (YAML) separating data contracts, pipelines, training, and artifacts for reproducible runs.
  • Implemented feature engineering + dataset standardization to support scalable training workflows.

🔗 https://github.com/BenjaminADecosta/AWS-Credit-Risk-Pipeline


🔹 Ability (Accessibility iOS App)

iOS · Accessibility · 3D Printing · Product Design

  • Accessibility-focused app empowering users (especially those with disabilities) to create custom 3D-printable assistive devices.
  • Built around user-driven customization with a focus on inclusive, practical everyday tooling.

🔗 https://github.com/jacobamobin/Ability


🔹 EMS Incident Forecaster

Pandas · SARIMA · Ridge Regression · scikit-learn

  • Forecasted weekly NYC EMS demand using 1.2M+ incident records and weather data.
  • Implemented automated backtesting and 26-week forecasts with borough-level confidence intervals.

🔗 https://github.com/BenjaminADecosta/EMS-Incident-Forecaster

💼 Experience

Security Code Auditor (Contract)Hssndusqooq LLC

  • Audited a React/Firebase creator platform (auth, storage rules, APIs, logging).
  • Identified security & privacy gaps with file-level evidence and repro steps.
  • Delivered an executive-ready remediation roadmap prioritizing real-world risk.

Machine Learning DeveloperQAIFS @ UofT

  • Built reusable regression, classification, and forecasting modules.
  • Developed modular Python data pipelines for cleaning, validation, and transformation.
  • Collaborated via Git with reproducible workflows and experiment tracking.

🧰 Tech Stack

Languages: Python, Java, SQL, JavaScript, C/C++, R
ML & Data: PySpark, Spark SQL, Pandas, NumPy, scikit-learn, PyTorch, Hadoop, Parquet
Cloud & Tools: AWS (EMR, S3, SageMaker), Git, Jupyter, VS Code, PyCharm, IntelliJ


📫 Let’s Connect


Pinned repositories below highlight selected work in ML, data engineering, and applied systems.

Pinned Loading

  1. jacobamobin/Ability jacobamobin/Ability Public

    Ability is an accessibility-focused iOS app that empowers users, especially those with disabilities, to create custom 3D-printable assistive devices. By providing text descriptions and optional ima…

    Python 2 1

  2. TechnationAI TechnationAI Public

    Python

  3. XDAathi/DS3-Datathon XDAathi/DS3-Datathon Public

    This repo is our submission to the anual 2025 DS3 Datathon. This competion was from Feb 17-22, and consisted of 3 Problems Our team placed 9th place, out of 180 participants, with a max group size …

    Jupyter Notebook 2

  4. EMS-Incident-Forecaster EMS-Incident-Forecaster Public

    NYC EMS Forecasting — Weekly emergency medical service call predictions using SARIMA and Ridge regression with historical NYC EMS data and Central Park weather features.

  5. AWS-Credit-Risk-Pipeline AWS-Credit-Risk-Pipeline Public

    Python

  6. Personal-Finance-Tracker Personal-Finance-Tracker Public

    CSC207 Proj

    Java