BABTUNA/open_source_finder

op_finder

Find SF-based companies with active open source projects worth contributing to.

Pipeline

scrapers/  -> data/raw/*.csv  -> normalize -> data/companies.csv -> github_score -> data/ranked.csv

Setup

pip install -r requirements.txt
cp .env.example .env
# add your GITHUB_TOKEN to .env

Usage

python run.py              # run everything
python run.py scrape       # all scrapers only
python run.py normalize    # merge + SF-filter raw csvs
python run.py score        # GitHub enrichment + ranking
python run.py hn           # individual: hn_hiring
python run.py todo         # individual: todo_ospo
python run.py oc           # individual: opencollective
python run.py kv           # individual: keyvalues
python run.py gh           # individual: github_sponsors
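The subcommands above map onto pipeline stages. As a rough sketch only (the real run.py may be organized differently), a dispatcher could resolve each command to an ordered list of stage names. The `SCRAPERS` mapping and the `stages_for` function are hypothetical names; the stage names themselves come from the usage comments.

```python
# Hypothetical mapping from short command names to scraper names,
# taken from the "individual:" comments in the usage listing.
SCRAPERS = {
    "hn": "hn_hiring",
    "todo": "todo_ospo",
    "oc": "opencollective",
    "kv": "keyvalues",
    "gh": "github_sponsors",
}


def stages_for(command=None):
    """Return the ordered stage names a command would run (illustrative only)."""
    if command is None:
        # No argument: run every scraper, then normalize, then score.
        return list(SCRAPERS.values()) + ["normalize", "score"]
    if command == "scrape":
        return list(SCRAPERS.values())
    if command in ("normalize", "score"):
        return [command]
    if command in SCRAPERS:
        return [SCRAPERS[command]]
    raise ValueError(f"unknown command: {command!r}")
```

Calling `stages_for("kv")` would yield only the keyvalues scraper, while `stages_for()` follows the full pipeline order shown under Pipeline.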

Sources

| Name | Source | Notes |
| --- | --- | --- |
| hn | HN "Who is hiring?" 2022-2026 via Algolia | SF-filtered at scrape time |
| todo | TODO Group OSPO landscape.yml | Location enriched later |
| oc | Open Collective GraphQL | SF-filtered at scrape time |
| kv | keyvalues.com directory | SF-filtered at scrape time |
| gh | GitHub Sponsors via seed crawl | Location enriched later |
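To make the normalize step more concrete, here is a minimal sketch of merging rows from several sources and keeping only SF companies. It is not the project's actual code: the column names (`name`, `location`, `source`), the `SF_PATTERN` regex, and the `keep_unknown` flag are all assumptions. In particular, keeping rows with no location (so they can be enriched later, as the todo and gh sources require) is a guess at the intended behavior.

```python
import re

# Assumed heuristic: a location counts as SF if it mentions
# "San Francisco" or the standalone token "SF".
SF_PATTERN = re.compile(r"\b(san francisco|sf)\b", re.IGNORECASE)


def is_sf(location):
    """Return True if the location string looks like San Francisco."""
    return bool(SF_PATTERN.search(location))


def normalize(rows, keep_unknown=True):
    """Merge rows by case-insensitive company name and drop non-SF rows.

    rows: iterable of dicts with hypothetical keys name, location, source.
    keep_unknown: keep rows whose location is empty (to be enriched later).
    """
    merged = {}
    for row in rows:
        name = row.get("name", "").strip()
        if not name:
            continue
        location = row.get("location", "").strip()
        if location and not is_sf(location):
            continue  # known location outside SF
        if not location and not keep_unknown:
            continue  # unknown location, caller wants strict filtering
        source = row.get("source", "")
        entry = merged.setdefault(
            name.lower(), {"name": name, "location": location, "sources": []}
        )
        if source not in entry["sources"]:
            entry["sources"].append(source)
        if not entry["location"]:
            entry["location"] = location  # fill in once a source provides it
    return list(merged.values())
```

With this shape, a company found by both hn and todo collapses into one row that records both sources, which keeps data/companies.csv free of duplicates before scoring.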

Output

data/ranked.csv is sorted by contributability score:

score = external_pr_authors_90d * 10
      + good_first_issues * 3
      + min(repo_count, 50)
      + recent_commits_90d * 0.1

The external_pr_authors_90d metric carries the largest weight because it is the strongest signal: it tells you the company actually merges PRs from outside contributors.
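The formula above translates directly into a small function. The sketch below is illustrative rather than the project's implementation: the function names `contributability_score` and `rank` are hypothetical, and sorting from highest to lowest score is an assumption about how data/ranked.csv is ordered.

```python
def contributability_score(external_pr_authors_90d, good_first_issues,
                           repo_count, recent_commits_90d):
    """Compute the score exactly as written in the formula above."""
    return (
        external_pr_authors_90d * 10
        + good_first_issues * 3
        + min(repo_count, 50)          # repo count is capped at 50
        + recent_commits_90d * 0.1
    )


def rank(rows):
    """Sort metric dicts by score, highest first (assumed ordering)."""
    return sorted(
        rows,
        key=lambda r: contributability_score(
            r["external_pr_authors_90d"],
            r["good_first_issues"],
            r["repo_count"],
            r["recent_commits_90d"],
        ),
        reverse=True,
    )
```

As a worked case, a company with 4 external PR authors, 5 good first issues, 120 repos, and 300 recent commits gets 40 + 15 + 50 + 30 points. The repo term contributes only 50 because of the cap, so a large organization cannot outrank a smaller one on repository count alone.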

About

finding good open sources to work on is lowkey hella tuff
