
Indy Explorer

Indy Explorer is a Streamlit app designed to help you browse and explore the resorts included in the Indy Pass.

Features

  • Resort Data Extraction: Uses BeautifulSoup to scrape data from the Indy Pass resort pages.
  • Location Normalization: Utilizes the Google Maps Geocoding API to normalize location data.
  • Interactive UI: Built with Streamlit so you can explore resort information interactively.

Data Source

Data is sourced from indyskipass.com as of December 23, 2025.

Installation

  1. Clone the repository:

    git clone https://github.com/jonathanstelman/indy-explorer.git
    cd indy-explorer
  2. Install Poetry:

    pipx install poetry
  3. Install the required dependencies:

    poetry install
  4. Set up your Mapbox API token:

    • Sign up for a free Mapbox account.
    • Create an access token in your Mapbox account dashboard.
    • Add your token to .streamlit/secrets.toml:
      MAPBOX_TOKEN = "your_mapbox_token_here"
      Or set it as an environment variable:
      export MAPBOX_TOKEN=your_mapbox_token_here
  5. Run the Streamlit app:

    poetry run streamlit run src/app.py

Usage

  1. Fetch Resort Data: Use the get_page_html function to fetch resort data from the web or cache.
  2. Parse Resort Data: Use the parse_resort_page function to extract relevant resort data.
  3. View Data: The data is displayed in an interactive Streamlit app.
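As an illustration of the parse step, here is a minimal standard-library sketch of what a `parse_resort_page`-style extractor does. The project itself uses BeautifulSoup, and the real function's signature and extracted fields may differ; everything below (the class name, the assumption that the resort name lives in the first `<h1>`) is hypothetical:

```python
from html.parser import HTMLParser


class ResortNameParser(HTMLParser):
    """Capture the text of the first <h1> tag as the resort name."""

    def __init__(self):
        super().__init__()
        self._in_h1 = False
        self.name = None

    def handle_starttag(self, tag, attrs):
        if tag == "h1" and self.name is None:
            self._in_h1 = True

    def handle_endtag(self, tag):
        if tag == "h1":
            self._in_h1 = False

    def handle_data(self, data):
        if self._in_h1 and self.name is None:
            self.name = data.strip()


def parse_resort_page(html: str) -> dict:
    """Extract a dict of resort fields from a resort page's HTML."""
    parser = ResortNameParser()
    parser.feed(html)
    return {"name": parser.name}


page = "<html><body><h1>Powder Ridge</h1><p>Blackouts: none</p></body></html>"
print(parse_resort_page(page))  # {'name': 'Powder Ridge'}
```

The real extractor pulls many more fields per resort; the sketch only shows the fetch-HTML-then-extract shape of the pipeline.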

Refreshing Resort Data

To update all resort data (recommended about once per year), follow these steps:

  1. Backup your web cache and resort data (optional but recommended):

    cp -r cache backups/cache_backup_$(date +%Y%m%d_%H%M%S)
    cp -r data backups/data_backup_$(date +%Y%m%d_%H%M%S)
  2. Remove the old cache and data folders (optional, for a clean refresh):

    rm -rf cache
    rm -rf data
  3. Recreate required directories:

    mkdir -p cache data/resort_page_extracts
  4. Fetch and cache the latest resort data:

    poetry run python src/page_scraper.py

    This will:

    • Download and cache the latest "our resorts" page.
    • Parse and save data/resorts_raw.json.
    • Download, cache, and parse each individual resort page, saving to data/resort_page_extracts/<slug>.json.

    Notes:

    • Use live mode to re-download everything:
      poetry run python src/page_scraper.py --read-mode live
    • Cached HTML files are not overwritten (the scraper uses open(..., 'x')). Delete cache/*.html if you want to re-fetch.
  5. Placeholder: update blackout date handling (next task):

    • The blackout Google Sheet format has changed; revisit and update the blackout pipeline here.
  6. Fetch blackout dates (optional):

    • src/prep_resort_data.py will auto-fetch data/blackout_dates_raw.csv if it is missing.
    • To force a refresh, run:
      poetry run python src/prep_resort_data.py --refresh-blackout
    • To increase verbosity, add --log-level DEBUG.
    • You can still use src/blackout.py to fetch the sheet and print QA output if you want to inspect it.
  7. Fetch reservation requirements (optional):

    • Cache the HTML page:
      poetry run python src/cache_blackout_reservations.py --read-mode live
    • Parse the cached reservations list into JSON:
      poetry run python src/reservations.py --read-mode cache
    • This produces data/reservations_raw.json.
  8. Prepare the final CSV for the Streamlit app:

    poetry run python src/prep_resort_data.py

    This will:

    • Use the Google Maps API to retrieve and save normalized location data if data/resort_locations.csv is missing.
    • Merge all resort data and location info.
    • Produce data/resorts.csv for the Streamlit app.
  9. Run the Streamlit app:

    poetry run streamlit run src/app.py
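The write-once caching behavior noted in step 4 can be sketched as follows. The `cache_html` helper and its file layout are hypothetical, but the exclusive-creation `'x'` open mode is exactly what the note above describes: an existing cached file is never overwritten.

```python
import tempfile
from pathlib import Path


def cache_html(cache_dir: Path, slug: str, html: str) -> bool:
    """Write html to cache_dir/<slug>.html only if it is not already cached.

    Returns True if a new file was written, False if the file already existed.
    """
    path = cache_dir / f"{slug}.html"
    try:
        # Mode 'x' raises FileExistsError if the file exists,
        # so cached pages are never overwritten.
        with open(path, "x", encoding="utf-8") as f:
            f.write(html)
        return True
    except FileExistsError:
        return False


cache = Path(tempfile.mkdtemp())
print(cache_html(cache, "powder-ridge", "<html>v1</html>"))  # True: first write
print(cache_html(cache, "powder-ridge", "<html>v2</html>"))  # False: cache hit, v1 kept
```

This is why deleting `cache/*.html` (or passing `--read-mode live`) is required to force a re-fetch.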

Note:

  • If you encounter errors related to missing or outdated data files, re-run the above steps in order.
  • The Google Maps API key must be set in your environment (see .env.example).
  • A Mapbox API token is required for map rendering in the Streamlit app.
  • If you have a limited Google Maps API quota, be aware that regenerating data/resort_locations.csv will make API calls for each unique resort location.
  • If you want to preserve previous location lookups, make sure to back up data/resort_locations.csv as well:
    cp data/resort_locations.csv data_backup_$(date +%Y%m%d_%H%M%S)_resort_locations.csv
  • If you add new resorts or change location names, you may need to manually review or update data/resort_locations.csv.
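As an illustration of the merge in step 8, here is a minimal standard-library sketch. The column names and the join key (resort name) are assumptions; the real logic lives in src/prep_resort_data.py and works from resorts_raw.json and data/resort_locations.csv:

```python
import csv
import io

# Hypothetical records; real fields come from the scraped data
# and the geocoded locations file.
resorts = [
    {"name": "Powder Ridge", "state": "CT"},
    {"name": "Mystery Mtn", "state": "VT"},
]
locations = {
    "Powder Ridge": {"lat": "41.49", "lon": "-72.71"},
}


def merge_to_csv(resorts: list, locations: dict) -> str:
    """Left-join location info onto resorts by name.

    Resorts with no geocoded location get empty lat/lon
    so no rows are dropped from the output.
    """
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=["name", "state", "lat", "lon"])
    writer.writeheader()
    for resort in resorts:
        loc = locations.get(resort["name"], {"lat": "", "lon": ""})
        writer.writerow({**resort, **loc})
    return buf.getvalue()


print(merge_to_csv(resorts, locations))
```

A left join here matters: a resort missing from data/resort_locations.csv should still appear in data/resorts.csv rather than silently vanish from the app.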

Blackout Dates

Blackout dates are sourced from a published Google Sheet and merged into the resort data.

Refreshing blackout data

  1. Fetch the latest sheet data and print QA output:

    poetry run python src/blackout.py

    This currently prints QA output only (it does not write data/blackout_dates_raw.csv).

  2. Run the prep script to merge blackout dates into data/resorts.csv:

    poetry run python src/prep_resort_data.py

QA / troubleshooting

  • Print the raw sheet and name-mismatch diagnostics:
    poetry run python src/blackout.py
  • If blackout resort names don’t match data/resorts.csv, update BLACKOUT_RESORT_NAME_MAP in src/blackout.py.
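A minimal sketch of how such a name-override map can be applied and diagnosed. The map entries shown here are hypothetical; the real `BLACKOUT_RESORT_NAME_MAP` lives in src/blackout.py:

```python
# Hypothetical override entries mapping blackout-sheet names
# to the canonical names used in data/resorts.csv.
BLACKOUT_RESORT_NAME_MAP = {
    "Powder Ridge Park": "Powder Ridge",
}


def unmatched_names(blackout_names: set, resort_names: set) -> list:
    """Return blackout-sheet names that still don't match the resorts
    table after applying the override map (sorted for stable output)."""
    mapped = {BLACKOUT_RESORT_NAME_MAP.get(n, n) for n in blackout_names}
    return sorted(mapped - set(resort_names))


print(unmatched_names({"Powder Ridge Park", "Mystery Mtn"}, {"Powder Ridge"}))
# ['Mystery Mtn']
```

Any names printed by this kind of diagnostic are candidates for new entries in the override map.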

Contributing

We welcome contributions! Please follow these steps:

  1. Fork the repository.
  2. Create a new branch (git checkout -b feature-branch).
  3. Make your changes.
  4. Commit your changes (git commit -am 'Add new feature').
  5. Push to the branch (git push origin feature-branch).
  6. Open a new Pull Request.

Testing & Continuous Integration ✅

This project includes unit tests and CI. The commands below run the tests, generate reports, and keep the code formatted.

Running tests locally

  1. Install dependencies with Poetry:

    poetry install
  2. Run the full test suite:

    poetry run pytest -q
  3. Run tests and generate JUnit + coverage reports (matches CI):

    mkdir -p reports
    poetry run pytest --junitxml=reports/junit.xml --cov=src --cov-report=xml:reports/coverage.xml -q

Notes:

  • Tests live in tests/ and use fixtures in tests/fixtures/ (e.g., powder_ridge_fixture.html, powder_ridge_malformed.html).
  • tests/conftest.py ensures the project root is on sys.path so import src works in CI.
  • The project includes pytest-cov for coverage reporting (installed in the dev group).
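The sys.path fix described above amounts to only a few lines. A sketch of what tests/conftest.py does (the exact variable names are assumptions):

```python
# tests/conftest.py (sketch): put the project root on sys.path so
# `import src` resolves the same way locally and in CI.
import sys
from pathlib import Path

# conftest.py lives in tests/, so the project root is two levels up.
ROOT = Path(__file__).resolve().parent.parent
if str(ROOT) not in sys.path:
    sys.path.insert(0, str(ROOT))
```

pytest imports conftest.py before collecting tests, so this runs before any `import src` in the test modules.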

Formatting

We use Black for code formatting. To check formatting locally:

poetry run black --check .

To run Black and reformat files:

poetry run black .

Black configuration lives in pyproject.toml (we set skip-string-normalization = true to preserve single quotes).

CI details

  • GitHub Actions runs tests and checks formatting on push/PRs. The CI workflow:
    • Installs dependencies with Poetry
    • Runs pytest and generates reports/junit.xml and reports/coverage.xml
    • Uploads a test artifact named test-reports
    • Publishes a GitHub Check Run summary using dorny/test-reporter@v2 (configured for python-xunit)
    • Optionally uploads coverage to Codecov if CODECOV_TOKEN is set in repository secrets

Notes for maintainers

  • src/utils.py now avoids creating the googlemaps.Client at import time when GOOGLE_MAPS_API_KEY is missing, which prevents import failures in CI and local environments without the key. Tests monkeypatch utils.gmaps when needed.
  • beautifulsoup4 is declared in pyproject.toml so bs4 is available in CI.
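The lazy-initialization pattern described above can be sketched like this; the helper name and error message are hypothetical (the real code is in src/utils.py), but the shape is the point: nothing touches the Google Maps client at import time, so the module imports cleanly without the key.

```python
import os

gmaps = None  # created lazily so importing this module never requires the key


def get_gmaps_client():
    """Return a googlemaps.Client, creating it on first use.

    Raises a clear error if GOOGLE_MAPS_API_KEY is not set, instead of
    failing at import time in environments without the key (e.g. CI).
    """
    global gmaps
    if gmaps is None:
        key = os.environ.get("GOOGLE_MAPS_API_KEY")
        if not key:
            raise RuntimeError("GOOGLE_MAPS_API_KEY is not set")
        # Imported here so the dependency is only needed when actually used.
        import googlemaps

        gmaps = googlemaps.Client(key=key)
    return gmaps
```

Tests can then monkeypatch the module-level `gmaps` (as noted above) without ever needing a real key.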

Reporting Issues

Help improve this app by opening an issue on the GitHub repository, with a description of the problem and steps to reproduce.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Acknowledgements


Made by Jonathan Stelman
