# Structural Geospatial Identification — Python Package

**WHERE IS WHAT?**

GPS tells us *where*. SGI tells us *what*. Together: passive object identification, no camera, no radar.
## Installation

```bash
pip install sgi-machine
```

Or from source:

```bash
git clone https://github.com/or4k2l/Structural-Geospatial-Identification
cd Structural-Geospatial-Identification
pip install -e .
```

## Quickstart

```python
import sgi

# Train on synthetic data (takes ~5 seconds)
clf = sgi.train()

# Classify one sensor window
result = clf.predict(sensor_window)  # → 'truck'

# With confidence scores
proba = clf.predict_proba(sensor_window)
# → {'human': 0.01, 'bicycle': 0.01, 'car': 0.03, 'truck': 0.94, 'drone': 0.01}

# Save / load trained model
clf.save()
clf = sgi.load()

# Package info + physical constants
sgi.info()
```

## Input Formats

`sensor_window` can be any of:
**Dict** (from hardware parser):

```python
window = {
    'velocity':     np.array([...]),  # forward speed [m/s]
    'a_long':       np.array([...]),  # longitudinal accel [m/s²]
    'a_lat':        np.array([...]),  # lateral accel [m/s²]
    'a_vert':       np.array([...]),  # vertical accel, gravity-removed [m/s²]
    'omega_z':      np.array([...]),  # yaw rate [rad/s]
    'heading_rate': np.array([...]),  # heading change [deg/s]
    'fs':           100.0,            # sample rate [Hz]
}
```

**NumPy array** of shape (N, 6):
```python
# columns: [velocity, a_long, a_lat, a_vert, omega_z, heading_rate]
arr = sensor_data[:, :6]
clf.predict(arr)
```

**Pandas DataFrame** with the above column names.
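The three formats carry the same data. As a minimal sketch (the column list and the conversion code below are illustrative, not part of the package API), a dict-of-arrays window can be turned into the array or DataFrame form like this:

```python
import numpy as np
import pandas as pd

# Column order assumed from the (N, 6) array description above
COLUMNS = ['velocity', 'a_long', 'a_lat', 'a_vert', 'omega_z', 'heading_rate']

# Dict-of-arrays window, e.g. from a hardware parser (random placeholder data)
n = 200
rng = np.random.default_rng(0)
window = {k: rng.standard_normal(n) for k in COLUMNS}
window['fs'] = 100.0

# Equivalent (N, 6) array and DataFrame views of the same channels
arr = np.column_stack([window[k] for k in COLUMNS])
df = pd.DataFrame(arr, columns=COLUMNS)
```

Any of `window`, `arr`, or `df` could then be handed to `clf.predict(...)`.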
### ⚠️ GPS Velocity Required

`velocity` must be real GPS-derived speed in m/s. SGI is a GPS + IMU classifier.
Passing a near-zero or constant velocity array will trigger a `UserWarning` and result in poor classification accuracy for all vehicle classes.
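A minimal sketch of such a sanity check, assuming simple mean/variance thresholds (this is illustrative; the package's actual warning logic and thresholds are not shown here):

```python
import numpy as np

def looks_like_gps_speed(velocity, min_std=0.1, min_abs_mean=0.05):
    """Heuristic: reject near-zero or constant velocity traces.

    Assumed check for illustration, not the package's implementation.
    A real GPS speed trace varies over a window; a constant or all-zero
    array is a sign the field was filled with a placeholder.
    """
    v = np.asarray(velocity, dtype=float)
    return bool(v.std() >= min_std and np.abs(v).mean() >= min_abs_mean)

rng = np.random.default_rng(0)
ok = looks_like_gps_speed(9.5 + 0.8 * rng.standard_normal(500))  # varying trace
bad = looks_like_gps_speed(np.zeros(500))                        # constant placeholder
```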
## Limitations

| Limitation | Detail |
|---|---|
| GPS speed is mandatory | `velocity` must come from real GPS (e.g. u-blox NEO-M9N). Integrated-acceleration proxies degrade vehicle-class accuracy to near zero. |
| IMU-only mode | Without real GPS speed, only the human class is reliably classified (~93%). All vehicle classes (car, truck, bicycle) collapse. |
| Synthetic training | The classifier is trained on synthetic data. Cross-domain performance on real-world data depends on GPS availability and road surface conditions. |
| Dashboard IMU dampens engine frequency | When the IMU is mounted on the dashboard (not on the engine), motor vibration at 5 Hz (car) and 8 Hz (truck) is attenuated. `vib_freq` becomes unreliable as a car/truck discriminator. |
| car vs truck on dashboard IMU | See PVS Validation section below. Without real truck reference windows from the same road surface, car/truck separation is not achievable with synthetic-only training. |
## Collecty Validation (IMU-only)

**Experiment:** SGI-Machine classifier trained on synthetic data, evaluated against the Collecty dataset (Zagreb, Croatia, 100 Hz, 242 hours of labelled transport data) without real GPS speed.
| Class | Accuracy | Note |
|---|---|---|
| `human` | 93.4% | ✅ IMU signal alone is sufficient |
| `truck` | 12.4% | |
| `car` | 1.0% | ❌ Requires GPS speed |
| `bicycle` | 0.0% | ❌ Requires GPS speed |
**Conclusion:** SGI achieves strong human-class accuracy from IMU alone, but reliable vehicle classification (car, truck, bicycle) requires real GPS-derived speed in the `velocity` field.
Erdelić, M., Erdelić, T., & Carić, T. (2023). Dataset for multimodal transport analytics of smartphone users — Collecty. Data in Brief, 109481. https://doi.org/10.1016/j.dib.2023.109481
## PVS Validation

**Experiment:** SGI-Machine classifier trained on synthetic data (300 samples/class), evaluated against the PVS dataset (Brazil, 100 Hz dashboard IMU + 1 Hz GPS, 1,500 car windows, mean speed 34.5 km/h).
| Metric | Value |
|---|---|
| Dataset | PVS — Passive Vehicular Sensors (Menegazzo, 2020) |
| IMU position | Dashboard (≈ smartphone in windshield mount) |
| Windows evaluated | 1,500 (all ground-truth car) |
| GPS speed | Real — mean 34.5 km/h, std 25.8 km/h |
| Training | Synthetic only — 300 samples/class |
| `car` accuracy | 0–21% (across 8 calibration attempts) |
| truck misclassification | 67–88% of car windows classified as truck |
**Why car/truck separation fails on a dashboard IMU:**

1. **No real truck reference.** PVS contains only car windows — there are no real truck examples from the same road surface. Although the classifier was trained on synthetic truck data, it has never seen a real truck IMU recording from Brazilian roads, making it impossible to validate or calibrate car/truck discrimination for this domain.
2. **Engine frequency is damped.** The dashboard IMU does not reliably capture the 5 Hz (car) vs 8 Hz (truck) engine signature — road-surface broadband noise dominates the vertical-acceleration PSD.
3. **Heading/yaw encodes road roughness, not dynamics.** On Brazilian roads, `heading_rms` and `omega_rms` reflect road-surface irregularities transmitted through the suspension, not the vehicle's actual turning dynamics. This creates a systematic domain gap versus the synthetic training data.
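Point 2 can be illustrated with a Welch PSD estimate. The sketch below uses made-up amplitudes (a weak 5 Hz "engine" line under strong broadband "road" noise) purely to show the kind of spectrum a dashboard IMU produces; it is not PVS data:

```python
import numpy as np
from scipy.signal import welch

fs = 100.0                        # sample rate [Hz], matching the datasets above
rng = np.random.default_rng(1)
t = np.arange(0, 10, 1 / fs)

# Illustrative dashboard recording: weak 5 Hz engine line buried in
# broadband road-surface noise (amplitudes are assumptions, not measurements)
a_vert = 0.05 * np.sin(2 * np.pi * 5.0 * t) + 0.5 * rng.standard_normal(t.size)

# Estimate the power spectral density of vertical acceleration
f, psd = welch(a_vert, fs=fs, nperseg=256)
# With this signal-to-noise ratio, the 5 Hz line may not even be the
# tallest peak in the spectrum, which is exactly the failure mode above.
```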
**What is needed for reliable car/truck classification:**
- Real truck IMU windows from the same road surface and IMU mounting position, OR
- IMU mounted on engine/drivetrain (not dashboard), OR
- Additional sensor modality (e.g., acoustic, barometric)
Menegazzo, J., & von Wangenheim, A. (2020). PVS — Passive Vehicular Sensors Datasets. Data in Brief, 104863. https://doi.org/10.1016/j.dib.2019.104863
## Class Signatures

| Class | Mass | Velocity | Key signature |
|---|---|---|---|
| `human` | 80 kg | 1.4 m/s | Gait ~1.8 Hz |
| `bicycle` | 90 kg | 4.5 m/s | Cadence ~2.5 Hz |
| `car` | 1500 kg | 9.5 m/s | Engine ~5 Hz |
| `truck` | 15000 kg | 11.1 m/s | Diesel ~8 Hz |
| `drone` | 1.5 kg | 8.0 m/s | Rotor ~50 Hz |
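As an illustrative sketch (not the package's internal generator), a synthetic vertical-acceleration trace for each class can be built from its signature frequency in the table:

```python
import numpy as np

# Signature frequencies from the table above
SIGNATURE_HZ = {'human': 1.8, 'bicycle': 2.5, 'car': 5.0, 'truck': 8.0, 'drone': 50.0}

def synth_a_vert(cls, fs=100.0, seconds=5.0, amp=0.3, noise=0.05, seed=0):
    """Vertical acceleration dominated by the class signature frequency.

    Illustrative only; amplitudes and noise level are assumptions.
    Note: the 50 Hz drone rotor line sits exactly at the Nyquist limit of a
    100 Hz IMU, so capturing it would require a higher sample rate.
    """
    rng = np.random.default_rng(seed)
    t = np.arange(0, seconds, 1 / fs)
    tone = amp * np.sin(2 * np.pi * SIGNATURE_HZ[cls] * t)
    return tone + noise * rng.standard_normal(t.size)

sig = synth_a_vert('car')  # 500 samples with a dominant ~5 Hz component
```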
## Package Layout

```text
sgi/
├── __init__.py          ← public API: train, load, predict, predict_proba, info
├── classifier.py        ← SGILightClassifier
├── _internal/
│   ├── features.py      ← SGIFeatureExtractor (14 features)
│   ├── generator.py     ← synthetic GPS+IMU data generator
│   └── physics.py       ← SGI-Full: K-field, gap analysis (theoretical)
└── models/
    └── sgi_light_v1.pkl ← bundled pretrained model (after sgi.train().save())

tests/
└── test_sgi.py          ← 54 tests, all passing
```
## Hardware

This package is designed to run on:
| Platform | Role |
|---|---|
| Raspberry Pi Zero 2W | Full Python pipeline, WiFi streaming |
| ESP32-S3 | Feature extraction in C++, WiFi output |
| Any Linux device | Full pipeline |
Sensor requirements:
- GPS: u-blox NEO-M9N (10 Hz) or equivalent
- IMU: ICM-42688-P or MPU-6050 (100 Hz)
Total hardware cost: ~69€
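Since the GPS outputs speed at 10 Hz while the IMU samples at 100 Hz, one preprocessing step (assumed here, not documented package behaviour) is interpolating the GPS speed onto the IMU timeline so all six channels have equal length:

```python
import numpy as np

fs_imu, fs_gps = 100.0, 10.0            # rates from the sensor list above
t_imu = np.arange(0, 2.0, 1 / fs_imu)   # 200 IMU timestamps over 2 s
t_gps = np.arange(0, 2.0, 1 / fs_gps)   # 20 GPS timestamps over the same span

# Example 10 Hz GPS speed trace (values are illustrative)
gps_speed = 9.5 + 0.5 * np.sin(2 * np.pi * 0.25 * t_gps)

# Linear interpolation onto the 100 Hz IMU timeline → the `velocity` channel
velocity = np.interp(t_imu, t_gps, gps_speed)
```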
## Roadmap

| Tier | Approach | Status |
|---|---|---|
| SGI-Light (this package) | GPS + IMU + ML | deployed today |
| SGI-Medium (planned) | Quantum gravimetry | research |
| SGI-Full (theoretical) | Relativistic K-field | 35-order gap |
## SGI-Full (Theory)

```python
import sgi

# K-field amplitude for a single object
K = sgi.k_amplitude(mass_kg=80, velocity_ms=1.4)
# → 1.80e-52 m⁻¹ (unmeasurable with current technology)

# Gap analysis
result = sgi.gap_to_detector(80, 1.4, detector_sensitivity_ms2=1e-9)
result['gap_orders']   # → 26.8 orders to quantum gravimeter
result['n_coherent']   # → 6.17e+26 objects needed (coherent)

# Collective field
K_collective = sgi.collective_k(K, n_objects=1000, mode='incoherent')

# Earth relativistic correction (same as GPS correction)
sgi.EPSILON  # → 1.39e-9
```

See the SGI Collective Field notebook for the full gap analysis.
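The 1.39e-9 value of `sgi.EPSILON` matches the Earth-surface Schwarzschild factor 2GM/(Rc²), the same order of correction applied in GPS clocks. A quick standalone check (the constants below are standard textbook values, and the formula is an assumption about how the package defines EPSILON):

```python
# Assumed definition: EPSILON = 2GM/(R c^2) at the Earth's surface
G = 6.674e-11   # gravitational constant [m^3 kg^-1 s^-2]
M = 5.972e24    # Earth mass [kg]
R = 6.371e6     # Earth mean radius [m]
c = 2.998e8     # speed of light [m/s]

epsilon = 2 * G * M / (R * c**2)   # ≈ 1.39e-9
```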
## Notebooks

| Notebook | Description |
|---|---|
| SGI v3.0 | Full theoretical framework |
| SGI Collective Field | Gap analysis: can N objects bridge 35 orders? |
| SGI-Light Prototype | Hardware spec, open hardware call |
## Open Hardware Call

The software is ready. The hardware is not.

If you can build a GPS+IMU prototype on a Raspberry Pi or ESP32 — let's build this together.

- **You bring:** hardware, soldering iron, 3D printer
- **I bring:** software, documentation, physics
- **We build:** the first open SGI-Light prototype

→ Open an issue or comment on the Kaggle notebook.
## Citation

```bibtex
@software{akbay2025sgi,
  author  = {Akbay, Yahya},
  title   = {sgi-machine: Structural Geospatial Identification},
  year    = {2025},
  url     = {https://github.com/or4k2l/Structural-Geospatial-Identification},
  version = {1.0.0}
}
```

MIT © 2025 Yahya Akbay