RamanPL_2D

RamanPL_2D is a Python toolkit for the analysis and visualisation of Raman and photoluminescence (PL) spectra in two-dimensional materials.
It provides tools for extracting peak positions, peak intensities / heights, and FWHM, and supports single-spectrum fitting, batch analysis, and spectral mapping.

The package is designed to support reproducible spectroscopy analysis workflows for 2D materials research.

Features

Spectral analysis

Import and process Raman and PL spectra from .txt and .wdf
Compatible with Renishaw WiRE exported data
Single-spectrum fitting using two peak models:
- Lorentzian (default; compatible with materials libraries)
- Pseudo-Voigt (pVoigt) — linear combination of Lorentzian and Gaussian

Flexible peak definitions

Library-based peak definitions via materials and substrate
Fully user-defined peaks using custom_peaks
Remove unwanted peaks via remove_peaks
Consistent behaviour across:
- single-spectrum fitting
- batch fitting
- mapping

Batch processing

Batch workflows allow automated analysis of many spectra:

automated fitting across multiple spectra
extraction of peak parameters (position, FWHM, intensity / peak height)
summary statistics per peak
export to .csv / .txt

Mapping analysis

Heatmaps of fitted parameters:
- peak intensity / peak height
- peak position
- FWHM
Raman-specific derived maps:
- A1g − E2g peak separation
- E2g / A1g intensity ratio
Heatmaps of integrated spectral intensity
Mapping fit diagnostics:
- residual maps
- residual distribution inspection
- bound-sticking summaries
- optional compact or disabled per-pixel diagnostics for production runs

Preprocessing

Modular preprocessing with Pipeline
Shared preprocessing support across:
- single-spectrum fitting
- batch workflows
- mapping workflows
Supported preprocessing operations include:
- crop by range
- Savitzky–Golay smoothing
- baseline subtraction (poly, gaussian, asLS, arPLS, airPLS)

Performance and robustness

Adaptive mapping multistart fitting:
- cheap first-pass fit
- retry only when needed
Faster mapping preprocessing for common workflows:
- vectorised Savitzky–Golay smoothing
- vectorised Gaussian baseline subtraction
- batched polynomial baseline subtraction
Vectorised peak summation for cheaper repeated model evaluations during optimisation

Visualisation

Raw vs fitted spectra overlay
Waterfall plots for spectral collections
Dynamic inspection of spectral fitting results

For features such as pipeline, adaptive mapping fits, and diagnostics control, please check the demonstration section and example notebooks in example-usage/.

For development roadmap and future plans, see the development roadmap section below.

Repository Structure

RamanPL_2D/
├── src/
│   └── ramanpl/
│       ├── __init__.py
│       ├── baselineAPI.py                  # Baseline subtraction API and baseline spec parsing
│       ├── batch.py                        # Batch workflows for Raman / PL spectra
│       ├── exporter.py                     # CSV / TXT export helpers and metadata provenance
│       ├── preprocessing.py                # Shared preprocessing pipeline definitions and benchmark pipeline builders
│       ├── operation.py                    # Spectrum / map arithmetic operations
│       │
│       ├── data_importer/
│       │   ├── __init__.py
│       │   └── data_importer.py            # WDF / TXT import for single spectra and mapping data
│       │
│       ├── single_fit/
│       │   ├── __init__.py
│       │   ├── _single_fit_core.py         # Shared fitting core logic
│       │   ├── _raman_fit.py               # Raman single-spectrum fitting
│       │   └── _pl_fit.py                  # PL single-spectrum fitting
│       │
│       ├── mapping/
│       │   ├── __init__.py
│       │   ├── _io.py                      # Mapping I/O, coordinates, and layout helpers
│       │   ├── _preprocess.py              # Mapping preprocessing execution
│       │   ├── _raman_mapping.py           # Raman mapping workflow
│       │   └── _pl_mapping.py              # PL mapping workflow
│       │
│       └── integration/
│           ├── __init__.py
│           ├── ramanspy_adapter.py         # RamanPL_2D ↔ RamanSPy data conversion helpers
│           ├── ramanspy_bridge.py          # Backend execution bridge for RamanSPy preprocessing
│           └── ramanspy_translate.py       # Translation of supported preprocessing pipelines to RamanSPy
│
├── benchmarks/
│   └── benchmark_mapping_preprocessing.py  # v0.4.3 mapping benchmark harness
│
├── tests/
│   ├── test_preprocessing_backend_resolution.py
│   ├── test_single_fit_regressions.py
│   ├── test_data_importer_regressions.py
│   ├── test_export_provenance_regressions.py
│   ├── test_mapping_backend_parity.py
│   ├── test_mapping_cube_consistency.py 
│   ├── test_mapping_backend_benchmark_smoke.py
│   └── test_mapping_memory_runtime_smoke.py
│
├── example-usage/                          # Example notebooks and demonstrations
│   ├── Ramanfit/
│   ├── PLfit/
│   ├── multi-plot/
│   └── Mapping/
│
├── README.md
├── CHANGELOG
├── pyproject.toml / setup.py
└── requirements*.txt

Change log

See CHANGELOG for details on recent updates and new features.

Getting Started

For users new to Python or Visual Studio Code (VS Code), the following steps will help you get started:

1. Install Python

Download and install the latest version of Python from: https://www.python.org/downloads/

2. Install Visual Studio Code

Download and install VS Code from: https://code.visualstudio.com/

3. Set Up Python in VS Code

Install the official Python extension by Microsoft.
Follow the official VS Code tutorial: Getting Started with Python in VS Code

4. Clone the Repository

Press the code button on the webpage to find your best way to clone th repository, enter the bash command in the terminal. An example of clone it via http can be like:

git clone https://github.com/barry063/RamanPL_2D.git

This will clone the open version of RamanPL_2D codes.

5. Install Dependencies

After you clone it, first go to the directory of your local RamanPL_2D, enter the bash command:

cd RamanPL_2D

Then in the same directory, enter the bash command:

pip install -r requirements.txt

This will automatically check all the required python packages required and install them in your local environment.

6. (Optional) Installing the Library Locally (for VSCode & Jupyter Notebook)

To use the RamanPL_2D toolkit in your own scripts or Jupyter notebooks, you can install the package locally using either of the following methods:

Option 1: Install as Editable Package (Recommended)

This method allows you to import the package from anywhere, and source-code changes will be reflected without reinstalling.

Navigate to the source folder:

cd "path\to\RamanPL_2D\src"

Install in editable mode:

pip install -e .

You can then import it in Python as usual:

from ramanpl import RamanFit
from ramanpl import PLfit

Option 2: Run `install.ipynb` for installation

Go to the src folder, click open the install.ipynb jupyter-notebook file. DON'T move the install.ipynb out of the src folder! Simply run all the codes in the install.ipynb to run the installation in the jupyter-notebook

Important: all the .ipynb in the example usage folders assume you have installed the package. So if you couldn't run the example nicely, maybe install the packages first.

Option 3: Add Folder to PYTHONPATH or sys.path (Manual)

If you prefer not to install the package, you can manually add the source folder to your Python path.

In your script or jupyter-notebook:

import sys
sys.path.append(r"path\RamanPL_2D\src")

from ramanpl import RamanFit

This approach is transient – it must be repeated each time the Python kernel restarts unless automated via environment variables or startup scripts.

7. Run Example Notebook

Open example_analysis.ipynb in the example-usage/ folder using VS Code or Jupyter.
Run the cells to see the toolkit in action.

8. Optional RamanSPy backend

RamanPL_2D supports RamanSPy as an optional preprocessing backend for supported Raman workflows.

For a local source install from the src/ directory, use:

Install with:

pip install -e .[ramanspy]

If RamanSPy is not installed, preprocessing falls back to the native implementation when preprocessing_backend="auto" is used.

Demonstration

Preprocessing pipelines

Version v0.3.4 introduced a modular preprocessing pipeline framework for Raman and PL spectra.

Typical preprocessing steps include:

spectral cropping
smoothing
baseline subtraction

A pipeline consists of ordered preprocessing steps:

from ramanpl.preprocessing import Pipeline

Each step modifies a SpectralDataset object and passes the result to the next step.

Example steps currently included:

Step	Description
`CropByRange`	Crop spectra to selected spectral window
`SmoothSavGol`	Savitzky–Golay smoothing
`BaselineSubtract`	Background subtraction (`poly` / `airPLS` / `arPLS` / `AsLS` / `Gaussian`)

Legacy preprocessing arguments

For backward compatibility, the following arguments still work:

smoothing=True
background_remove=True
baseline_method="poly"

However, pipeline-based preprocessing is recommended for new workflows.

Baseline specification

Baseline algorithms are now configured using a dictionary specification:

Example of airPLS baseline specification:

baseline_spec = {
    "method": "airpls",
    "lam": 1e6,
    "niter": 50,
    "tol": 1e-6,
}

Example polynomial baseline:

baseline_spec = {
    "method": "poly",
    "poly_order": 3,
}

The legacy argument poly_degree argument is deprecated and will be removed in a later release.

Adaptive mapping fitting and diagnostics (v0.3.9)

Version v0.3.9 improves mapping efficiency in three main ways:

adaptive multistart fitting
faster cube-level preprocessing for common workflows
configurable diagnostics storage

Adaptive multistart fitting

Mapping fits can now use a cheap first pass and only retry with more expensive initialisations when needed.

Example:

raman_map.fit_spectra(
    warm_start=True,
    fit_spectrum_kwargs=dict(
        adaptive_multistart=True,
        fast_n_starts=1,
        n_starts=4,
        p0_strategy="jitter",
        retry_on_fail=True,
        retry_on_high_rmse=True,
        retry_on_bound_hit=False,
        retry_rmse_gate=0.10,
        diagnostics="light",
    )
)

How it works

Fitting is performed in peak-normalised space for stability.
The fitter generates n_starts initial guesses using one of:
- p0_strategy="midpoint": midpoint of bounds (baseline behaviour)
- p0_strategy="random": uniform random within bounds
- p0_strategy="jitter": Gaussian perturbations around the current p0, clipped to bounds
The best candidate is selected using RMSE (and optionally a penalty term for width “inflation” toward its upper bound).

Available strategies:

Strategy	Description
midpoint	midpoint of parameter bounds
random	random uniform sampling within bounds
jitter	Gaussian perturbation around initial guess

Example

raman_map.fit_spectra(
    warm_start=True,
    fit_spectrum_kwargs=dict(
        n_starts=10,
        p0_strategy="jitter",
        random_state=0
    )
)

Diagnostics levels

Three diagnostics modes are available for mapping fits:

Mode	Behaviour
`full`	stores full per-pixel diagnostics including bound masks
`light`	stores compact QA summaries only
`none`	disables per-pixel diagnostics storage

Example:

pl_map.fit_spectra(
    fit_spectrum_kwargs=dict(
        diagnostics="none"
    )
)

fit_summary() still works in diagnostics="none" mode using the residual map, but detailed bound-sticking diagnostics are not available.

Verifying bound-sticking (QA check)

After fitting, you can summarise mapping fit quality using:

rep = raman_map.fit_summary()

This reports:

fit success rate
RMSE statistics
failure reasons, when diagnostics are available
bound-sticking summaries, when diagnostics are available

New Features in v0.4.x: RamanSPy integration

Preprocessing backend selection

Preprocessing supports three backend modes:

native — always use the built-in preprocessing implementation
auto — use RamanSPy when available, supported for the input, and the full preprocessing pipeline is currently translatable; otherwise fall back to native
ramanspy — force RamanSPy preprocessing and raise an error if unavailable or unsupported

Current support

RamanSPy preprocessing support in the current build is limited to:

Raman workflows
Raman shift axis (cm^-1)
the following translated preprocessing steps:
- CropByRange
- SmoothSavGol
- BaselineSubtract with:
  - poly
  - asls
  - airpls
  - arpls

The following remain native-only for now:

PL preprocessing workflows
BaselineSubtract(method="gaussian")

Single-spectrum example

from ramanpl import RamanFit

raman_fit = RamanFit.RamanFit(
    spectra=spectra,
    wavenumber=wavenumber,
    custom_peaks={
        "P1": ([210, 2, 0], [235, 30, 10]),
        "P2": ([325, 2, 0], [360, 40, 10]),
    },
    smoothing=True,
    background_remove=True,
    baseline_method={"method": "poly", "poly_order": 3},
    preprocessing_backend="auto",
)

Custom preprocessing pipeline example

from ramanpl.preprocessing import Pipeline, CropByRange, SmoothSavGol, BaselineSubtract

pipe = Pipeline(
    steps=[
        CropByRange((120, 480)),
        SmoothSavGol(window_length=9, polyorder=3),
        BaselineSubtract({"method": "poly", "poly_order": 3}),
    ],
    backend="auto",
)

In auto mode, RamanPL_2D records the resolved backend in preprocessing metadata. In mapping exports, the requested and resolved backend are both written into export metadata for provenance.

Forced RamanSPy example

from ramanpl.preprocessing import Pipeline, SmoothSavGol

pipe = Pipeline(
    steps=[
        SmoothSavGol(window_length=9, polyorder=3),
    ],
    backend="ramanspy",
)

Mapping backend benchmarking (v0.4.3)

Version v0.4.3 introduces a reproducible benchmark harness (benchmarks/benchmark_mapping_preprocessing.py) for Raman mapping cube preprocessing. The harness compares native and RamanSPy backend runtime and memory usage across six pipeline configurations (crop, Savitzky–Golay, polynomial baseline, AsLS, airPLS, arPLS) on three synthetic datasets (3×4, 10×12, and 20×24 pixels). Cube consistency and parity tests validate axis ordering, shape invariants, and adapter round-trip correctness for both backends; RamanBatch integration remains planned for v0.4.4.

TO-DO

Development Roadmap

Roadmap update (v0.4.x)
Development remains focused on RamanSPy integration as an optional preprocessing backend.
The immediate priority is to complete backend propagation, harden preprocessing internals, and improve performance for supported Raman workflows.
Advanced features such as machine-learning-assisted fitting remain deferred until preprocessing and backend behaviour are stabilised.

v0.4.x — RamanSPy integration

Version	Scope	Details
v0.4.0	Backend infrastructure	- Add optional RamanSPy dependency - Introduce internal adapter layer (`integration/ramanspy_adapter`) - Implement Spectrum / mapping cube conversion - Add preprocessing backend selector (`native / ramanspy / auto`) - Record backend in metadata
v0.4.1	Pipeline translation and stabilisation	- Translate `preprocessing.Pipeline` → RamanSPy for supported Raman preprocessing steps - Support: crop, Savitzky–Golay, selected baselines (`poly`, `asls`, `airpls`, `arpls`) - Preserve native fallback for unsupported steps and workflows - Stabilise backend propagation through single-spectrum and mapping preprocessing
v0.4.2	Validation and documentation	- Add regression tests for `native / auto / ramanspy` backend behaviour - Add Raman vs PL backend-compatibility checks - Update notebooks and README examples - Verify export metadata and preprocessing provenance
v0.4.3	Mapping benchmarking and performance review	- Benchmark Raman mapping preprocessing: native vs RamanSPy - Check conversion overhead and memory behaviour - Confirm axis ordering and cube consistency on representative datasets
v0.4.4	Batch integration	- Propagate backend into `RamanBatch` - Ensure consistent export metadata - Maintain existing plotting and table behaviour

v0.4.x+ — stabilisation, hardening, and optimisation

Version	Scope	Details
v0.4.5	API cleanup and hardening	- Improve backend error messages and consistency - Reduce remaining duplication in mapping/batch internals - Consolidate preprocessing/export helper paths - Introduce an internal baseline-engine abstraction for `BaselineAPI` - Split façade / native implementation / optional third-party engine dispatch - Normalise baseline spec handling and capability checks
v0.4.6	Performance	- Reduce conversion overhead (cube ↔ RamanSPy) - Improve memory efficiency - Add Whittaker-system / penalty caching for repeated baseline solves on shared axes - Optimise native `asls` / `airpls` / `arpls` implementations around reusable system structure - Evaluate optional direct `pybaselines` integration for supported baseline methods - Profile native legacy vs native optimised vs optional `pybaselines` paths
v0.4.7	API cleanup & deprecation removal	- Consolidate backend interface - Remove deprecated parameters and legacy pathways - Finalise preprocessing schema (baseline / pipeline spec) - Improve error handling and messaging - Finalise baseline-engine interface and optionally expose user control for engine selection

v0.5.x — deferred features

Version	Scope	Details
v0.5.0	RamanSPy analysis (optional)	- Expose RamanSPy-based analysis workflows (e.g. decomposition, clustering) - Interoperability with RamanSPy objects
v0.5.1	Baseline caching (deferred)	- Cache preprocessing results across mapping pixels where appropriate beyond baseline-solver reuse - Backend-aware caching for native and RamanSPy pathways
v0.5.5+	Machine learning fitting (deferred)	- ML-assisted peak initialisation - Explore fit acceleration only after backend behaviour is mature

Notes

RamanSPy is currently used as an optional preprocessing backend only.
Integration is limited to Raman workflows (cm⁻¹ axis); PL workflows remain on the native backend.
Existing APIs (Pipeline, RamanFit, Mapping, Batch) remain backward compatible during v0.4.x.
BaselineSubtract(method="gaussian") remains native-only at this stage.
Current development priority is validation, documentation, and backend stabilisation before broader RamanSPy feature expansion.

License

This project is licensed for BSD 3-Clause License.
See the LICENSE file for details.

Contact

For issues, questions, or collaboration ideas:
Hao Yu – hy377@cam.ac.uk

Name		Name	Last commit message	Last commit date
Latest commit History 73 Commits
.github/ISSUE_TEMPLATE		.github/ISSUE_TEMPLATE
benchmarks		benchmarks
example-usage		example-usage
src		src
tests		tests
.gitignore		.gitignore
CHANGELOG		CHANGELOG
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

RamanPL_2D

Features

Spectral analysis

Flexible peak definitions

Batch processing

Mapping analysis

Preprocessing

Performance and robustness

Visualisation

Repository Structure

Change log

Getting Started

1. Install Python

2. Install Visual Studio Code

3. Set Up Python in VS Code

4. Clone the Repository

5. Install Dependencies

6. (Optional) Installing the Library Locally (for VSCode & Jupyter Notebook)

Option 1: Install as Editable Package (Recommended)

Option 2: Run install.ipynb for installation

Option 3: Add Folder to PYTHONPATH or sys.path (Manual)

7. Run Example Notebook

8. Optional RamanSPy backend

Demonstration

Preprocessing pipelines

Legacy preprocessing arguments

Baseline specification

Adaptive mapping fitting and diagnostics (v0.3.9)

Adaptive multistart fitting

How it works

Available strategies:

Example

Diagnostics levels

Verifying bound-sticking (QA check)

New Features in v0.4.x: RamanSPy integration

Preprocessing backend selection

Current support

Single-spectrum example

Custom preprocessing pipeline example

Forced RamanSPy example

Mapping backend benchmarking (v0.4.3)

TO-DO

Development Roadmap

v0.4.x — RamanSPy integration

v0.4.x+ — stabilisation, hardening, and optimisation

v0.5.x — deferred features

Notes

License

Contact

About

Topics

Resources

License

Code of conduct

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Option 2: Run `install.ipynb` for installation

Packages