Agent Guidelines for Morphir Python

This document provides guidelines for AI code agents working on the Morphir Python codebase.

Project Overview

Morphir Python is a port of Morphir to Python, providing functional domain modeling capabilities. The project follows functional programming principles while maintaining Pythonic usability.

Monorepo Structure

morphir-python/
├── packages/
│   ├── morphir/           # Core library - IR models, types, pure functions
│   └── morphir-tools/     # CLI tools and extensions (depends on morphir)
├── tests/
│   ├── unit/             # Unit tests for morphir package
│   └── tools/            # Unit tests for morphir-tools package
└── features/             # BDD feature tests (behave)

Package Responsibilities

morphir (core library)

Morphir IR (Intermediate Representation) models
Type definitions and type algebra
Pure functional primitives
Zero CLI or IO dependencies
Should be usable as a standalone library

morphir-tools (CLI/tools)

Command-line interface
File I/O operations
Code generation backends
Depends on morphir core library

Coding Principles

1. Functional Programming

Immutability by Default

Use frozen dataclasses for all data structures:

from dataclasses import dataclass

@dataclass(frozen=True)
class Name:
    """An immutable name consisting of parts."""
    parts: tuple[str, ...]

Pure Functions

Functions should be pure - same inputs always produce same outputs with no side effects:

# Good: Pure function
def qualify(module_name: Name, local_name: Name) -> Name:
    return Name(parts=module_name.parts + local_name.parts)

# Bad: Side effect
def qualify(module_name: Name, local_name: Name) -> Name:
    print(f"Qualifying {local_name}")  # Side effect!
    return Name(parts=module_name.parts + local_name.parts)

2. Algebraic Data Types (ADTs)

Model domains using sum types (unions) and product types (dataclasses):

from dataclasses import dataclass
from typing import Union

# Product type (AND)
@dataclass(frozen=True)
class FunctionType:
    argument_type: "Type"
    return_type: "Type"

@dataclass(frozen=True)
class RecordType:
    fields: tuple[tuple[str, "Type"], ...]

@dataclass(frozen=True)
class UnitType:
    pass

# Sum type (OR) - Making illegal states unrepresentable
Type = Union[FunctionType, RecordType, UnitType]

3. Make Illegal States Unrepresentable

Design types so that invalid states cannot be constructed:

# Bad: Allows invalid states
@dataclass
class User:
    name: str | None
    email: str | None
    is_verified: bool  # Can be True even if email is None!

# Good: Invalid states are impossible
@dataclass(frozen=True)
class UnverifiedUser:
    name: str
    email: str

@dataclass(frozen=True)
class VerifiedUser:
    name: str
    email: str
    verified_at: datetime

User = Union[UnverifiedUser, VerifiedUser]

4. Type Annotations

All code must have complete type annotations. We use strict mode for both mypy and pyright.

from typing import TypeVar, Callable, Sequence

T = TypeVar("T")
U = TypeVar("U")

def map_list(func: Callable[[T], U], items: Sequence[T]) -> list[U]:
    """Apply a function to each item in a sequence."""
    return [func(item) for item in items]

5. Pattern Matching

Use structural pattern matching for handling sum types:

def type_to_string(t: Type) -> str:
    match t:
        case FunctionType(arg, ret):
            return f"({type_to_string(arg)} -> {type_to_string(ret)})"
        case RecordType(fields):
            field_strs = [f"{name}: {type_to_string(typ)}" for name, typ in fields]
            return f"{{ {', '.join(field_strs)} }}"
        case UnitType():
            return "()"

Code Style

Docstrings

Use Google-style docstrings:

def process_type(type_def: Type, context: Context) -> Result[ProcessedType, Error]:
    """Process a type definition within a context.

    Args:
        type_def: The type definition to process.
        context: The processing context containing scope information.

    Returns:
        A Result containing either the processed type or an error.

    Raises:
        ValueError: If the type definition is malformed.
    """

Formatting

Use ruff for formatting (configured in pyproject.toml)
Line length: 88 characters
Double quotes for strings
4-space indentation

Testing

TDD Workflow

Follow red-green-refactor:

Red: Write a failing test first
Green: Write minimal code to make it pass
Refactor: Clean up while keeping tests green

Unit Tests (pytest)

# tests/unit/test_name.py
import pytest
from morphir.ir import Name

class TestName:
    def test_from_string_splits_on_separators(self) -> None:
        name = Name.from_string("hello.world")
        assert name.parts == ("hello", "world")

    def test_to_string_joins_with_dot(self) -> None:
        name = Name(parts=("hello", "world"))
        assert name.to_string() == "hello.world"

BDD Tests (behave)

Write behavior-driven scenarios for user-facing features:

# features/type_checking.feature
Feature: Type Checking
  As a developer
  I want type checking for my domain models
  So that I catch errors at compile time

  Scenario: Function type matches argument
    Given a function type from Int to String
    And a value of type Int
    When I apply the function to the value
    Then the result type should be String

Test Coverage

Minimum 80% coverage required
All public APIs must have tests
Edge cases and error conditions must be tested

Development Commands

# Run all checks
mise run check

# Individual commands
mise run lint          # Linting
mise run format        # Format code
mise run typecheck     # Type checking (mypy + pyright)
mise run test          # Unit tests
mise run test-bdd      # BDD tests
mise run coverage      # Coverage report

Git Workflow

Create feature branch from main
Make changes following TDD
Ensure all checks pass: mise run check
Submit PR with clear description
Address review feedback

Dependencies

When adding dependencies:

morphir package: Should have minimal/zero dependencies
morphir-tools package: Can have CLI and IO dependencies
Always pin minimum versions in pyproject.toml

Error Handling

Use Result types instead of exceptions where appropriate:

from dataclasses import dataclass
from typing import TypeVar, Union

T = TypeVar("T")
E = TypeVar("E")

@dataclass(frozen=True)
class Ok[T]:
    value: T

@dataclass(frozen=True)
class Err[E]:
    error: E

Result = Union[Ok[T], Err[E]]

Performance Considerations

Use tuple instead of list for immutable sequences
Use frozenset instead of set for immutable sets
Consider __slots__ for frequently instantiated classes
Profile before optimizing

Documentation

All public APIs must have docstrings
Keep README.md up to date
Document non-obvious design decisions in code comments

Landing the Plane (Session Completion)

When ending a work session, you MUST complete ALL steps below. Work is NOT complete until git push succeeds.

MANDATORY WORKFLOW:

File issues for remaining work - Create issues for anything that needs follow-up
Run quality gates (if code changed) - Tests, linters, builds
Update issue status - Close finished work, update in-progress items

PUSH TO REMOTE - This is MANDATORY:

git pull --rebase
bd sync
git push
git status  # MUST show "up to date with origin"

Clean up - Clear stashes, prune remote branches
Verify - All changes committed AND pushed
Hand off - Provide context for next session

CRITICAL RULES:

Work is NOT complete until git push succeeds
NEVER stop before pushing - that leaves work stranded locally
NEVER say "ready to push when you are" - YOU must push
If push fails, resolve and retry until it succeeds

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Agent Guidelines for Morphir Python

Project Overview

Monorepo Structure

Package Responsibilities

Coding Principles

1. Functional Programming

2. Algebraic Data Types (ADTs)

3. Make Illegal States Unrepresentable

4. Type Annotations

5. Pattern Matching

Code Style

Docstrings

Formatting

Testing

TDD Workflow

Unit Tests (pytest)

BDD Tests (behave)

Test Coverage

Development Commands

Git Workflow

Dependencies

Error Handling

Performance Considerations

Documentation

Landing the Plane (Session Completion)

FilesExpand file tree

AGENTS.md

Latest commit

History

AGENTS.md

File metadata and controls

Agent Guidelines for Morphir Python

Project Overview

Monorepo Structure

Package Responsibilities

Coding Principles

1. Functional Programming

2. Algebraic Data Types (ADTs)

3. Make Illegal States Unrepresentable

4. Type Annotations

5. Pattern Matching

Code Style

Docstrings

Formatting

Testing

TDD Workflow

Unit Tests (pytest)

BDD Tests (behave)

Test Coverage

Development Commands

Git Workflow

Dependencies

Error Handling

Performance Considerations

Documentation

Landing the Plane (Session Completion)