LLM Tester UI

A React and TypeScript application for testing and comparing local LLMs, with a focus on biomedical research, system prompts, structured outputs, and project organization. It also runs as an Electron desktop app, which adds handling of large files (up to 100MB), local file system storage, and full biomedical data integration.

Features

Core LLM Testing

Model Selection: Dropdown to select from available local LLM models
System Prompt Editor: Easy-to-use interface for editing system prompts
User Input Area: Large text area for input text processing
Structured Output Support: Toggle for JSON, XML, YAML, and CSV outputs
Real-time Streaming: Watch LLM responses generate in real-time with streaming text display
Project Management: Organize tests into projects with history tracking
Response Comparison: Side-by-side comparison of different model responses
History Tracking: Complete history of all interactions with metadata
Modern UI: Built with Material-UI for a professional look and feel

Biomedical Research Integration

ARS API Integration: Connect to the Autonomous Relay System (ARS) of the Translator network for biomedical knowledge graphs
PubMed Abstract Fetching: Automatically retrieve publication abstracts and metadata from PubMed
Smart Abstract Selection: Choose between "All Abstracts" or "Top 3 Most Recent" per edge
Biomedical Data Processing: Transform complex knowledge graph data into LLM-friendly formats
Subject Node Selection: Interactive selection of biomedical entities for targeted analysis
Publication Metadata: Include journal, title, publication date, and full abstracts in summaries

File Processing & Storage

Large File Processing: Upload and process large JSON, CSV, YAML, and XML files (up to 100MB)
Data Subset Creation: Create filtered subsets of uploaded data for targeted analysis
Local File Storage: Store projects and data on your local file system (Electron mode)
Cross-Platform: Available as both a web application and a desktop application

Prerequisites

Node.js (v16 or higher)
npm or yarn
Local LLM service (Ollama, etc.) running on your machine
For Biomedical Features: Access to ARS API (Translator network) and PubMed API

Installation

Quick Setup (Recommended)

Navigate to the project directory:

cd llmTesterUI

Run the automated setup script:

chmod +x setup.sh
./setup.sh

The setup script will:

Check Node.js version requirements
Install all dependencies (including Electron)
Create a .env file with default configuration
Create data directories for file storage
Provide next steps for running the application

Manual Setup

Navigate to the project directory:

cd llmTesterUI

Install dependencies:

npm install

Create a .env file (optional):

# LLM Tester UI Configuration
REACT_APP_API_URL=http://localhost:3001
REACT_APP_OLLAMA_URL=http://localhost:11434

Create data directories:

mkdir -p data/projects data/uploads data/exports

Start the application:
- Web version: npm start
- Electron version: npm run electron-dev

Usage

Getting Started

Create a Project: Click "New Project" in the left sidebar to create your first project
Select a Model: Choose from the available models in the dropdown
Configure Your Test:
- Edit the system prompt to define the AI's behavior
- Enter your input text in the user input area
- Optionally enable structured output if the model supports it
Run the Test: Click "Run Test" to execute the model
Review Results: View the response in the main area and compare with previous tests

Biomedical Research Workflow

1. Fetch Biomedical Data

Access ARS Data: Use the Data Fetcher to retrieve biomedical knowledge graphs from the Translator network
Enter Parameters: Provide your primary key (PK) and select the environment (dev, test, or prod)
Process Knowledge Graph: The system automatically processes complex nested JSON data into a flattened, structured format

2. Select Subject and Configure Abstracts

Choose Subject Node: Select a biomedical entity (gene, disease, drug, etc.) from the processed data
Configure Abstract Options:
- No Abstracts: Basic summary without publication details
- All Abstracts: Include all supporting publications
- Top 3 Most Recent: Include only the 3 most recent publications per edge
Automatic PubMed Integration: The system automatically fetches abstracts and metadata from PubMed

3. Generate LLM Summaries

Formatted Input: The system creates a structured input combining:
- Biomedical relationships and evidence
- Selected subject node information
- Publication abstracts (if enabled)
Real-time Streaming: Watch the LLM generate responses in real-time
Comprehensive Summaries: Get detailed biomedical insights with supporting evidence

Large File Processing (Electron Only)

Upload Files: Use the File Processor to upload large JSON, CSV, YAML, or XML files
Preview Data: View a preview of the uploaded data before processing
Create Subsets: Apply filters to create targeted data subsets for analysis
Process with LLMs: Use the filtered data as input for your LLM tests
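
The subset step can be pictured as a simple filter over parsed records. The sketch below is only an illustration: `DataRecord`, `SubsetFilter`, and `createSubset` are hypothetical names, and the case-insensitive substring matching is an assumption rather than the File Processor's actual filter logic.

```typescript
// Hypothetical shapes for illustration; the real File Processor may differ.
type DataRecord = { [field: string]: unknown };

interface SubsetFilter {
  field: string;    // name of the field to test
  contains: string; // case-insensitive substring to look for
}

// Return the records that satisfy every filter, up to an optional limit.
function createSubset(
  records: DataRecord[],
  filters: SubsetFilter[],
  limit: number = Infinity,
): DataRecord[] {
  const subset: DataRecord[] = [];
  for (const record of records) {
    if (subset.length >= limit) break;
    const matches = filters.every(({ field, contains }) =>
      String(record[field] ?? "").toLowerCase().includes(contains.toLowerCase()),
    );
    if (matches) subset.push(record);
  }
  return subset;
}
```

A small subset produced this way can then be serialized and pasted into the user input area for a test run.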

Project Management

Create Projects: Organize related tests into projects
Switch Between Projects: Click on any project in the sidebar to switch
Edit Projects: Click the edit icon to modify project details
Delete Projects: Remove projects you no longer need
Export Projects: Export project data in various formats

History and Comparison

View History: All interactions are saved in the project history
Compare Responses: Select two interactions and click "Compare Selected"
Delete Interactions: Remove individual test results from history
Metadata Tracking: Response times, model details, and configuration are preserved

Structured Output

For models that support structured output:

Enable the "Request Structured Output" toggle
Select your preferred format (JSON, XML, YAML, CSV)
Responses will be formatted and syntax-highlighted
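
For orientation, here is a minimal sketch of what the toggle could translate to when real Ollama integration is enabled. It builds a request body for Ollama's `/api/generate` endpoint, where `format: "json"` turns on JSON mode. How the app requests XML, YAML, or CSV is not documented here, so the sketch assumes those formats are requested through an extra system prompt instruction. `buildGenerateRequest` and `OutputFormat` are illustrative names, not part of the codebase.

```typescript
type OutputFormat = "json" | "xml" | "yaml" | "csv";

// Subset of the Ollama /api/generate request body used in this sketch.
interface GenerateRequest {
  model: string;
  prompt: string;
  system: string;
  stream: boolean;
  format?: "json";
  options: { temperature: number; num_predict: number };
}

function buildGenerateRequest(
  model: string,
  systemPrompt: string,
  userInput: string,
  structured?: OutputFormat,
): GenerateRequest {
  const request: GenerateRequest = {
    model,
    prompt: userInput,
    system: systemPrompt,
    stream: true,
    // Mirrors DEFAULT_TEMPERATURE and DEFAULT_MAX_TOKENS from src/config.ts
    options: { temperature: 0.7, num_predict: 4096 },
  };
  if (structured === "json") {
    // Ollama's built-in JSON mode
    request.format = "json";
  } else if (structured) {
    // Assumption: other formats are requested via the system prompt
    request.system = `${systemPrompt}\nRespond only with valid ${structured.toUpperCase()}.`;
  }
  return request;
}
```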

Biomedical Data Integration

ARS (Autonomous Relay System) API

The application integrates with the Translator network's ARS API to retrieve biomedical knowledge graphs:

Knowledge Graph Data: Complex nested JSON structures containing biomedical relationships
Entity Types: Genes, diseases, drugs, phenotypes, and more
Relationship Types: Various biomedical predicates and edge types
Publication Support: Each relationship includes supporting PubMed publications

PubMed Integration

Automatic Abstract Fetching

On-Demand Fetching: Abstracts are fetched only when a subject is selected and abstracts are enabled
Rate Limiting: Built-in rate limiting (3 requests/second) to comply with PubMed API limits
Batching: Efficient batching of PubMed IDs (10 per request) to minimize API calls
Error Handling: Graceful handling of API failures and rate limit exceeded errors
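
The batching and pacing described above can be sketched roughly as follows. This is an illustration under stated assumptions, not the code in src/services/pubmed-api.ts: `chunk`, `sleep`, and `fetchInBatches` are hypothetical helpers, and the actual PubMed request is passed in as a `fetchBatch` callback.

```typescript
// Split a list into consecutive batches of at most `size` items.
function chunk<T>(items: T[], size: number): T[][] {
  if (size <= 0) throw new Error("size must be positive");
  const batches: T[][] = [];
  for (let i = 0; i < items.length; i += size) {
    batches.push(items.slice(i, i + size));
  }
  return batches;
}

const sleep = (ms: number): Promise<void> =>
  new Promise<void>((resolve) => setTimeout(resolve, ms));

// Fetch all PubMed IDs batch by batch, pausing between batches so the
// request rate stays under 3 requests/second (350 ms spacing).
async function fetchInBatches<T>(
  pmids: string[],
  fetchBatch: (batch: string[]) => Promise<T[]>,
  batchSize: number = 10,
  delayMs: number = 350,
): Promise<T[]> {
  const results: T[] = [];
  let first = true;
  for (const batch of chunk(pmids, batchSize)) {
    if (!first) await sleep(delayMs); // no delay before the first request
    first = false;
    results.push(...(await fetchBatch(batch)));
  }
  return results;
}
```

With 25 PubMed IDs this issues three requests (10, 10, and 5 IDs) with two pauses between them.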

Abstract Selection Strategies

All Abstracts: Fetches and includes all supporting publications
Top 3 Most Recent:
- Fetches metadata for ALL publications per edge
- Sorts by publication date (newest first)
- Selects the 3 most recent publications per edge
- Fetches full abstracts only for the selected publications
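
A minimal sketch of the "Top 3 Most Recent" selection for a single edge follows. It assumes the publication date starts with a four-digit year, as in the examples in this README. `PublicationMeta` and `selectMostRecent` are illustrative names.

```typescript
interface PublicationMeta {
  pmid: string;
  publicationDate: string; // assumed to start with a 4-digit year, e.g. "2023"
}

// Extract the year; publications with an unparseable date sort last.
function yearOf(pub: PublicationMeta): number {
  const year = parseInt(pub.publicationDate.slice(0, 4), 10);
  return isNaN(year) ? 0 : year;
}

// Return the `count` most recent publications without mutating the input.
function selectMostRecent<T extends PublicationMeta>(pubs: T[], count: number = 3): T[] {
  return pubs
    .slice() // copy so the caller's array keeps its order
    .sort((a, b) => yearOf(b) - yearOf(a))
    .slice(0, count);
}
```

Only the PubMed IDs returned here would then need a full abstract fetch, which is where the savings over "All Abstracts" come from.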

Publication Metadata

Each abstract includes:

Title: Full publication title
Journal: Journal name
Publication Date: Year of publication
Abstract: Complete abstract text
PubMed ID: Unique identifier for the publication

Data Processing Pipeline

1. ARS Data Retrieval

ARS API → Complex JSON → Flattened Structure → Processed Data
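
As a rough sketch of the flattening step, the example below assumes a simplified knowledge graph in which nodes and edges are keyed by ID and each edge carries `subject`, `predicate`, and `object`. The real ARS responses are more deeply nested, and the processing in src/services/data-processor.ts may differ. All type and function names here are hypothetical.

```typescript
// Simplified, assumed input shape (not the full ARS response format).
interface KGNode { name?: string }
interface KGEdge { subject: string; predicate: string; object: string }
interface KnowledgeGraph {
  nodes: { [id: string]: KGNode };
  edges: { [id: string]: KGEdge };
}

// One flat row per edge, with node IDs resolved to readable names.
interface FlatEdge {
  edgeId: string;
  subject: string;
  predicate: string;
  object: string;
}

function flattenKnowledgeGraph(kg: KnowledgeGraph): FlatEdge[] {
  // Fall back to the raw ID when a node has no name or is missing.
  const label = (id: string): string => {
    const node = kg.nodes[id];
    return node && node.name ? node.name : id;
  };
  return Object.keys(kg.edges).map((edgeId) => {
    const edge = kg.edges[edgeId];
    return {
      edgeId,
      subject: label(edge.subject),
      predicate: edge.predicate.replace(/^biolink:/, ""), // drop the namespace prefix
      object: label(edge.object),
    };
  });
}
```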

2. Subject Selection & Abstract Fetching

Processed Data → Subject Selection → PubMed ID Extraction → Abstract Fetching → Enriched Data
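
The PubMed ID extraction step can be illustrated with a small parser. It assumes publications arrive as a delimited string of `PMID:`-prefixed identifiers, as in the example data structure in this README. `extractPubMedIds` is a hypothetical helper name.

```typescript
// Pull unique numeric PubMed IDs out of a string such as
// "PMID:123456,PMID:789012". Tokens without a PMID: prefix are ignored.
function extractPubMedIds(publications: string): string[] {
  const ids = new Set<string>();
  for (const token of publications.split(/[,;|\s]+/)) {
    const match = /^PMID:(\d+)$/i.exec(token);
    if (match) ids.add(match[1]);
  }
  return Array.from(ids); // Set preserves first-seen order
}
```

Deduplicating here keeps repeated citations across edges from triggering repeated PubMed requests.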

3. LLM Input Formatting

Enriched Data → Structured Input → LLM Processing → Real-time Streaming Response
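
To make the formatting step concrete, the sketch below turns a subject and its relationships into a plain-text block for the LLM. The field names follow the example data structure in this README, but the exact layout the application produces is an assumption, and `formatLLMInput` is an illustrative name.

```typescript
interface AbstractInfo {
  title: string;
  journal: string;
  publicationDate: string;
  abstract: string;
}

interface Relationship {
  predicate: string;
  object: string;
  abstracts?: AbstractInfo[]; // absent when "No Abstracts" is selected
}

// Build a numbered, human-readable summary of the subject's relationships.
function formatLLMInput(subject: string, relationships: Relationship[]): string {
  const lines: string[] = [`Subject: ${subject}`, "Relationships:"];
  relationships.forEach((rel, index) => {
    lines.push(`${index + 1}. ${subject} ${rel.predicate} ${rel.object}`);
    for (const a of rel.abstracts || []) {
      lines.push(`   - ${a.title} (${a.journal}, ${a.publicationDate})`);
      lines.push(`     Abstract: ${a.abstract}`);
    }
  });
  return lines.join("\n");
}
```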

Example Data Flow

Input Data Structure

Subject: "MDR1 gene"
Relationships: [
  {
    predicate: "increases_expression_of",
    object: "P-glycoprotein",
    publications: "PMID:123456,PMID:789012,PMID:345678",
    abstracts: [
      {
        title: "MDR1 polymorphisms and drug response",
        journal: "Pharmacogenomics",
        publicationDate: "2023",
        abstract: "Various polymorphisms of the MDR1 gene..."
      }
    ]
  }
]

LLM Summary Output

The MDR1 gene shows several important relationships in the biomedical literature:

1. MDR1 increases_expression_of P-glycoprotein
   - Supporting Publications:
     MDR1 polymorphisms and drug response
     Pharmacogenomics (2023)
     Abstract: Various polymorphisms of the MDR1 gene that encodes for P-glycoprotein (P-gp), a transmembrane pump, have been identified...

Architecture

Web vs Electron Mode

The application can run in two modes:

Web Mode (Browser)

Uses browser localStorage for data persistence
Limited to smaller files due to browser constraints
Accessible via web browser at http://localhost:3000
Limited PubMed Integration: Subject to CORS restrictions

Electron Mode (Desktop)

Uses local file system for data storage
Supports large files up to 100MB
Native desktop application experience
Better performance for large data processing
Full PubMed Integration: Bypasses CORS restrictions via main process

Data Storage

Web Mode

Storage: Browser localStorage
Limitations: 5-10MB storage limit, browser memory constraints
Location: Browser's local storage

Electron Mode

Storage: Local file system
Location: ~/Library/Application Support/llm-tester-ui/data/ (macOS)

Structure:

data/
├── projects/          # Project data and interactions
├── uploads/           # Uploaded large files
├── exports/           # Exported project data
└── examplefiles/      # Example biomedical data files

File Processing

The Electron version includes advanced file processing capabilities:

Streaming Processing: Large files are processed in chunks to avoid memory issues
Multiple Formats: Support for JSON, CSV, YAML, and XML files
Data Preview: Preview uploaded data before processing
Subset Creation: Create filtered subsets of data for targeted analysis
Metadata Tracking: Track file information, processing times, and data statistics

PubMed API Architecture

Rate Limiting Strategy

Batch Size: 10 PubMed IDs per request
Delay Between Batches: 350ms (keeps the rate just under 3 requests/second)
Error Handling: Automatic retry with exponential backoff
Caching: Built-in caching to avoid redundant requests
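
A minimal sketch of how retry with exponential backoff and caching could fit together is shown below. The base delay, cap, and attempt count are assumed values, and `backoffDelay`, `withRetry`, and `cachedFetch` are hypothetical names rather than functions from this codebase.

```typescript
// Delay before retry number `attempt` (0-based): 500, 1000, 2000, ... capped at maxMs.
function backoffDelay(attempt: number, baseMs: number = 500, maxMs: number = 8000): number {
  return Math.min(maxMs, baseMs * Math.pow(2, attempt));
}

const wait = (ms: number): Promise<void> =>
  new Promise<void>((resolve) => setTimeout(resolve, ms));

// Run `operation`, retrying on failure with growing delays.
async function withRetry<T>(operation: () => Promise<T>, maxAttempts: number = 4): Promise<T> {
  let lastError: unknown;
  for (let attempt = 0; attempt < maxAttempts; attempt++) {
    try {
      return await operation();
    } catch (error) {
      lastError = error;
      if (attempt < maxAttempts - 1) await wait(backoffDelay(attempt));
    }
  }
  throw lastError;
}

// Cache in-flight and completed lookups by PubMed ID so each ID is fetched once.
const abstractCache = new Map<string, Promise<string>>();

function cachedFetch(pmid: string, fetchOne: (pmid: string) => Promise<string>): Promise<string> {
  let pending = abstractCache.get(pmid);
  if (!pending) {
    pending = withRetry(() => fetchOne(pmid)).catch((error) => {
      abstractCache.delete(pmid); // do not keep failed lookups in the cache
      throw error;
    });
    abstractCache.set(pmid, pending);
  }
  return pending;
}
```

Caching the promise rather than the resolved value means two concurrent requests for the same ID share one network call.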

Electron Integration

Main Process: All PubMed API calls routed through Electron main process
CORS Bypass: Avoids browser CORS restrictions
Centralized Control: Consistent rate limiting and error handling
IPC Communication: Secure communication between renderer and main processes

Configuration

Environment Variables

Create a .env file in the project root to customize the application:

# API Configuration
REACT_APP_API_URL=http://localhost:3001
REACT_APP_OLLAMA_URL=http://localhost:11434

# PubMed API Configuration (optional)
PUBMED_API_KEY=your_api_key_here

Application Configuration

Edit src/config.ts to customize the application behavior:

export const CONFIG = {
  // Set to true to use real Ollama API, false for mock data
  USE_REAL_OLLAMA: false,
  
  // Ollama API endpoint
  OLLAMA_URL: 'http://localhost:11434',
  
  // Default model settings
  DEFAULT_TEMPERATURE: 0.7,
  DEFAULT_MAX_TOKENS: 4096,
  
  // Available MedGemma models
  MEDGEMMA_MODELS: [
    'medgem-custom:latest',
    'medgemma:2b',
    'medgemma:7b',
    'medgemma:latest'
  ],
  
  // Models that support structured output
  STRUCTURED_OUTPUT_MODELS: [
    'llama3.2',
    'llama3.3',
    'qwen3',
    'mistral-large',
    'medgem-custom',
    'medgemma'
  ],
  
  // PubMed API settings
  PUBMED_BATCH_SIZE: 10,
  PUBMED_RATE_LIMIT_DELAY: 350, // milliseconds
};
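
Note that STRUCTURED_OUTPUT_MODELS lists base names such as 'medgemma', while model IDs usually carry a tag such as 'medgemma:7b'. One plausible way to match the two is sketched below. This is an assumption about the lookup, not the application's confirmed logic, and `supportsStructuredOutput` is an illustrative name.

```typescript
// Copied from the config above for a self-contained example.
const STRUCTURED_OUTPUT_MODELS: string[] = [
  "llama3.2",
  "llama3.3",
  "qwen3",
  "mistral-large",
  "medgem-custom",
  "medgemma",
];

// Compare the part of the model ID before the ":" tag against the list.
function supportsStructuredOutput(
  modelId: string,
  supported: string[] = STRUCTURED_OUTPUT_MODELS,
): boolean {
  const baseName = modelId.split(":")[0].toLowerCase();
  return supported.some((name) => name.toLowerCase() === baseName);
}
```

Matching on the exact base name avoids false positives that a substring check would produce, for example a hypothetical 'qwen3-coder' model matching 'qwen3'.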

Adding New Models

To add new models, edit the MOCK_MODELS array in src/services/api.ts:

const MOCK_MODELS: LLMModel[] = [
  {
    id: 'your-model-id',
    name: 'Your Model Name',
    provider: 'Your Provider',
    supportsStructuredOutput: true,
    maxTokens: 4096,
    temperature: 0.7
  }
];

Using MedGemma Models

The UI now includes support for MedGemma models:

Available Models: The UI includes medgem-custom:latest and other MedGemma variants
Medical Responses: MedGemma models provide specialized medical responses
Structured Output: MedGemma supports JSON structured output for medical assessments
Real Integration: Set USE_REAL_OLLAMA: true in src/config.ts to use real Ollama API

To enable real Ollama integration:

// In src/config.ts
export const CONFIG = {
  USE_REAL_OLLAMA: true,  // Change this to true
  OLLAMA_URL: 'http://localhost:11434',
  // ... other config
};

Backend Integration

The application includes both mock data and real API integration:

Current Implementation

Mock Mode (default): Uses simulated responses for development
Real API Mode: Integrates with actual Ollama API when enabled

Ollama API Integration

The application includes a dedicated Ollama API service (src/services/ollama-api.ts) for listing the available models and running tests against them:

// Example usage
import { OllamaAPIService } from './services/ollama-api';

const ollamaService = new OllamaAPIService('http://localhost:11434');
const models = await ollamaService.getAvailableModels();
const result = await ollamaService.testModel(config);
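
For the streaming display, Ollama's generate endpoint returns newline-delimited JSON objects, each with a `response` text fragment and a `done` flag. The sketch below shows one way to reassemble text from network chunks that may split a JSON line in the middle. `NdjsonAccumulator` is a hypothetical helper, not part of src/services/ollama-api.ts.

```typescript
// Fields of a streamed Ollama generate chunk used in this sketch.
interface StreamChunk {
  response?: string;
  done?: boolean;
}

class NdjsonAccumulator {
  private buffer = ""; // holds an incomplete trailing line between pushes
  text = "";           // full response text received so far
  done = false;        // set once a chunk reports done: true

  // Feed a raw network chunk; returns the newly decoded text, if any.
  push(chunk: string): string {
    this.buffer += chunk;
    const lines = this.buffer.split("\n");
    this.buffer = lines.pop() || ""; // last piece may be a partial line
    let delta = "";
    for (const line of lines) {
      if (!line.trim()) continue;
      const parsed = JSON.parse(line) as StreamChunk;
      delta += parsed.response || "";
      if (parsed.done) this.done = true;
    }
    this.text += delta;
    return delta;
  }
}
```

A UI component could append each returned delta to the displayed response and stop reading once `done` is true.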

Custom Backend Integration

To integrate with your own LLM service:

Update the API calls in src/services/api.ts
Replace mock responses with actual API calls to your LLM service
Configure the proxy in package.json to point to your backend

Example backend integration:

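static async testModel(config: TestConfig): Promise<Interaction> {
  // POST the test configuration to your backend
  const response = await fetch('/api/test', {
    method: 'POST',
    headers: { 'Content-Type': 'application/json' },
    body: JSON.stringify(config)
  });
  // fetch() also resolves on HTTP error statuses, so check explicitly
  if (!response.ok) {
    throw new Error(`Test request failed: ${response.status} ${response.statusText}`);
  }
  return (await response.json()) as Interaction;
}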

Project Structure

src/
├── components/          # React components
│   ├── ModelSelector.tsx
│   ├── PromptEditor.tsx
│   ├── ResponseViewer.tsx
│   ├── ProjectManager.tsx
│   ├── HistoryViewer.tsx
│   ├── DataFetcher.tsx      # ARS API integration
│   ├── DataViewer.tsx       # Biomedical data visualization
│   ├── SubjectNodeSelector.tsx  # Subject selection with abstract options
│   └── FileProcessor.tsx    # Large file processing
├── services/           # API and data services
│   ├── api.ts
│   ├── ollama-api.ts
│   ├── ars-api.ts          # ARS API service
│   ├── pubmed-api.ts       # PubMed API service
│   ├── data-processor.ts   # Biomedical data processing
│   └── export-service.ts   # Data export functionality
├── types/              # TypeScript type definitions
│   └── index.ts
├── App.tsx             # Main application component
├── index.tsx           # Application entry point
└── index.css           # Global styles

public/
├── electron.js         # Main Electron process (includes PubMed API handlers)
├── preload.js          # Electron preload script
└── index.html          # Application entry point

data/
├── examplefiles/       # Example biomedical data files
├── exports/           # Exported project data
└── projects/          # Project data and interactions

Development

Available Scripts

npm start - Start development server (web mode)
npm run electron-dev - Start Electron development mode
npm run electron - Start Electron app (production build)
npm run build - Build for production
npm run electron-pack - Package Electron app
npm run dist - Create distributable
npm test - Run tests
npm run eject - Eject from Create React App

Running the Application

Web Mode

npm start
# Open http://localhost:3000 in your browser

Electron Mode

npm run electron-dev
# Desktop application will open automatically

Customization

Styling: Modify the theme in App.tsx or add custom CSS
Components: Extend or modify components in the components/ directory
API: Update the API service to integrate with your backend
Types: Add new TypeScript interfaces in types/index.ts
File Processing: Extend file processing capabilities in public/electron.js
Biomedical Data: Customize data processing in src/services/data-processor.ts

Troubleshooting

Common Issues

Models not loading: Check that your LLM service is running and accessible
Tests failing: Verify your system prompt and input text are valid
Projects not saving:
- Web mode: Check browser localStorage permissions
- Electron mode: Check file system permissions
Structured output not working: Ensure the selected model supports structured output
Large files not processing: Ensure you're running in Electron mode
Electron app not starting: Check Node.js version and dependencies

Biomedical Research Issues

ARS API errors:
- Verify your primary key (PK) is valid
- Check that the selected environment (dev/test/prod) is accessible
- Ensure network connectivity to the Translator network
PubMed API rate limiting:
- The system automatically handles rate limiting (3 requests/second)
- If you see "429 Too Many Requests" errors, the system will retry automatically
- Consider using "Top 3 Most Recent" instead of "All Abstracts" for large datasets
Abstract fetching failures:
- Check console logs for detailed error messages
- Verify PubMed IDs are valid
- Ensure you're running in Electron mode for full PubMed integration

Browser Compatibility

Chrome (recommended)
Firefox
Safari
Edge

Electron Requirements

macOS 10.11+ (for macOS builds)
Windows 7+ (for Windows builds)
Linux (for Linux builds)

Contributing

Fork the repository
Create a feature branch
Make your changes
Add tests if applicable
Submit a pull request

License

This project is open source and available under the MIT License.

Support

For issues and questions:

Check the troubleshooting section
Review the code comments
Open an issue on GitHub

Note: The Electron version provides enhanced capabilities for large file processing, local data storage, and full biomedical research integration. For production use with large datasets and PubMed integration, the Electron version is recommended.

Example Biomedical Data

The following is an example of the type of biomedical data that can be processed and analyzed using this application:

Sample Data Structure

Prednisone treats Addison Disease (x19)
Prednisone in clinical trials for Addison Disease
Prednisone studied to treat Addison Disease (x4) supported by 1 publication
Prednisone treats congenital adrenal hyperplasia (x5)
congenital adrenal hyperplasia subclass of Addison Disease (x3)
MC2R gene associated with condition hypoaldosteronism disease
Prednisone affects ABCB1 supported by 1 publication
Prednisone affects CTLA4 supported by 1 publication
Hyponatremia manifestation of Addison Disease
X-linked adrenal hypoplasia congenita subclass of Addison Disease (x2)
triple-A syndrome subclass of Addison Disease
CALCA gene associated with condition epilepsy supported by 1 publication
ABCB1 gene associated with condition Vomiting supported by 1 publication
epilepsy manifestation of Addison Disease
Prednisone affects MC2R supported by 1 publication
NR3C1 gene associated with condition hypoglycemia (x2) supported by 2 publications
Prednisone affects VWF supported by 1 publication
Prednisone affects TG supported by 1 publication
NR3C1 gene associated with condition epilepsy supported by 1 publication
VWF gene associated with condition epilepsy supported by 1 publication
hypoglycemia manifestation of Addison Disease
CTLA4 physically interacts with VANGL2 supported by 1 publication
MC2R gene associated with condition congenital adrenal hyperplasia supported by 1 publication
Vomiting manifestation of Addison Disease
Prednisone affects NR3C1 (x3)
Prednisone affects PTPRC supported by 1 publication
NR3C1 gene associated with condition congenital adrenal hyperplasia (x2) supported by 1 publication
CTLA4 physically interacts with PEX6 supported by 1 publication
VANGL2 gene associated with condition Addison Disease
MC2R gene associated with condition epilepsy
PEX6 gene associated with condition Addison Disease
Hyperkalemia manifestation of Addison Disease
hypoaldosteronism disease subclass of Addison Disease
ABCB1 gene associated with condition epilepsy (x2) supported by 2 publications
MC2R gene associated with condition Hyponatremia
MC2R gene associated with condition Vomiting
MC2R gene associated with condition X-linked adrenal hypoplasia congenita (x2) supported by 1 publication
PTPRC gene associated with condition tuberculosis supported by 1 publication
tuberculosis causes Addison Disease supported by 1 publication
Prednisone affects CYP2C19
MC2R gene associated with condition Hyperkalemia
Prednisone affects CALCA supported by 1 publication
CYP2C19 gene associated with condition epilepsy supported by 1 publication
TG gene associated with condition epilepsy supported by 1 publication
NR3C1 gene associated with condition hypoaldosteronism disease
PTPRC gene associated with condition epilepsy supported by 1 publication
MC2R gene associated with condition triple-A syndrome supported by 1 publication
Prednisone treats classic congenital adrenal hyperplasia due to 21-hydroxylase deficiency
Prednisone studied to treat classic congenital adrenal hyperplasia due to 21-hydroxylase deficiency supported by 1 publication
Prednisone studied to treat congenital adrenal hyperplasia supported by 1 publication

Supporting Publications

The application can automatically fetch and include supporting publications from PubMed, such as:

Example Publication 1

Title: Microglial pattern recognition via IL-33 promotes synaptic refinement in developing corticothalamic circuits in mice.
Journal: The Journal of experimental medicine (2023)
Abstract: Microglia are critical regulators of brain development that engulf synaptic proteins during postnatal synapse remodeling. However, the mechanisms through which microglia sense the brain environment are not well defined. Here, we characterized the regulatory program downstream of interleukin-33 (IL-33), a cytokine that promotes microglial synapse remodeling...

Example Publication 2

Title: The MDR1 polymorphisms at exons 21 and 26 predict steroid weaning in pediatric heart transplant patients.
Journal: Human immunology (2002)
Abstract: Various polymorphisms of the MDR1 gene that encodes for P-glycoprotein (P-gp), a transmembrane pump, have been identified. A silent mutation C3435T in exon 26 and a G2677T mutation in exon 21 have been correlated with P-gp expression and function in humans. The objectives of this study were (a) to determine whether the MDR1 exon 21 and exon 26 polymorphisms were related to steroid weaning in a pediatric heart transplant (HTx) population, and (b) to determine whether an association exist between the MDR1 exon 21 and exon 26 polymorphisms in these patients...

Data Processing Capabilities

This application can process complex biomedical data including:

Drug-Disease Relationships: Treatment associations, clinical trials, and therapeutic effects
Gene-Disease Associations: Genetic factors and their relationship to conditions
Protein Interactions: Physical interactions between proteins and their functional implications
Disease Hierarchies: Subclass relationships and disease manifestations
Publication Evidence: Supporting literature with full abstracts and metadata
Multi-entity Networks: Complex relationships between drugs, genes, diseases, and phenotypes

The system automatically processes this data into structured formats suitable for LLM analysis, enabling comprehensive biomedical research and drug discovery workflows.
