FORMINDEX is a comprehensive project aimed at analyzing and visualizing the FORMIS myrmecological database. It combines advanced bibliometric techniques, artificial intelligence, and interactive visualizations to extract insights from published ant research.
- Metadata-Centric Analysis
- Fair Use Compliance
- LLM Integration
- Dynamic Visualization
- Longitudinal Analysis
- AI-Driven Synthesis
- Targeted Bibliography Generation
- Multi-modal Output (Podcasts, Summaries, Translations)
- Full coverage of FORMIS database (July 2024 export, ~80,000 records)
- Conversion of Bibtex to JSON format
- Generation of targeted bibliographies based on key terms
- Frequency analysis of record types, authors, venues, and locations
- Network analysis of co-authorship
- Temporal analysis of publications and venues
- Thematic analysis through word clouds
- Content analysis via AI-generated summaries
- Narrative synthesis through AI-generated podcasts
- Use of NotebookLM for generating conversational podcasts
- OpenAI API integration for summarization and translation
- Perplexity AI for internet-enabled myrmecological inquiries
- Interactive charts for publication trends, author networks, and topic distributions
- Word clouds for title and abstract analysis
- Automated generation of summary reports and translations
Read_in_FORMIS.py: Ingests Bibtex and stores records in JSON formatGenerate_Target_Bibliographies.py: Creates subsets of records based on key termsVisualize_FORMIS.py: Generates visualizations for full FORMIS and targeted bibliographiesLLM_Methods/: Scripts for AI-driven summarization and translationPerplexity_Methods/: Scripts for advanced myrmecological inquiries using Perplexity AI
- Comprehensive bibliographic analysis visualizations
- AI-generated literature summaries in multiple languages
- Conversational podcasts on targeted myrmecological topics
- Internet-enabled myrmecological inquiry results
- Strict adherence to fair use and copyright regulations
- Responsible use of AI technologies
- Integration with multi-omic phenotypic data
- Development of predictive models for research trends
- Expansion of analysis techniques to other entomological databases
- Integration with NCBI species ID for broader MetaInformAnt efforts
- Collaboration with FORMIS stakeholders for continuous improvement
- Clone the repository
- Install required dependencies (list to be provided)
- Run
Read_in_FORMIS.pyto ingest the FORMIS database - Use
Generate_Target_Bibliographies.pyto create subsets of interest - Execute
Visualize_FORMIS.pyfor comprehensive visualizations - Explore
LLM_Methods/andPerplexity_Methods/for AI-driven analyses
We welcome contributions! Please see our contributing guidelines (link to be added) for more information.
This project is licensed under [LICENSE NAME] - see the LICENSE.md file for details.
- FORMIS database maintainers
- Contributors to open-source libraries used in this project
For more information, visit our project website or contact [Your Contact Information].