Context
PR #250 (opened 2024-09-17 by @hrshdhgd, fixes #180) added IJSEM (International Journal of Systematic and Evolutionary Microbiology) as a data source. Closed after 18 months without merge due to codebase drift.
Why this matters now
Harry Caufield expressed interest (Feb 2026 MPIM) in checking on OntoGPT for IJSEM abstract extraction — OntoGPT is at a "transition point of what it still does well vs. what should be done agentically." This aligns with Luke Wang's AUTO term work in CultureBotAI/auto-term-catalog (13 OntoGPT extraction templates for METPO).
Data sources
What PR #250 did
- 80 additions, 12 files
- Added IJSEM as a transform source with phenotypic data ingestion
What would need to happen now
- Implement from scratch following current conventions (see
metatraits.py as recent example)
- Coordinate with Harry Caufield (@justaddcoffee) on OntoGPT integration
- Consider whether Luke's OntoGPT templates could be applied to IJSEM abstracts
- Evaluate data format and coverage against current KG-Microbe needs
Original PR
#250 by @hrshdhgd | Original issue: #180
Context
PR #250 (opened 2024-09-17 by @hrshdhgd, fixes #180) added IJSEM (International Journal of Systematic and Evolutionary Microbiology) as a data source. Closed after 18 months without merge due to codebase drift.
Why this matters now
Harry Caufield expressed interest (Feb 2026 MPIM) in checking on OntoGPT for IJSEM abstract extraction — OntoGPT is at a "transition point of what it still does well vs. what should be done agentically." This aligns with Luke Wang's AUTO term work in
CultureBotAI/auto-term-catalog(13 OntoGPT extraction templates for METPO).Data sources
What PR #250 did
What would need to happen now
metatraits.pyas recent example)Original PR
#250 by @hrshdhgd | Original issue: #180