Skip to content

GIMKit

Guided Infilling Modeling Toolkit — structured text generation and information extraction using language models.

GIMKit lets you define placeholders (masked tags) in text and have a language model fill them in. It gives you fine-grained control over model outputs through a typed tag system with optional regex constraints.

PyPI Version Python Versions Platform


What Can You Do With GIMKit?

GIMKit is a general-purpose information extraction framework. Write a natural-language template with embedded typed placeholders, and the model extracts structured data from any unstructured text.

Use Case Description
Contact extraction Parse names, emails, phone numbers from free-form text
Named entity recognition Extract organizations, people, locations, dates
Text classification Categorize text into labels, assign sentiment
Event extraction Pull structured event info (what/where/when/impact)
Relation extraction Find entities and the relationships between them
Resume / CV parsing Extract candidate name, title, education, experience
Product review analysis Parse product, price, rating, pros and cons
Privacy & PII protection Extract, classify, redact, and filter PII

See the Classic IE Use Cases, Privacy and PII Use Cases, and Other Use Cases pages for full code examples.


Features

  • Masked tag system — embed typed placeholders directly in f-strings.
  • Regex constraints — restrict model output to specific patterns.
  • Named access — retrieve results by tag name or index.
  • Multiple backends — OpenAI, vLLM (server and offline).
  • Small-model friendly — designed to work well with compact open-source models.