Changelog
Full release notes are on GitHub Releases.
kumo and kumo-derive are versioned independently — one may release without the other.
kumo
0.1.0 — 2026-04-13
- Async-first crawl engine via Tokio (
CrawlEngine::builder()) - CSS, regex, XPath, JSONPath selectors
- LLM extraction via Claude, OpenAI, Gemini, Ollama
- Rate limiting, auto-throttle, retry with backoff
JsonlStore,JsonStore,CsvStore,StdoutStore- PostgreSQL, SQLite, MySQL stores
- Item pipelines (
DropDuplicates,FilterPipeline,RequireFields) MemoryFrontier,FileFrontier,RedisFrontierLinkExtractorwith allow/deny filtering- HTTP response cache, Bloom filter dedup, robots.txt
- Headless browser fetcher, stealth mode
- Multi-spider engine
CrawlEngine::stream()— async item stream with backpressureSitemapSpider- OpenTelemetry OTLP/gRPC export (
otelfeature)
kumo-derive
0.1.2 — 2026-04-25
- Added
default = "value"— fallback string forStringfields - Added
transform = "trim|lowercase|uppercase"— post-extraction transform
0.1.1 — 2026-04-25
- Added crate metadata:
authors,rust-version,documentation,exclude
0.1.0 — 2026-04-21
#[derive(Extract)]proc-macro for structs with named fieldscss,attr,re,textfield optionsllm_fallback— CSS-first with LLM fallbackStringfields default to"",Option<String>toNone