Testing: Added comprehensive Python API client tests with streaming validation
v1.6 (2025-12-04)
Major Features
Streaming LLM Responses (#566, #567): Comprehensive streaming support for LLM text completion enabling real-time token-by-token delivery:
Infrastructure with streaming flag in schemas, Gateway API (REST/WebSocket), Python API, and CLI tools.
Full streaming implementation across all LLM providers including Azure, Azure OpenAI, Bedrock, Claude, Cohere, Google AI Studio, Llamafile, LM Studio, Mistral, Ollama, OpenAI, TGI, Vertex AI, and vLLM
Backward compatible with existing non-streaming clients
Support for WebSocket streaming
Reduces time-to-first-token and improved UX for long responses
Streaming RAG Responses (#568): Extended streaming support to GraphRAG and DocumentRAG services:
Token-by-token responses for knowledge graph and document retrieval queries
Consistent streaming UX across all TrustGraph services
Gateway support via WebSocket for real-time client applications
Streaming Agent Interactions (#570): Enhanced agent framework with streaming support:
Real-time streaming of ReAct agent thought/observation/answer chunks
Incremental response delivery for multi-step agent workflows
Streaming parser for agent responses with robust error handling
Improvements
Enhanced Integration Tests (#568, #570): Comprehensive test coverage for streaming functionality
Bug Fixes
AWS Bedrock Model Invocation (#572): Fixed compatibility issues with newer Bedrock model invocation API including proper streaming support
Minio Library Compatibility (#565): Fixed incompatible library change in Minio client for blob storage operations
Streaming Agent Interactions (#570): Fixed race conditions and message ordering issues in streaming agent responses
Infrastructure / Technical
CLI Improvements: Enhanced CLI tools with streaming output:
tg-dump-queues: New utility for developer queue diagnostics
Templates
Updates to Bedrock and Claude models to support latest models
v1.5 (2025-11-23)
New Features
OntoRAG: Ontology-Based Knowledge Extraction (#523): New processor kg-extract-ontology that uses ontology objects from config to guide triple extraction. Includes entity contexts and integrates with ontology extractor from workbench.
MCP Authentication (#557): Added MCP auth token header support for the simple authentication case.
Dynamic Embeddings Model Selection (#556): Embeddings model can now be selected dynamically rather than being fixed.
Bug Fixes
Collection deletion batch error (#559): Fixed batch error during collection deletion by reducing batch size.
Fix hard-coded vector size (#555): Vector store now lazy-creates collections with different collections for different dimension lengths.
Fix AgentStep schema error (#557): Agent step argument values are now converted to strings to fix schema errors.
Remove unnecessary OpenAI parameters (#561): Removed parameters from OpenAI invocation that were causing compatibility issues with OpenAI-compatible services.
Infrastructure / Technical
Python 3.13 Support (#553): Upgraded to Python 3.13, switched from cassandra-driver to scylla-driver (cassandra-driver doesn’t work with Python 3.13).
Vector Store Lifecycle Tech Spec (#555): Added technical specification for vector store lifecycle.
OntoRAG Tech Spec (#523, #558): Added technical specification for ontology-based knowledge extraction and query.
Testing
Ontology extraction tests (#560): Added tests for ontology extraction.
Dynamic embeddings tests (#556): Added tests for dynamic embeddings model selection.
MCP auth and agent step parsing tests (#557): Added tests for MCP authentication and agent step parsing.
OpenAI invocation tests (#561): Updated tests for OpenAI parameter changes.
v1.4 (2025-10-06)
New Features
Flow Configurable Parameters (#526, #530, #531, #532, #533, #541): Major enhancements to flow parameter system:
Flow configurable parameters with dynamic settings
LLM dynamic settings using llm-model and llm-rag-model parameters
Temperature parameter support for all LLMs
Flow creation uses parameter defaults in API and CLI
Advanced parameter mode with controlled-by relationships
New CLI tools: tg-show-parameter-types
Dynamic chunking parameters
Structured Data Diagnosis Service (#518, #519):
New structured data diagnosis service plumbed into API gateway