Commit Graph

18 Commits

Author SHA1 Message Date
lilong.129
1694f36837 change: replace openai model with ark model 2025-07-03 22:13:23 +08:00
lilong.129
b1719344c0 feat: enhance AI result handling with model name and usage statistics for query, action, and assertion types 2025-06-24 15:25:12 +08:00
lilong.129
1f3366453e feat: implement structured response parsing with enhanced error recovery and UTF-8 sanitization 2025-06-18 16:59:35 +08:00
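The commit above pairs UTF-8 sanitization with error recovery when parsing a structured model reply. The sketch below is one possible reading of that idea, not the repository's code: the `QueryResult` type and the `parseStructured` helper are hypothetical names introduced for illustration. It assumes a reply that should be a JSON object but may contain invalid bytes or plain prose.

```go
package main

import (
	"encoding/json"
	"fmt"
	"strings"
)

// QueryResult is a hypothetical response shape used only for this sketch.
type QueryResult struct {
	Content string `json:"content"`
	Thought string `json:"thought"`
}

// parseStructured (hypothetical) sanitizes invalid UTF-8, tries strict JSON
// decoding, and falls back to returning the reply as plain content.
func parseStructured(raw string) QueryResult {
	// Replace invalid byte sequences with U+FFFD so later steps see valid UTF-8.
	clean := strings.ToValidUTF8(raw, "\uFFFD")

	var res QueryResult
	if err := json.Unmarshal([]byte(clean), &res); err == nil {
		return res
	}
	// Recovery path: keep the sanitized text instead of failing the call.
	return QueryResult{Content: strings.TrimSpace(clean)}
}

func main() {
	fmt.Printf("%+v\n", parseStructured(`{"content":"ok","thought":"done"}`))
	fmt.Printf("%+v\n", parseStructured("not json \xff at all"))
}
```

Returning a fresh struct on the fallback path avoids leaking fields that a failed decode may have partially filled.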
lilong.129
7c45acd061 feat: add AI Querier module with custom output schema support and refactor common model calling logic
- Add new AI Querier module for structured information extraction from screenshots
- Support custom output schema for structured data response
- Implement automatic type conversion and data validation
- Add comprehensive test suite with various data structure examples
- Refactor callModelWithLogging to utils.go as shared function for planner, asserter, and querier
- Eliminate code duplication across AI modules (30+ lines of repeated code)
- Improve maintainability with unified logging and timing logic
- Add environment variable checks in test setup to handle missing API keys gracefully

Key features:
- Custom output schema support with JSON Schema generation
- Automatic data type conversion with reflection
- Fallback mechanisms for robust parsing
- Comprehensive documentation and usage examples
- Backward compatibility with existing functionality
2025-06-10 20:41:35 +08:00
lilong.129
88ae8faee1 feat: enhance VLM response parsing and DOUBAO model support
- Fix JSON extraction logic by prioritizing brace counting method
- Add support for DOUBAO string array coordinate format
- Introduce IS_UI_TARS helper function for model type checking
- Add comprehensive tests for JSON parsing and coordinate handling
- Improve error handling with retry delays for LLM service failures
2025-06-10 15:56:13 +08:00
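The commit above mentions extracting JSON from a model reply by counting braces. A minimal sketch of that general technique follows; it is an assumption about the approach rather than the repository's extractJSONFromContent, and the helper name `extractJSONByBraces` is hypothetical. It tracks string literals and escapes so that a brace inside a quoted value does not affect the depth counter.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// extractJSONByBraces (hypothetical) returns the first balanced {...} span in
// content that is valid JSON, skipping braces that occur inside string literals.
func extractJSONByBraces(content string) (string, bool) {
	start, depth := -1, 0
	inString, escaped := false, false

	// Byte-wise scanning is safe for UTF-8: the delimiters are all ASCII.
	for i := 0; i < len(content); i++ {
		c := content[i]
		if start < 0 {
			if c == '{' {
				start, depth = i, 1
			}
			continue
		}
		if inString {
			switch {
			case escaped:
				escaped = false
			case c == '\\':
				escaped = true
			case c == '"':
				inString = false
			}
			continue
		}
		switch c {
		case '"':
			inString = true
		case '{':
			depth++
		case '}':
			depth--
			if depth == 0 {
				candidate := content[start : i+1]
				if json.Valid([]byte(candidate)) {
					return candidate, true
				}
				start = -1 // balanced but not JSON; keep scanning
			}
		}
	}
	return "", false
}

func main() {
	reply := "Thought: tap the button.\nAction: {\"action\": \"click\", " +
		"\"note\": \"a } inside\", \"point\": {\"x\": 10, \"y\": 20}} Done."
	if obj, ok := extractJSONByBraces(reply); ok {
		fmt.Println(obj)
	}
}
```

Validating each balanced span with `json.Valid` lets the scan skip prose such as `{x}` and continue to a later, well-formed object.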
lilong.129
4959c2e47e feat: extractJSONFromContent 2025-06-10 14:08:44 +08:00
lilong.129
484eebdefd feat: implement multi-model service configuration support
- Support configuring multiple LLM services simultaneously
- Auto-derive model names from service types to simplify configuration
- Maintain backward compatibility with existing configurations
- Refactor configuration logic into dedicated env module
- Add comprehensive unit test coverage
- Update documentation with new configuration approach
2025-06-06 22:17:59 +08:00
lilong.129
d883aa6a21 change: rename VLM name 2025-06-05 18:09:25 +08:00
lilong.129
3b77ade24f refactor: json asserter 2025-05-22 18:22:12 +08:00
lilong.129
c377664518 refactor: add LLMServiceTypeDoubaoVL 2025-05-22 15:34:11 +08:00
lilong.129
b2ab14efcc refactor: rename to AssertionResult 2025-05-19 11:51:49 +08:00
lilong.129
3f1ee03529 refactor: mcphost planner 2025-05-18 21:55:01 +08:00
lilong.129
fcddcfb630 refactor: GetModelConfig 2025-04-30 15:21:17 +08:00
lilong.129
0e9389c796 refactor: NewXTDriver api, return error if init failed 2025-04-30 14:31:36 +08:00
lilong.129
cc9a527274 refactor: select model type by env LLM_MODEL_USE 2025-04-29 23:14:12 +08:00
lilong.129
3ffa5d96d2 refactor: config llm env 2025-04-29 22:33:18 +08:00
lilong.129
429bfe3986 feat: assert with openai model 2025-04-29 22:03:11 +08:00
lilong.129
4d7c7e8aaf refactor: ai asserter 2025-04-29 20:08:22 +08:00