Skip to content

Conversation

han032206
Copy link

Pull Request: Integrate WebCanvas Key Node Evaluation and Mind2web-live Benchmark into BrowserGym

Description

This PR officially integrates the WebCanvas key node evaluation and the Mind2web-live benchmark into BrowserGym.

Core Features

  • Key Node-Based Evaluation:

    • Implements a key node-based evaluation system to provide detailed assessments of web task processes.
  • JavaScript Event Evaluation:

    • Utilizes page JavaScript events to deliver accurate evaluations independent of the action space.
  • Debug Modules and Logging:

    • Includes various debug modules and logging functionalities to clearly display the evaluation process.
  • Mind2web-live Dataset Integration:

    • Integrates the Mind2web-live dataset to support benchmark testing.
  • Community Contributions:

    • Encourages further contributions from the community to expand and refine the evaluation framework and benchmarks.

@amanjaiswal73892 amanjaiswal73892 added enhancement New feature or request help wanted Extra attention is needed labels Jul 16, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request help wanted Extra attention is needed
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants