6 Matching Annotations
  1. Last 7 days
    1. feat(benchmarks): add screenshot-based evaluator, screenshot collector, and --parallelize flag - Add screenshot-based LLM judge evaluator (evaluator.ts) - Add ScreenshotCollector for capturing browser screenshots during runs

      令人惊讶的是:这个项目包含一个基于截图的评估系统,使用LLM作为评判员来评估自动化任务的结果。它能够捕获浏览器截图并在运行过程中收集这些视觉数据,这为网页自动化任务提供了一种全新的评估方式,超越了传统的文本比较方法。

  2. Jun 2021
  3. May 2020
  4. Nov 2015
  5. Jun 2015
    1. If you want a DIY version of this, try ImageMagick's compare command: compare bag_frame1.gif bag_frame2.gif compare.gif Documentation: http://www.imagemagick.org/Usage/compare/ reply no_gravity 2 hours ago Interesting. And to render the html you can use cutycapt. Which is in the Debian repos too. I just tried it, and this rendered a nice diff of two pages for me: cutycapt --url=http://www.xyz.com/1 --out=1.png cutycapt --url=http://www.xyz.com/2 --out=2.png compare 1.png 2.png diff.png reply easytiger 2 hours ago Now make it a webservice and charge $200/month! reply programmernews3 1 hour ago First part already done: http://archive.is/ reply prottmann 1 hour ago apt-get install the services and use them on my server ;-) reply

      Plain and simple way to report visual degradation of website or webapp