11 Matching Annotations
  1. Jul 2025
    1. In today’s fast-moving, AI-powered era, autonomous agents are playing a bigger role than ever. They help businesses run more smoothly and make decisions that affect millions of lives every day. These systems are designed to make our lives easier and unlock new opportunities, but we can’t get carried away: we need proper AI agent evaluation frameworks and best practices to ensure these systems actually work as intended and follow ethical AI principles.

      Explore the key metrics, tools, and frameworks used for AI agent evaluation. Learn how to assess performance, reliability, and efficiency of AI agents in real-world scenarios.

  2. Oct 2024
  3. Dec 2023
  4. Feb 2023
    1. What we should ultimately care about is being able to use our knowledge to produce something new, whatever that may be. To go beyond mere reproduction, you must understand the material. And understanding requires application, a hermeneutic principle that Gadamer in particular worked out extensively. If you really want to measure your level of understanding, try to apply or explain something to yourself or someone else.
  5. Jan 2023
  6. Jun 2021
  7. Jul 2020
  8. Feb 2020
  9. Nov 2019
  10. Jun 2015
    1. If you want a DIY version of this, try ImageMagick's compare command:

          compare bag_frame1.gif bag_frame2.gif compare.gif

      Documentation: http://www.imagemagick.org/Usage/compare/

      no_gravity, 2 hours ago: Interesting. And to render the html you can use cutycapt. Which is in the Debian repos too. I just tried it, and this rendered a nice diff of two pages for me:

          cutycapt --url=http://www.xyz.com/1 --out=1.png
          cutycapt --url=http://www.xyz.com/2 --out=2.png
          compare 1.png 2.png diff.png

      easytiger, 2 hours ago: Now make it a webservice and charge $200/month!

      programmernews3, 1 hour ago: First part already done: http://archive.is/

      prottmann, 1 hour ago: apt-get install the services and use them on my server ;-)

      Plain and simple way to report visual degradation of website or webapp
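
The capture-then-compare recipe from the thread above can be wrapped in a small script. What follows is only a minimal sketch, assuming `cutycapt` and ImageMagick's `compare` are installed and on the PATH. The function names (`capture_cmd`, `compare_cmd`, `visual_diff`), the `workdir` and `run` parameters, and the choice of `-metric AE` (a count of differing pixels) are illustrative assumptions, not something the commenters proposed.

```python
import subprocess
from pathlib import Path


def capture_cmd(url, out_png):
    """Build the cutycapt call, using the same flags as the thread."""
    return ["cutycapt", f"--url={url}", f"--out={out_png}"]


def compare_cmd(before_png, after_png, diff_png):
    """Build the ImageMagick compare call; -metric AE is an added assumption."""
    return ["compare", "-metric", "AE",
            str(before_png), str(after_png), str(diff_png)]


def visual_diff(url_before, url_after, workdir=".", run=subprocess.run):
    """Render both URLs to PNG and diff them.

    Returns (changed, diff_path). `run` is injectable (hypothetical
    parameter) so the command sequence can be checked without the tools.
    """
    workdir = Path(workdir)
    before, after, diff = (workdir / name
                           for name in ("1.png", "2.png", "diff.png"))

    run(capture_cmd(url_before, before), check=True)
    run(capture_cmd(url_after, after), check=True)

    # compare exits 0 for similar images, 1 for dissimilar, 2 on error,
    # so check=True is deliberately not used here.
    result = run(compare_cmd(before, after, diff),
                 capture_output=True, text=True)
    if result.returncode not in (0, 1):
        raise RuntimeError(f"compare failed: {result.stderr}")
    return result.returncode == 1, diff


if __name__ == "__main__":
    changed, diff_path = visual_diff("http://www.xyz.com/1",
                                     "http://www.xyz.com/2")
    print("pages differ" if changed else "pages match", "->", diff_path)
```

Passing `run` as a parameter keeps the sketch easy to dry-run: a stand-in that records the command lists shows exactly what would be executed before anything touches the network.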