EPASS, AI Evaluation Platform

As the main product at HydroX AI, the core of EPASS is to enable enterprises and individuals to test their LLMs for safety and security.


As the founding designer, I crafted the entire platform from scratch and am continuously refining and making additions to ensure an optimal user experience and functionality.


I'm only going to highlight a few pages of this product, to check out the full product visit it directly: Here!

Role:

Product Designer

Team:

2 PM, 3 Engineers

Duration:

Sep 2023 - Present

Tools:

Figma, React

Challenge

How can we efficiently and intuitively integrate all the company's features into the platform to maximize functionality and user experience?

Information Architecture

This information architecture map is constantly growing as more and more features are being added. The biggest challenge in creating this platform is how to effectively organize and guide customers through all the components in the easiest way possible.


The technology is complex but simplicity is the key!

Main Goal

The core functionality of this platform is to deploy test, judge, and evaluate.

Users

Initially, this platform targets enterprises as large as Meta and Google, to individual model creators. However, the emphasis is placed on enterprise customer's AI teams.

Dashboard

Visualizing aggregated metrics from all evaluations of LLM's

The biggest questions when designing this page was: What do our customers care about the most?


To tackle this question, I was able to join in business meetings with potential partners to understand key painpoints. Through these conversations I found that what they cared about are:


  1. Overall how safe is this model compared to others ➡️ Overall Score Trend Graph

  2. Which areas is this model the weakest ➡️ High Risk Categories

  3. Some companies only care about certain areas (such as Sexual Content, Political Sensitivity) and want to track those specifically ➡️ Category Insights

Early Prototypes and Designs

Takeaways

Working alone on such a large scale platform was definitely challenging. The biggest takeaway I had was to work through iteration not perfection. This platform is not (and will never be) perfect, make something happen first, then improve on it!


Another hard truth I had to accept was that the final product will not always match design. Working with engineers is essential and be willing to accept that sometimes it's just not going to be EXACTLY the way it looks on Figma.

There's more

There is so so so much more to this project. Features, pages, research, low fidelity mockups, and bare-bone sketches aren’t included in these few snapshots.


To see more please reach out!

Next Project ➡️

Next Project ➡️

Next Project ➡️

Back

Background

Research

Dashboard

Takeaways

More