Improving the AI Experience
Vendors don't need to wait on regulators or scientists to make these five simple fixes.
Part Two of Two
Yes, it’s finally happened. The word “Luddite” is fashionable again. Signs of an AI backlash are everywhere, but it’s clear Wall Street won’t be abandoning this cluster of technologies any time soon.
Federal regulation such as the Hawley-Blumenthal framework is being contemplated, but the tech industry has reason to be skeptical of legislative solutions. In the meantime, here are five steps that vendors can take to improve outcomes.
AI Wishlist
This list is written from the perspective of UX (my actual day job) rather than a trader’s, academic’s, or analyst’s perspective. Ticker symbols are not included because detailed product reviews and comparisons simply don’t exist yet for many AI solution providers. Very few are publicly traded.
Disclose. Most of us like to know when we are talking to a real person or a simulation on the phone. Same with web or video chat. It helps us manage expectations. Just as food products tell customers what is artificial, software can also label the “artificial ingredients” in a given interaction. A small button or icon (no larger than the lock 🔒 icon commonly used to show the presence of encryption) can indicate the presence of predictive or generative AI.
Flag apparent bias before release. The major problem with predictive AI is that it’s a black box. You know approximately what types of results you’re looking for, and you’re sorting through a vast field with multiple variables (often interdependent) influencing your search. But there is no straightforward formula of the kind banks use to calculate acceptable housing costs when deciding whether to make a loan. You don’t want your brand to get a reputation for racism or other forms of bias.
So be proactive. Run audits. Make sure everyone in management understands the difference between test, training, and validation data. Write this phase of auditing into your sales contracts and project timelines. If the end results are out of whack with existing demographics, throw the model out.
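As a minimal sketch of what one such audit might look like, the snippet below compares approval rates across demographic groups and flags any group that falls below four-fifths of the best-performing group's rate. The group labels, toy data, and the four-fifths threshold are illustrative assumptions, not a compliance standard.

```python
from collections import defaultdict

def selection_rates(decisions):
    """Compute the approval rate per demographic group.

    decisions: list of (group_label, approved) tuples,
    where approved is True or False.
    """
    totals = defaultdict(int)
    approved = defaultdict(int)
    for group, ok in decisions:
        totals[group] += 1
        if ok:
            approved[group] += 1
    return {g: approved[g] / totals[g] for g in totals}

def flag_disparate_impact(decisions, threshold=0.8):
    """Flag groups whose approval rate is below `threshold` times
    the highest group's rate (the familiar four-fifths rule of thumb)."""
    rates = selection_rates(decisions)
    best = max(rates.values())
    return sorted(g for g, r in rates.items() if r < threshold * best)

# Toy audit data: group "A" approved 75% of the time, "B" only 25%.
decisions = [
    ("A", True), ("A", True), ("A", True), ("A", False),
    ("B", True), ("B", False), ("B", False), ("B", False),
]
print(flag_disparate_impact(decisions))  # ['B']
```

A real audit would run this over held-out validation data, never the training set, which is exactly why everyone in management needs to know the difference.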
If you are called upon to explain your system’s discriminatory practices in a court of law, “just because” is never a good enough answer.
Provide citation. The extra step of identifying publicly available sources for citation cannot be optional. Maybe it’s time to train a few models on Wikipedia. Or just add links to search engine results on related topics and keywords as a starting point. That’s what human users are stuck doing right now.
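A low-effort version of that starting point might look like the sketch below: append web-search links for an answer's key terms so readers can verify claims themselves. The function name, the keyword list, and the choice of search engine are all assumptions for illustration.

```python
from urllib.parse import quote_plus

def append_search_citations(answer, keywords):
    """Append a 'Further reading' block of web-search links
    for the given keywords to a generated answer."""
    links = [
        f"- {kw}: https://duckduckgo.com/?q={quote_plus(kw)}"
        for kw in keywords
    ]
    return answer + "\n\nFurther reading:\n" + "\n".join(links)

print(append_search_citations(
    "Transformers rely on self-attention.",
    ["transformer architecture", "self-attention"],
))
```

Crude, yes, but it hands the verification work back to the reader with one line of post-processing instead of pretending the model's output is self-evidently true.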
Bring back the sandbox. Reliance on the cloud is profitable for Google and Amazon, but it squelches innovation. With the exception of Large Language Models (LLMs), most machine learning models don’t actually require supercomputers to run. Yet client-side tutorials and libraries, even those from well-known names such as PyTorch and TensorFlow, are sporadically maintained, if at all. In the past, Apple offered tools to build machine learning directly into client-side apps for cell phones, laptops, and tablets. By and large these still function, but opportunities for customizing the models themselves are limited. Important gaps still exist that AI can fill — particularly around assistive technologies, reducing carbon footprint, and content moderation on community-run forums. Coming up with the right tools doesn’t necessarily require a large budget or a Ph.D. It does require working code from which to refine and iterate.
Invite feedback. There is no question that AI does some things well. But on other tasks, it underperforms. Surveying users and gathering the types of performance ratings that are standard for other customer service interactions should be the norm, not the exception.
Just because your employee isn’t human is no excuse to skip their performance review!
Bonus Content
Me vs the Machine
I finally tried out ChatGPT from OpenAI. Scroll down if you’d like to read the alternate version of this article (generated entirely by the model from a prompt).
While I couldn’t find a “feedback” icon in the UI, my overall experience was positive. Simply put, it’s a great alternative to Stack Overflow. It excelled at answering programming questions with specific language constraints and even provided code samples. While it didn’t help me directly with any of the specific tasks on my list, I haven’t yet closed the browser window. That should count for something.
AI on AI
Before writing this article, I asked ChatGPT for its perspective. The result was grammatical and logically coherent. Enough so that I’m prepared to go head-to-head.