Beyond the Screen Reader: Can Gemini’s AI Agent “Accessify” the Web?


AI as an Accessibility Bridge: Testing Gemini’s Auto Browse

For blind and low-vision users, the modern web is a minefield of good intentions gone wrong. Developers build visually polished interfaces — date pickers, multi-step dialogs, dynamic dropdowns — but the underlying code often fails to communicate with assistive technology. Screen readers like JAWS and NVDA rely on semantic structure and proper focus management to guide users through a page. When that structure breaks down, so does access.

That gap is exactly what I set out to probe in a recent demonstration of Auto Browse, an agentic AI feature built into the Gemini for Chrome side panel. My test case was deliberately unglamorous: a Salesforce “Add Work” form on the Trailblazer platform, featuring a date picker that routinely defeats standard keyboard navigation. The question wasn’t whether the interface looked functional. It was whether an AI agent could step in and make it work.

The Problem with Date Pickers (and Why It Matters)

Custom date pickers represent one of the most persistent accessibility failures on the web. Unlike native HTML <input type="date"> elements, which browsers render with built-in keyboard support, custom-built widgets frequently rely on mouse interaction, non-semantic markup, or JavaScript behavior that strips focus away from the user mid-task.

In my demo, the Salesforce dialog presents a “start date” selector with separate Month and Year dropdowns. For a sighted mouse user, this is trivial. For a screen reader user navigating by keyboard, it becomes a trap — the list receives focus but refuses to respond to arrow keys or selection commands, leaving the user stuck with no clear path forward.

This is not a niche problem. Date pickers appear in job applications, medical intake forms, financial dashboards, and e-commerce checkouts. When they break, they don’t just create friction — they create exclusion.

Letting the AI Take the Wheel

My approach was straightforward: rather than fighting the inaccessible interface, I delegated the task entirely. With the Gemini side panel open (activated via Alt+G), I issued a plain-language command: “Please set the start date to December 2004.”

What followed was notable not just for what the AI did, but for how it communicated while doing it. Auto Browse autonomously interacted with the form elements — opening the Year dropdown, scrolling to 2004, selecting it — while simultaneously providing real-time status updates in the side panel. Critically, those updates (“Updating the start year to 2004”) were announced by the screen reader, keeping me informed throughout the process without requiring me to shift focus manually.

A “Take Over Task” button remained visible at the top of the browser at all times, ensuring that AI autonomy didn’t come at the cost of user control — a design principle that will resonate with anyone familiar with WCAG’s emphasis on predictability and user agency.

Where It Still Falls Short

I want to be candid about the rough edges, because that honesty is part of what makes this worth examining closely.

During the interaction, the dialog closed unexpectedly at one point, requiring a page reload before I could restart the task. For sighted users, this is a minor inconvenience. For screen reader users, an unexpected context shift — a dialog closing, focus jumping to an unrelated part of the DOM, a dynamic content update that goes unannounced — can be deeply disorienting. Recovery depends on knowing where you are, and that knowledge is precisely what gets lost.

This points to a fundamental challenge for agentic AI in accessibility contexts: it isn’t enough to complete the task correctly; the AI must also maintain a coherent focus environment throughout. If a script refreshes a page region mid-task, the virtual cursor needs to land somewhere intentional. If a dialog closes, the user needs to know what replaced it. These aren’t edge cases — they’re the everyday texture of dynamic web applications, and they’ll need to be handled reliably before tools like Auto Browse can be genuinely depended upon.

A Glimpse of What’s Possible

Despite those caveats, I came away from this demonstration genuinely encouraged. Gemini successfully populated both fields with the correct date, confirmed by the screen reader’s final readout. More importantly, it did so through natural language — no custom scripts, no manual DOM inspection, no workarounds requiring technical knowledge that most users don’t have and shouldn’t need.

The implications extend well beyond date pickers. Agentic AI that can interpret intent and act on a user’s behalf has the potential to make complex web interfaces navigable for people who have been effectively locked out of them. Not by fixing the underlying code — though that remains the gold standard — but by providing a capable, responsive intermediary that can bridge the gap in real time.

The web has always required remediation to be accessible. What’s new is who, or what, might be doing the remediating.

Visual Descriptions (Alt-Text for Video Keyframes)

To ensure this post is as accessible as the technology it discusses, here are descriptions of the critical visual moments in the video:

Frame 1: The Accessibility Barrier
A screenshot of the Salesforce “Add Work” dialog box. The “Month” and “Year” drop-down menus are highlighted, showing the visual interface that I am unable to navigate using standard screen reader commands.
Frame 2: The Gemini Interface
The Chrome browser split-screen view. On the left is the Trailblazer site; on the right is the Gemini side panel where I have typed my request. The AI is showing a progress spinner labeled “Task started.”
Frame 3: Agentic Interaction
The video shows the “Year” drop-down menu on the webpage opening and scrolling automatically as the Gemini agent selects “2004” without any manual mouse movement or keyboard input from the user.
Frame 4: Success Confirmation
The final state of the form showing “December” and “2004” successfully populated in the fields. The Gemini side panel displays a “Task done” message with a summary of the actions performed.

I am a CPWA-certified digital accessibility specialist. When I’m not testing the latest in AI or keeping up with my family, you can find me on the amateur radio bands under the call sign NU7I.

Slack Update Breaks Accessibility in Simplified Layout Mode

In this video, Darrell Hilliker (CPWA) demonstrates a critical accessibility regression in Slack version 4.47.69. While the new “Activity” view aims to consolidate messages and mentions for improved efficiency, it appears to be completely incompatible with Slack’s Simplified Layout Mode when using a screen reader.

Darrell walks through several standard navigation techniques—including keyboard shortcuts (Ctrl+Shift+3), the F6 key to move between regions, and Tab/Arrow key navigation—showing that none of these methods allow a JAWS user to access the actual message content within the Activity tab. Instead of displaying notifications, the screen reader merely reports “loading” or “blank,” effectively locking out users who rely on these specific settings to perform their professional duties.


Bug Report: Accessibility Regression in Slack Activity View

  • Priority: P1 (Critical / Blocker)
  • Status: Open
  • Affected Version: Slack 4.47.69 (Windows 11)
  • Assistive Tech: JAWS 2026
  • Configuration: Simplified Display Mode: ON

Title

New Activity View is Keyboard/Screen Reader Inaccessible in Simplified Display Mode.

Description

The newly introduced “Activity” tab fails to render or focus message content when “Simplified Display Mode” is enabled. This prevents screen reader users from reading mentions, DMs, or threads within the Activity view, creating a total task blocker for collaboration.

Steps to Reproduce

  1. Open Slack (v4.47.69) on Windows 11 with JAWS running.
  2. Ensure Simplified Display Mode is enabled in Slack settings.
  3. Use Ctrl+Shift+3 to navigate to the Activity tab.
  4. Attempt to navigate into the message list using Tab, Arrow Keys, or F6 to switch regions.
  5. Observe the screen reader output and focus behavior.

Actual Behavior

The Activity tab reports as “Loading” or “Blank.” Focus remains trapped on the “Breadcrumbs toolbar” (Activity and Workspace buttons) or “Notification Preferences.” There is no keyboard path to reach or read the actual list of notifications or messages.

Expected Behavior

When the Activity tab is focused, the message list should be populated and reachable via standard keyboard navigation (Tab or Arrow keys). Screen readers should announce the content of mentions, DMs, and threads as they do in the standard view.

Impact Statement

This is a task-blocking accessibility issue. Slack is a critical infrastructure tool for thousands of companies, educational institutions, and government agencies. By breaking compatibility with Simplified Display Mode, this update prevents blind and visually impaired professionals from participating in essential workplace communications, disrupting their ability to perform their jobs.

Demonstration: Guide Accessifies the Addition of Components to Salesforce Experience Cloud Site Pages

At the intersection of the Salesforce ecosystem and the accessibility community, it has been long known that Experience Builder contains task-blocking accessibility issues that hold many disabled people back from being able to perform important job duties including site administration and content management. While the company continues efforts to improve the accessibility of Experience Builder, disabled administrators, content managers and site developers who rely on keyboard-only navigation and screen readers are finding ways to work around barriers thanks to new tools based on artificial intelligence (AI).

Read more

Blind Street Crossing Video

Watch this video of me crossing the busy intersection of Central Ave. and McDowell Road in Phoenix during the start of Friday-evening rush hour on Oct 22 as I made my way to the state fair.

httpv://www.youtube.com/watch?v=v_kH3lNUKfY

Light Rail Arrives, I Climb Aboard and Find a Seat

Check out this Oct. 22 video of me waiting for the light rail, going inside the car and finding an available seat. While this is done thousands of times each day by blind people all over the world, I doubt anyone has posted it as a YouTube video. As always, constructive feedback is welcome.

httpv://www.youtube.com/watch?v=TNkUiuNonBQ

Blind Video Experiment – Standing on a Street Corner – Take 2

I have implemented some useful feedback I received Tuesday morning, and this second take from the same street corner is the result. The iPhone was oriented in landscape mode and I took care to pan more slowly by moving my hips and shoulders while keeping my feet still. Additional feedback is now requested. The next video will be completely different.

httpv://www.youtube.com/watch?v=ADhk-cg4-S8

Blind Video Experiment – Standing on a Street Corner

Watch as I stand on a street corner near my apartment “looking” around with the iPhone’s camera while chatting about what I’m hearing around me. Sighted readers, does this work at all for you? What feedback could you give to help me improve my videography skills?

httpv://www.youtube.com/watch?v=8R0zY0_7HzE

Resolving the "Click to Run ActiveX Control" Error in Internet Explorer 7.0 When Playing Audio on Web Sites

For quite some time now, I have been experiencing the annoying “Click to run ActiveX control” error message when playing audio from various web sites, including those providing audio CAPTCHA alternatives. In some cases, the browser would crash completely after pressing the OK button. An example of a page that can cause the issue is the FormShield audio CAPTCHA demonstration. Thanks to Jeff Bishop for finding and resolving this rather frustrating issue. Follow these steps to reset Internet Explorer to its factory defaults, associate all media types to Windows Media Player and reconfigure your desired media player.

Reset Internet Explorer

Warning: Resetting Internet Explorer returns it to original factory defaults, deletes all temporary files and disables browser add-ons. Though resetting Internet Explorer is recommended to assure full resolution of media and other browser issues, you may wish to skip this first step to see if you can resolve the issue without performing a full browser reset. Follow these steps to reset Internet Explorer to its factory default configuration:

  1. Open Internet Explorer.
  2. Press Alt+T to open the Tools menu.
  3. Press o to open the Internet Options dialogue.
  4. Press Shift+Tab to move to the list of Internet Explorer tabs, then press End to navigate to the Advanced page.
  5. Press Shift+Tab to move to the Reset… button and press Enter to begin the process of disabling add-ons and completely resetting the browser to its factory defaults.
  6. Follow the confirmation prompt by pressing Tab to another Reset button and pressing enter.
  7. Press Tab to move to the Close button and press Enter.
  8. Press OK to restart Internet Explorer.
  9. It is necessary to reconfigure Internet Explorer’s accessibility settings. Press Alt+T followed by o to return to the Internet Options dialogue.
  10. Press Shift+Tab to move to the list of Internet Explorer tabs, then press End to navigate to the Advanced page.
  11. Press Tab to navigate into the Settings tree view.
  12. Press down arrow to navigate to the “Always expand ALT text for images” checkbox and press the space bar to activate this setting.
  13. Press down arrow to navigate to the “Move system caret with focus/selection changes” checkbox and press the space bar to activate this setting.
  14. Press Shift+Tab twice to navigate to the Apply button and press the space bar.
  15. Press Shift+Tab twice to navigate to the OK button and press enter.

Configure Windows Media Player as the Default for all File Types

  1. Press Windows Key +M to minimize all applications and focus on the desktop. If Windows Media Player is not found on the desktop, press the Start button, select All Programs and choose Windows Media Player.
  2. Arrow to Windows Media Player and press Enter to open the application.
  3. Press Alt+T to pull down the Tools menu.
  4. Press o to open the Options dialogue.
  5. Press Control+Shift+Tab three times to navigate to the File Types page.
  6. Press Tab to navigate to the Select All button and press enter.
  7. Tab to OK and press enter.
  8. Press Alt+F4 to close Windows Media Player.

Reconfigure File Type Associations on the Desired Media Player

If you do not use Windows Media Player exclusively, it will now be necessary to reconfigure the associations in your preferred player. For example, follow these steps to select all file types in Winamp:

  1. If you are not already on the desktop, press Windows Key+M to minimize all application windows and return focus to that location.
  2. Arrow to Winamp and press Enter to open the application.
  3. Press Control+P to open Winamp’s preferences dialogue.
  4. Press down arrow until you reach File Types. These options are found under the General Preferences branch of the tree view.
  5. Press tab until you reach the All button and press the space bar. This selects all audio and video file types for association to Winamp.
  6. Press Shift+Tab until you reach the Close button and press enter to dismiss the preferences dialogue.

After making these changes, you should find that audio CAPTCHAs and other web sites that play inline audio and video content will function properly without presenting the “Click to run ActiveX control” error message or crashing the browser.