Delphi for Python GUI

Somewhat modular text2image GUI, initially just for Stable Diffusion.

Relies on a slightly customized fork of the InvokeAI Stable Diffusion code (Code Repo). Multiple prompts at once: enter each prompt on a new line (newline-separated). Word wrapping does not count ...

IEEE

Deep GUI: Black-box GUI Input Generation with Deep Learning

Abstract: Despite the proliferation of Android testing tools, Google Monkey has remained the de facto standard for practitioners. The popularity of Google Monkey is largely due to the fact that it is ...

Microsoft

Coordinate-Free Visual Grounding for GUI Agents

One of the principal challenges in building VLM-powered GUI agents is visual grounding, i.e., localizing the appropriate screen region for action execution based on both the visual content and the ...

GitHub

Test-Time Reinforcement Learning for GUI Grounding via Region Consistency

Current GUI grounding approaches rely heavily on large-scale pixel-level annotations and training-time optimization, which are expensive, inflexible, and difficult to scale to new domains. We observe ...
