Screen Parsing Task

A Screen Parsing Task is a vision parsing task that extracts structured information (such as UI elements, their text, and their positions) from user interface (UI) screenshots, to support understanding of and interaction with graphical user interfaces.
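To make "structured information" concrete, the sketch below shows one possible shape for the output of a screen parser and how it could support interaction. It is illustrative only: the `UIElement` class, its fields, the `find_clickable` helper, and the sample coordinates are all hypothetical and do not come from any particular screen parsing system.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class UIElement:
    """One parsed UI element (hypothetical schema, for illustration only)."""
    role: str                        # e.g. "button", "text_field"
    text: str                        # visible label or recognized text
    bbox: tuple[int, int, int, int]  # (left, top, right, bottom) in pixels
    clickable: bool = False

    def center(self) -> tuple[int, int]:
        """Return the pixel at the middle of the bounding box."""
        left, top, right, bottom = self.bbox
        return ((left + right) // 2, (top + bottom) // 2)


def find_clickable(elements: list[UIElement], label: str) -> UIElement | None:
    """Return the first clickable element whose text matches label, ignoring case."""
    for element in elements:
        if element.clickable and element.text.lower() == label.lower():
            return element
    return None


# Hand-written stand-in for what a parser might emit for a login screen.
parsed_screen = [
    UIElement("text_field", "Email", (40, 120, 360, 160)),
    UIElement("button", "Sign in", (40, 200, 360, 248), clickable=True),
]

target = find_clickable(parsed_screen, "sign in")
if target is not None:
    print(target.role, target.center())
```

Under these assumptions, the "understanding" part of the task corresponds to producing the element list, and the "interaction" part corresponds to using it, for example by clicking at the center of the matched element.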


