If I want to use python to achieve RPA effects, are there any good solutions and development aids?

I want to achieve RPA effects on my computer. I would like to ask for advice on using Python to implement RPA. Are there any good solutions and development tools available?

For example, I want to achieve the following: I have a client-side program that needs to automatically log in every morning at 8 AM. The login steps include "launching the program — entering the username — entering the password — clicking login — closing any pop-up that may appear during login — switching to a specified interface — retrieving the text information and images loaded on the interface."

To implement the above functionality, I have a question I don't know how to solve: how to locate the input fields for the username and password, the login button, handle pop-ups, click interface elements, and retrieve text from the client-side program.

Of course, I know that I can use pixel coordinates for positioning, but that approach is too restrictive and makes the operation unstable.

I could take screenshots of specific areas and save them to search for the corresponding element positions, but this method is slow, and if other programs cover the interface, it can easily lead to errors.

What I want to achieve is:

1.Are the elements within the client-side program organized in a hierarchical manner like web elements? 2.What tools can be used to parse the hierarchical structure of a client-side program?
3.Which libraries in Python can be used to locate elements in a client-side program for entering the username and password, clicking the login button, clicking interface elements, and retrieving pop-ups? 4.Additionally, how can I retrieve text and image information from the client-side program?
5.I am looking for a tool similar to Selenium for web applications, but for client-side programs.

No Comments Yet.

Leave a comment