A complete 10-week curriculum to build an AI-powered workflow automation agent from scratch — no prior ML knowledge required. You do not need to watch or read every resource; use whatever works best for you, focus on understanding, and once a topic clicks move on to the next part. Do not get stressed by the number of links — you can always ask AI to help explain anything confusing and come back to the resources later.
By end of this week, you should have a working dev environment and demonstrate comfort with Python.
Build 3 small automation scripts that will later become building blocks of your agent:
After this week, you should be able to control a browser programmatically — clicking, typing, navigating, and extracting data. This is the execution layer of your agent.
Build the core intelligence module of your agent — a function that converts natural language into structured browser actions:
{"action": "fill_form"|"navigate"|"email"|"summarize"|"click", "target_url": "...", "data": {...}, "steps": [...]}Your intent parser correctly converts user commands into structured JSON action plans. This is the "brain" of your agent.
Integrate the Week 2 and Week 3 work into a LangChain agent that can actually use the browser:
Build the server that the frontend will communicate with:
Prepare for the build phase with clear plans and a working UI shell:
wait_for_selector()Demonstrate a complete task execution across these scenarios:
The agent can now handle chained tasks: navigate → read context → act → report. The core execution loop is working.
Present a fully functional demo covering all 5 milestones:
These are deeper resources to consult when you need to go further on specific topics. Not required, but valuable.