Natural Language Processing (NLP) with Python spaCy Training Course
This instructor-led, live training (online or onsite) is designed for developers and data scientists who want to use spaCy to process large volumes of text to uncover patterns and gain insights.
By the end of this training, participants will be able to:
- Install and configure spaCy.
- Understand spaCy's approach to Natural Language Processing (NLP).
- Extract patterns and derive business insights from extensive data sources.
- Integrate the spaCy library with existing web and legacy applications.
- Deploy spaCy in live production environments to predict human behavior.
- Use spaCy to pre-process text for Deep Learning.
Format of the Course
- Interactive lecture and discussion.
- Plenty of exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
- To learn more about spaCy, please visit: https://spacy.io/
Course Outline
Introduction
- Defining "Industrial-Strength Natural Language Processing"
Installing spaCy
spaCy Components
- Part-of-speech tagger
- Named entity recognizer
- Dependency parser
Overview of spaCy Features and Syntax
Understanding spaCy Modeling
- Statistical modeling and prediction
Using the SpaCy Command Line Interface (CLI)
- Basic commands
Creating a Simple Application to Predict Behavior
Training a New Statistical Model
- Data (for training)
- Labels (tags, named entities, etc.)
Loading the Model
- Shuffling and looping
Saving the Model
Providing Feedback to the Model
- Error gradient
Updating the Model
- Updating the entity recognizer
- Extracting tokens with rule-based matcher
Developing a Generalized Theory for Expected Outcomes
Case Study
- Distinguishing Product Names from Company Names
Refining the Training Data
- Selecting representative data
- Setting the dropout rate
Other Training Styles
- Passing raw texts
- Passing dictionaries of annotations
Using spaCy to Pre-process Text for Deep Learning
Integrating spaCy with Legacy Applications
Testing and Debugging the spaCy Model
- The importance of iteration
Deploying the Model to Production
Monitoring and Adjusting the Model
Troubleshooting
Summary and Conclusion
Requirements
- Python programming experience.
- A basic understanding of statistics
- Experience with the command line
Audience
- Developers
- Data scientists
Open Training Courses require 5+ participants.
Natural Language Processing (NLP) with Python spaCy Training Course - Booking
Natural Language Processing (NLP) with Python spaCy Training Course - Enquiry
Natural Language Processing (NLP) with Python spaCy - Consultancy Enquiry
Testimonials (3)
Examples/exercices perfectly adapted to our domain
Luc - CS Group
Course - Scaling Data Analysis with Python and Dask
Very good preparation and expertise of a trainer, perfect communication in English. The course was practical (exercises + sharing examples of use cases)
Monika - Procter & Gamble Polska Sp. z o.o.
Course - Developing APIs with Python and FastAPI
Trainer develops training based on participant's pace
Farris Chua
Course - Data Analysis in Python using Pandas and Numpy
Upcoming Courses
Related Courses
Advanced LangGraph: Optimization, Debugging, and Monitoring Complex Graphs
35 HoursLangGraph is a framework designed for constructing stateful, multi-actor LLM applications using composable graphs that maintain persistent state and provide control over execution.
This instructor-led, live training (available online or on-site) targets advanced AI platform engineers, DevOps professionals for AI, and ML architects who aim to optimize, debug, monitor, and operate production-grade LangGraph systems.
By the end of this training, participants will be able to:
- Design and optimize complex LangGraph topologies for improved speed, cost efficiency, and scalability.
- Enhance reliability through mechanisms such as retries, timeouts, idempotency, and checkpoint-based recovery.
- Effectively debug and trace graph executions, inspect state, and systematically reproduce production issues.
- Instrument graphs with logs, metrics, and traces, deploy them to production, and monitor SLAs and costs.
Format of the Course
- Interactive lecture and discussion sessions.
- Extensive exercises and practical activities.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange the details.
Building Coding Agents with Devstral: From Agent Design to Tooling
14 HoursDevstral is an open-source framework designed to create and run coding agents that can interact with codebases, developer tools, and APIs, thereby enhancing engineering productivity.
This instructor-led, live training (available online or on-site) is targeted at intermediate to advanced ML engineers, developer-tooling teams, and SREs who want to design, implement, and optimize coding agents using Devstral.
By the end of this training, participants will be able to:
- Set up and configure Devstral for developing coding agents.
- Create agentic workflows for exploring and modifying codebases.
- Integrate coding agents with developer tools and APIs.
- Implement best practices for secure and efficient agent deployment.
Format of the Course
- Interactive lectures and discussions.
- Extensive exercises and practice sessions.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Scaling Data Analysis with Python and Dask
14 HoursThis instructor-led, live training in Taiwan (online or onsite) is aimed at data scientists and software engineers who wish to use Dask with the Python ecosystem to build, scale, and analyze large datasets.
By the end of this training, participants will be able to:
- Set up the environment to start building big data processing with Dask and Python.
- Explore the features, libraries, tools, and APIs available in Dask.
- Understand how Dask accelerates parallel computing in Python.
- Learn how to scale the Python ecosystem (Numpy, SciPy, and Pandas) using Dask.
- Optimize the Dask environment to maintain high performance in handling large datasets.
Data Analysis with Python, Pandas and Numpy
14 HoursThis instructor-led, live training in Taiwan (online or onsite) is aimed at intermediate-level Python developers and data analysts who wish to enhance their skills in data analysis and manipulation using Pandas and NumPy.
By the end of this training, participants will be able to:
- Set up a development environment that includes Python, Pandas, and NumPy.
- Create a data analysis application using Pandas and NumPy.
- Perform advanced data wrangling, sorting, and filtering operations.
- Conduct aggregate operations and analyze time series data.
- Visualize data using Matplotlib and other visualization libraries.
- Debug and optimize their data analysis code.
Open-Source Model Ops: Self-Hosting, Fine-Tuning and Governance with Devstral & Mistral Models
14 HoursDevstral and Mistral models are open-source AI technologies designed for flexible deployment, fine-tuning, and scalable integration.
This instructor-led, live training (online or onsite) is aimed at intermediate to advanced ML engineers, platform teams, and research engineers who wish to self-host, fine-tune, and manage Mistral and Devstral models in production environments.
By the end of this training, participants will be able to:
- Set up and configure self-hosted environments for the Mistral and Devstral models.
- Apply fine-tuning techniques to enhance performance for specific domains.
- Implement versioning, monitoring, and lifecycle management practices.
- Ensure secure, compliant, and responsible use of open-source models.
Format of the Course
- Interactive lectures and discussions.
- Practical exercises in self-hosting and fine-tuning.
- Live-lab implementation of governance and monitoring pipelines.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
FARM (FastAPI, React, and MongoDB) Full Stack Development
14 HoursThis instructor-led, live training in (online or onsite) is aimed at developers who wish to use the FARM (FastAPI, React, and MongoDB) stack to build dynamic, high-performance, and scalable web applications.
By the end of this training, participants will be able to:
- Set up the necessary development environment that integrates FastAPI, React, and MongoDB.
- Understand the key concepts, features, and benefits of the FARM stack.
- Learn how to build REST APIs with FastAPI.
- Learn how to design interactive applications with React.
- Develop, test, and deploy applications (front end and back end) using the FARM stack.
Developing APIs with Python and FastAPI
14 HoursThis instructor-led, live training in Taiwan (online or onsite) is aimed at developers who wish to use FastAPI with Python to build, test, and deploy RESTful APIs easier and faster.
By the end of this training, participants will be able to:
- Set up the necessary development environment to develop APIs with Python and FastAPI.
- Create APIs quicker and easier using the FastAPI library.
- Learn how to create data models and schemas based on Pydantic and OpenAPI.
- Connect APIs to a database using SQLAlchemy.
- Implement security and authentication in APIs using the FastAPI tools.
- Build container images and deploy web APIs to a cloud server.
LangGraph Applications in Finance
35 HoursLangGraph is a framework designed for constructing stateful, multi-actor LLM applications using composable graphs that maintain persistent state and offer control over execution.
This instructor-led, live training (available both online and onsite) is targeted at intermediate to advanced professionals who aim to design, implement, and manage LangGraph-based financial solutions while ensuring proper governance, observability, and compliance.
By the end of this training, participants will be able to:
- Create finance-specific LangGraph workflows that comply with regulatory and audit requirements.
- Incorporate financial data standards and ontologies into the graph state and tooling.
- Implement reliability, safety, and human-in-the-loop controls for critical processes.
- Deploy, monitor, and optimize LangGraph systems to meet performance, cost, and SLA requirements.
Format of the Course
- Interactive lectures and discussions.
- Ample exercises and practice sessions.
- Hands-on implementation in a live lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
LangGraph Foundations: Graph-Based LLM Prompting and Chaining
14 HoursLangGraph is a framework designed to build graph-structured LLM applications that support planning, branching, tool use, memory, and controllable execution.
This instructor-led, live training (available online or onsite) is aimed at beginner-level developers, prompt engineers, and data practitioners who want to design and build reliable, multi-step LLM workflows using LangGraph.
By the end of this training, participants will be able to:
- Understand key LangGraph concepts (nodes, edges, state) and know when to apply them.
- Create prompt chains that can branch, call tools, and maintain memory.
- Incorporate retrieval and external APIs into graph workflows.
- Test, debug, and evaluate LangGraph applications for reliability and safety.
Format of the Course
- Interactive lectures and facilitated discussions.
- Guided labs and code walkthroughs in a sandbox environment.
- Scenario-based exercises focused on design, testing, and evaluation.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
LangGraph in Healthcare: Workflow Orchestration for Regulated Environments
35 HoursLangGraph facilitates stateful, multi-actor workflows driven by LLMs, offering precise control over execution paths and state persistence. In the healthcare sector, these features are essential for ensuring compliance, interoperability, and the development of decision-support systems that align with medical workflows.
This instructor-led, live training (available online or on-site) is designed for intermediate to advanced professionals who aim to design, implement, and manage LangGraph-based healthcare solutions while addressing regulatory, ethical, and operational challenges.
By the end of this training, participants will be able to:
- Create healthcare-specific LangGraph workflows with a focus on compliance and auditability.
- Integrate LangGraph applications with medical ontologies and standards such as FHIR, SNOMED CT, and ICD.
- Implement best practices for reliability, traceability, and explainability in sensitive environments.
- Deploy, monitor, and validate LangGraph applications in healthcare production settings.
Format of the Course
- Interactive lectures and discussions.
- Hands-on exercises with real-world case studies.
- Practical implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
LangGraph for Legal Applications
35 HoursLangGraph is a framework designed for constructing stateful, multi-actor LLM applications using composable graphs that maintain persistent state and offer precise control over execution.
This instructor-led, live training (available both online and onsite) targets intermediate to advanced professionals who aim to design, implement, and operate LangGraph-based legal solutions with the necessary compliance, traceability, and governance controls.
By the end of this training, participants will be able to:
- Design legal-specific LangGraph workflows that ensure auditability and compliance.
- Integrate legal ontologies and document standards into graph state and processing.
- Implement guardrails, human-in-the-loop approvals, and traceable decision paths.
- Deploy, monitor, and maintain LangGraph services in production with observability and cost controls.
Format of the Course
- Interactive lectures and discussions.
- Extensive exercises and practice sessions.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Building Dynamic Workflows with LangGraph and LLM Agents
14 HoursLangGraph is a framework designed for creating graph-structured workflows with LLMs, supporting branching, tool integration, memory management, and controlled execution.
This instructor-led, live training (available online or on-site) is targeted at intermediate engineers and product teams who want to merge LangGraph’s graph logic with LLM agent loops to develop dynamic, context-aware applications such as customer support agents, decision trees, and information retrieval systems.
By the end of this training, participants will be able to:
- Design graph-based workflows that coordinate LLM agents, tools, and memory effectively.
- Implement conditional routing, retries, and fallbacks to ensure robust execution.
- Integrate retrieval mechanisms, APIs, and structured outputs into agent loops.
- Evaluate, monitor, and enhance the reliability and safety of agent behavior.
Format of the Course
- Interactive lectures and facilitated discussions.
- Guided labs and code walkthroughs in a sandbox environment.
- Scenario-based design exercises and peer reviews.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
LangGraph for Marketing Automation
14 HoursLangGraph is a graph-based orchestration framework designed to facilitate conditional, multi-step workflows involving language models and tools. It is particularly useful for automating and personalizing content pipelines.
This instructor-led, live training (available both online and on-site) is tailored for intermediate-level marketers, content strategists, and automation developers who are interested in implementing dynamic, branching email campaigns and content generation pipelines using LangGraph.
By the end of this training, participants will be able to:
- Create graph-structured content and email workflows that incorporate conditional logic.
- Integrate language models, APIs, and data sources for automated personalization.
- Manage state, memory, and context throughout multi-step campaigns.
- Assess, monitor, and optimize the performance and delivery outcomes of workflows.
Format of the Course
- Interactive lectures and group discussions.
- Hands-on labs for implementing email workflows and content pipelines.
- Scenario-based exercises focusing on personalization, segmentation, and branching logic.
Course Customization Options
- For a customized training session for this course, please contact us to arrange.
Le Chat Enterprise: Private ChatOps, Integrations & Admin Controls
14 HoursLe Chat Enterprise is a private ChatOps solution that offers secure, customizable, and governed conversational AI capabilities for organizations. It supports Role-Based Access Control (RBAC), Single Sign-On (SSO), connectors, and integrations with enterprise applications.
This instructor-led, live training (available online or on-site) is designed for intermediate-level product managers, IT leaders, solution engineers, and security/compliance teams who want to deploy, configure, and govern Le Chat Enterprise in their enterprise environments.
By the end of this training, participants will be able to:
- Set up and configure Le Chat Enterprise for secure deployments.
- Enable RBAC, SSO, and compliance-driven controls.
- Integrate Le Chat with enterprise applications and data stores.
- Design and implement governance and admin playbooks for ChatOps.
Format of the Course
- Interactive lectures and discussions.
- Extensive exercises and practice sessions.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Accelerating Python Pandas Workflows with Modin
14 HoursThis instructor-led, live training in Taiwan (online or onsite) is aimed at data scientists and developers who wish to use Modin to build and implement parallel computations with Pandas for faster data analysis.
By the end of this training, participants will be able to:
- Set up the necessary environment to start developing Pandas workflows at scale with Modin.
- Understand the features, architecture, and advantages of Modin.
- Know the differences between Modin, Dask, and Ray.
- Perform Pandas operations faster with Modin.
- Implement the entire Pandas API and functions.