
*****************************************
start_ml_labeling_set_generation_task_run
*****************************************



.. py:method:: Glue.Client.start_ml_labeling_set_generation_task_run(**kwargs)

  

  Starts the active learning workflow for your machine learning transform to improve the transform's quality by generating label sets and adding labels.

   

  When the ``StartMLLabelingSetGenerationTaskRun`` task run finishes, Glue will have generated a "labeling set": a set of questions for humans to answer.

   

  In the case of the ``FindMatches`` transform, these questions are of the form, “What is the correct way to group these rows together into groups composed entirely of matching records?”

   

  After the labeling process is finished, you can upload your labels with a call to ``StartImportLabelsTaskRun``. After ``StartImportLabelsTaskRun`` finishes, all future runs of the machine learning transform will use the new and improved labels and perform a higher-quality transformation.

   

  Note: The role used to write the generated labeling set to the ``OutputS3Path`` is the role associated with the Machine Learning Transform, specified in the ``CreateMLTransform`` API.

  

  See also: `AWS API Documentation <https://docs.aws.amazon.com/goto/WebAPI/glue-2017-03-31/StartMLLabelingSetGenerationTaskRun>`_  
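
  The workflow described above (generate a labeling set, label it, then import the labels) could be sketched roughly as follows. This is a minimal sketch, not a definitive implementation: the helper names ``start_labeling_set_generation`` and ``import_labels`` are hypothetical, and ``glue_client`` is assumed to be a Glue client such as the one returned by ``boto3.client('glue')``.
  ::

    def start_labeling_set_generation(glue_client, transform_id, output_s3_path):
        """Start a labeling set generation task run and return its TaskRunId."""
        response = glue_client.start_ml_labeling_set_generation_task_run(
            TransformId=transform_id,
            OutputS3Path=output_s3_path,
        )
        return response["TaskRunId"]


    def import_labels(glue_client, transform_id, input_s3_path):
        """After humans have labeled the set, upload the labels for the transform."""
        response = glue_client.start_import_labels_task_run(
            TransformId=transform_id,
            InputS3Path=input_s3_path,
        )
        return response["TaskRunId"]

  Both helpers take the client as an argument, so they can be exercised with a stub client in tests. The transform ID and the S3 paths are values you supply; none are implied by this page.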


  **Request Syntax**
  ::

    response = client.start_ml_labeling_set_generation_task_run(
        TransformId='string',
        OutputS3Path='string'
    )
    
  :type TransformId: string
  :param TransformId: **[REQUIRED]** 

    The unique identifier of the machine learning transform.

    

  
  :type OutputS3Path: string
  :param OutputS3Path: **[REQUIRED]** 

    The Amazon Simple Storage Service (Amazon S3) path where the generated labeling set is written.

    

  
  
  :rtype: dict
  :returns: 
    
    **Response Syntax**

    
    ::

      {
          'TaskRunId': 'string'
      }
      
    **Response Structure**

    

    - *(dict) --* 
      

      - **TaskRunId** *(string) --* 

        The unique run identifier that is associated with this task run.
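
  The returned ``TaskRunId`` only identifies the run; the call does not wait for the labeling set to be written. One way to wait is to poll ``get_ml_task_run`` until the run reaches a terminal status. The sketch below assumes that approach; the helper name ``wait_for_task_run``, the polling interval, and the poll limit are illustrative choices, not values prescribed by the API.
  ::

    import time

    # Statuses after which a task run is not expected to change again.
    TERMINAL_STATUSES = {"SUCCEEDED", "FAILED", "STOPPED", "TIMEOUT"}


    def wait_for_task_run(glue_client, transform_id, task_run_id,
                          poll_seconds=30, max_polls=120, sleep=time.sleep):
        """Poll get_ml_task_run until the run reaches a terminal status."""
        for _ in range(max_polls):
            run = glue_client.get_ml_task_run(
                TransformId=transform_id,
                TaskRunId=task_run_id,
            )
            status = run["Status"]
            if status in TERMINAL_STATUSES:
                return status
            # Not finished yet: wait before asking again.
            sleep(poll_seconds)
        raise TimeoutError(
            f"Task run {task_run_id} did not finish after {max_polls} polls"
        )

  The ``sleep`` parameter is injectable so that the loop can be tested without real delays.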

        
  
  **Exceptions**
  
  *   :py:class:`Glue.Client.exceptions.EntityNotFoundException`

  
  *   :py:class:`Glue.Client.exceptions.InvalidInputException`

  
  *   :py:class:`Glue.Client.exceptions.OperationTimeoutException`

  
  *   :py:class:`Glue.Client.exceptions.InternalServiceException`

  
  *   :py:class:`Glue.Client.exceptions.ConcurrentRunsExceededException`
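
  As an illustration of handling the exceptions listed above, the following sketch retries the call with exponential backoff when ``ConcurrentRunsExceededException`` is raised and lets every other exception propagate. The helper name ``start_with_retry``, the attempt limit, and the delays are assumptions made for this example; whether retrying is appropriate depends on your workload.
  ::

    import time


    def start_with_retry(glue_client, transform_id, output_s3_path,
                         max_attempts=5, base_delay=10, sleep=time.sleep):
        """Retry the start call while too many task runs are in progress."""
        for attempt in range(1, max_attempts + 1):
            try:
                response = glue_client.start_ml_labeling_set_generation_task_run(
                    TransformId=transform_id,
                    OutputS3Path=output_s3_path,
                )
                return response["TaskRunId"]
            except glue_client.exceptions.ConcurrentRunsExceededException:
                if attempt == max_attempts:
                    # Out of attempts: surface the error to the caller.
                    raise
                # Wait base_delay, 2 * base_delay, 4 * base_delay, ... seconds.
                sleep(base_delay * 2 ** (attempt - 1))

  Exceptions such as ``EntityNotFoundException`` or ``InvalidInputException`` are deliberately not caught here, because retrying the same request is unlikely to resolve them.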

  