Processing Operators¶
The MI Pipelines provides the following operators related to task processing:
Notebook Operator
Python Operator
Shell Operator
Email Operator
Notebook Operator¶
The Notebook operator is often used to process ipynb-type tasks that have been verified and saved in Notebook. Usually, the developed model code files will be saved onto Git, and the Notebook operator can be used to get the code file from the Git Directory operator and execute it. Its typical scenario is to execute Python tasks, run Python code files, and train machine learning models, and the generated model files will be recorded and outputted through the logmodel method of MLflow.
The Notebook operator is often used in combination with the Git Directory operator. For example:
Input Parameters Description¶
Name |
Required/optional |
Type |
Description |
---|---|---|---|
workspace |
Required |
Directory |
Specify the file directory where the code is located, which usually comes from the directory specified by the Git Directory operator. |
entrypoint |
Required |
String |
Specify the name of the entry program file, which should include the path (because there may be files with the same name in different directories). |
requirements_file_path |
Optional |
String |
Specify the file path of the dependent package to be installed. |
env |
Optional |
List |
Specify the list of parameters to be passed. |
Output parameters description¶
Name |
Type |
Description |
---|---|---|
mlflow_model_file_paths |
List |
List of model file paths recorded and outputted by MLflow’s logmodel method. |
Python Operator¶
The Python operator is used to handle Python script tasks. The parameters of the Python operator are divided into fixed parameters and dynamic parameters. The fixed parameters cannot be deleted, while the dynamic parameters can be added, modified or deleted according to the usage needs.
Input Parameters Description¶
The table below lists the fixed parameters of the Python operator.
Name |
Required/optional |
Type |
Description |
---|---|---|---|
workspace |
Required |
Directory |
Specify the file directory where the code is located, which usually comes from the Git Directory operator. |
entrypoint |
Required |
String |
Specify the name of the entry program file, which should include the path. |
requirements_file_path |
Optional |
String |
Specify the file path of the dependent package to be installed. |
Output parameters description¶
The Python operator has no fixed output parameters, and its output parameters can be dynamically increased according to the usage needs.
Shell Operator¶
The Shell operator is used to process Shell script tasks, and its input and output parameters are the same as those of Python operator. See the documentation of Python operator.
Email Operator¶
The Email operator is used to achieve the email alert function.
Input Parameters Description¶
Name |
Required/optional |
Type |
Description |
---|---|---|---|
mail_host |
Required |
String |
IP address or domain name of the email server; e.g. smtp.163.com and smtp.office365.com. |
mail_user |
Required |
String |
Email service user name. |
mail_pass |
Required |
Password |
Password corresponding to the user name. |
sender |
Required |
String |
Sender. |
receivers |
Required |
List |
Recipient list, which can be derived from the user list in the organization. |
content |
Required |
String |
Content of the sent email. |
subject |
Required |
String |
Subject of the sent email. |
Output parameters description¶
Name |
Type |
Description |
---|---|---|
status |
String |
Email sending status. |
content_out |
String |
Email content. |