MLSQL Engine provides REST APIs for running a job script. This makes it easy to integrate with external job schedulers: the scheduler only needs to send a POST request.
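As a rough illustration of that integration, here is a minimal Python sketch of the request an external scheduler might build. The endpoint path `/run/script`, the form fields `sql` and `owner`, and the host and port are assumptions made for illustration; check the engine's REST documentation for the actual names.

```python
from urllib.parse import urlencode
from urllib.request import Request


def build_run_request(engine_url: str, script_text: str, owner: str) -> Request:
    """Build (but do not send) the POST request a scheduler would issue."""
    # Assumption: the engine accepts form-encoded fields named "sql" and
    # "owner" at the path "/run/script". These names are illustrative.
    body = urlencode({"sql": script_text, "owner": owner}).encode("utf-8")
    return Request(
        url=engine_url.rstrip("/") + "/run/script",
        data=body,
        method="POST",
        headers={"Content-Type": "application/x-www-form-urlencoded"},
    )


# Hypothetical engine address and script text.
req = build_run_request("http://127.0.0.1:9003", "select 1 as a as t;", "admin")
```

A scheduler would pass `req` to `urllib.request.urlopen` at each trigger time; the sketch stops short of sending so that it can be run without a live engine.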
However, a built-in job scheduler would make it easier to use MLSQL Stack in production.
The key questions are where to put the scheduler and how to use it.
Where to put it?
Console -> Cluster -> Engine 1
                   -> Engine 2
The best place is Cluster, and the scheduler should be usable in an MLSQL-style way.
You can use it like this:
!crontab */5 * * * * "/project/dir1/dir2/a.mlsql";
Or, going further, a script can schedule itself:
!crontab */5 * * * * self;
-- your script content
select * from hive1 as hiveTable2;
save......
This way the script itself contains the information on how it should be executed.
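To make the two `!crontab` forms concrete, here is a minimal Python sketch of how such a command could be parsed. It assumes a standard five-field cron expression followed by either a quoted script path or the keyword `self`; the names `CrontabCommand` and `parse_crontab` are hypothetical and not part of MLSQL.

```python
import re
from typing import NamedTuple


class CrontabCommand(NamedTuple):
    cron: str    # five-field cron expression, e.g. "*/5 * * * *"
    target: str  # script path, or "self" for the enclosing script


# Five whitespace-separated cron fields, then a quoted path or the word self.
_PATTERN = re.compile(
    r'^!crontab\s+((?:\S+\s+){4}\S+)\s+(?:"([^"]+)"|(self))\s*;$'
)


def parse_crontab(line: str) -> CrontabCommand:
    """Parse one !crontab command line (assumed syntax, see lead-in)."""
    match = _PATTERN.match(line.strip())
    if match is None:
        raise ValueError(f"not a valid !crontab command: {line!r}")
    cron, path, self_kw = match.groups()
    # Normalize runs of whitespace between the cron fields.
    return CrontabCommand(cron=" ".join(cron.split()), target=path or self_kw)


by_path = parse_crontab('!crontab */5 * * * * "/project/dir1/dir2/a.mlsql";')
by_self = parse_crontab("!crontab */5 * * * * self;")
```

When the target is `self`, the caller would have to substitute the path of the script that contains the command.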
How to configure dependencies? The first way looks like this:
set a_script = "/project/dir1/dir2/a.mlsql";
set b_script = "/project/dir1/dir2/b.mlsql";
set c_script = "/project/dir1/dir2/c.mlsql";
!build a_script depends on c_script;
!build b_script depends on c_script;
-- trigger c with crontab.
!crontab */5 * * * * "${c_script}";
The second way is to use it like this:
!build self depends on c_script;
-- your script content
select * from hive1 as hiveTable2;
save......
The system should then build the dependency graph by scanning all scripts.
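The scanning step could look roughly like the following Python sketch. It assumes each script is available as plain text keyed by a name, that `!build X depends on Y;` sits on a line of its own, and that `self` means the script being scanned. Resolving `set` variables to paths is left out, and `build_graph` and `run_order` are hypothetical names.

```python
import re
from graphlib import TopologicalSorter

_BUILD = re.compile(r"^!build\s+(\S+)\s+depends\s+on\s+(\S+)\s*;$")


def build_graph(scripts: dict[str, str]) -> dict[str, set[str]]:
    """Map each script name to the set of scripts it depends on."""
    graph: dict[str, set[str]] = {name: set() for name in scripts}
    for name, text in scripts.items():
        for line in text.splitlines():
            match = _BUILD.match(line.strip())
            if match is None:
                continue
            dependent, dependency = match.groups()
            if dependent == "self":
                dependent = name  # "self" refers to the enclosing script
            graph.setdefault(dependent, set()).add(dependency)
            graph.setdefault(dependency, set())
    return graph


def run_order(graph: dict[str, set[str]]) -> list[str]:
    """Return an order in which every script runs after its dependencies."""
    # TopologicalSorter raises graphlib.CycleError on circular dependencies.
    return list(TopologicalSorter(graph).static_order())


scripts = {
    "a_script": "!build self depends on c_script;\nselect * from hive1 as hiveTable2;",
    "b_script": "!build self depends on c_script;",
    "c_script": "select 1 as a as t;",
}
graph = build_graph(scripts)
order = run_order(graph)
```

With this graph, `c_script` comes first in `order`, and the scripts that depend on it follow in an unspecified relative order.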
How to run?
When the !crontab command is executed, the MLSQL engine forwards it to Cluster, and
the scheduler in Cluster records it and schedules the job as required.
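A minimal Python sketch of the record-and-schedule step on the Cluster side follows. It is an illustration under simplifying assumptions, not the proposed implementation: the `Scheduler` class is hypothetical, only the cron forms `*`, `*/n`, and a plain number are supported, and all five fields must match (standard cron treats day-of-month and day-of-week differently when both are restricted).

```python
from datetime import datetime


def field_matches(field: str, value: int) -> bool:
    """Match one cron field; supports only "*", "*/n", and a plain number."""
    if field == "*":
        return True
    if field.startswith("*/"):
        return value % int(field[2:]) == 0
    return int(field) == value


class Scheduler:
    """Hypothetical in-memory registry of forwarded !crontab commands."""

    def __init__(self) -> None:
        self.jobs: dict[str, str] = {}  # target script -> cron expression

    def record(self, cron: str, target: str) -> None:
        self.jobs[target] = cron

    def due(self, now: datetime) -> list[str]:
        """Return the targets whose cron expression matches this minute."""
        # Cron field order: minute, hour, day of month, month, day of week.
        # Cron uses 0 for Sunday; isoweekday() uses 7, hence the modulo.
        values = (now.minute, now.hour, now.day, now.month, now.isoweekday() % 7)
        return [
            target
            for target, cron in self.jobs.items()
            if all(field_matches(f, v) for f, v in zip(cron.split(), values))
        ]


scheduler = Scheduler()
scheduler.record("*/5 * * * *", "/project/dir1/dir2/c.mlsql")
due_now = scheduler.due(datetime(2019, 4, 23, 10, 15))
due_later = scheduler.due(datetime(2019, 4, 23, 10, 16))
```

A real scheduler would call `due` once per minute and dispatch each returned script to an engine, followed by the scripts that depend on it.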
allwefantasy changed the title from "MPIP: build-in Job Scheduler support in MLSQL Stack" to "MPIP: built-in Job Scheduler support in MLSQL Stack" on Apr 23, 2019