Multiple jobs on oryx

Question

Multiple jobs on oryx

PalanQu opened this issue 7 years ago · comments

Hi sir, I have a problem with the framework, If I want to deploy multiple jobs on the oryx, each job has each Batch Layer, Speed Layer, Serving Layer for a pipline processing . For example If I have a document and I want to do a WordCount and after the WordCount, I want to find a most similar document use the wordcound result, All the algorithm I want to write by myself in Spark, I mean I don't want to use existed algorithm, How can I do?

Sean Owen · Answer 1 · Fri Sep 29 2017 15:05:21 GMT+0800 (China Standard Time)

You need to run separate instances of the layer processes. The layers running the same app would share the same configuration, each. They'd share an app ID and Kafka topics, and different apps would have different IDs and topics.

Questions can go to https://groups.google.com/a/cloudera.org/forum/#!forum/oryx-user

Jiabao Qu · Answer 2 · Fri Sep 29 2017 17:59:38 GMT+0800 (China Standard Time)

Thx, Sir, You mean I need to deploy multiple layers? Could you please show me some simple examples?
Or I have found this , It's this answer is correct?

https://groups.google.com/a/cloudera.org/forum/#!topic/oryx-user/4oNVz4JrAt0

Sean Owen · Answer 3 · Fri Sep 29 2017 18:01:28 GMT+0800 (China Standard Time)

Yes that answer is correct. You simply run a whole different set of layer processes for each app.