Distributed Processing¶
You can use the Naïve Bayes classifier algorithm in the distributed processing mode only at the training stage.
This computation mode assumes that the data set is split in nblocks blocks across computation nodes.
Training¶
Algorithm Parameters¶
At the training stage, Naïve Bayes classifier in the distributed processing mode has the following parameters:
Parameter |
Default Valude |
Description |
---|---|---|
|
Not applicable |
The parameter required to initialize the algorithm. Can be:
|
|
|
The floating-point type that the algorithm uses for intermediate computations. Can be |
|
|
Available computation methods for the Naïve Bayes classifier:
|
|
Not applicable |
The number of classes. A required parameter. |
|
\(1/\text{nClasses}\) |
Vector of size |
|
\(1\) |
Vector of size \(p\) that contains the imagined occurrences of features. The default value applies to each vector element. |
Use the two-step computation schema for Naïve Bayes classifier training in the distributed processing mode, as illustrated below:
Step 1 - on Local Nodes¶
In this step, Naïve Bayes classifier training accepts the input described below.
Pass the Input ID
as a parameter to the methods that provide input for your algorithm.
For more details, see Algorithms.
Input ID |
Input |
---|---|
|
Pointer to the \(n_i \times p\) numeric table that represents the current data block. |
|
Pointer to the \(n_i \times 1\) numeric table with class labels associated with the current data block. |
Note
These tables can be objects of any class derived from NumericTable
.
In this step, Naïve Bayes classifier training calculates the result described below.
Pass the Result ID
as a parameter to the methods that access the results of your algorithm.
For more details, see Algorithms.
Result ID |
Result |
---|---|
|
Pointer to the partial Naïve Bayes classifier model that corresponds to the \(i\)-th data block. The result can only be an object of the |
Step 2 - on Master Node¶
In this step, Naïve Bayes classifier training accepts the input described below.
Pass the Input ID
as a parameter to the methods that provide input for your algorithm.
For more details, see Algorithms.
Input ID |
Input |
---|---|
|
A collection of partial models computed on local nodes in Step 1. The collection contains objects of the |
In this step, Naïve Bayes classifier training calculates the result described below.
Pass the Result ID
as a parameter to the methods that access the results of your algorithm.
For more details, see Algorithms.
Result ID |
Result |
---|---|
|
Pointer to the Naïve Bayes classifier model being trained. The result can only be an object of the |