Class PairwiseDetector<T extends PairwiseDetectorWorker>

  • All Implemented Interfaces:
    IDetector<T>
    Direct Known Subclasses:
    NGramDetector, VariableNameDetector

    public class PairwiseDetector<T extends PairwiseDetectorWorker>
    extends Detector<T>
    An abstract IDetector implementation that constructs a separate worker for each pair of files in the dataset and runs the workers in parallel. It can be used as a base class for pairwise matching algorithms.

    More advanced implementations should directly implement the IDetector interface.
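
    The pairing behaviour described above can be illustrated with a minimal, self-contained sketch. It does not use this library's classes: PairwiseSketch, PairWorker, buildWorkers and runAll are hypothetical names, files are represented by plain strings, and the sketch assumes unordered pairs (each pair is compared once) and a fixed-size thread pool. The actual scheduling inside PairwiseDetector may differ.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

public class PairwiseSketch {

    /** Hypothetical stand-in for a pairwise worker; it only reports which files it was given. */
    static final class PairWorker implements Callable<String> {
        private final String first;
        private final String second;

        PairWorker(String first, String second) {
            this.first = first;
            this.second = second;
        }

        @Override
        public String call() {
            return first + " <-> " + second;
        }
    }

    /** Builds one worker per unordered pair (i < j), so n files yield n * (n - 1) / 2 workers. */
    static List<PairWorker> buildWorkers(List<String> files) {
        List<PairWorker> workers = new ArrayList<>();
        for (int i = 0; i < files.size(); i++) {
            for (int j = i + 1; j < files.size(); j++) {
                workers.add(new PairWorker(files.get(i), files.get(j)));
            }
        }
        return workers;
    }

    /** Runs every worker on a thread pool and returns the results in submission order. */
    static List<String> runAll(List<PairWorker> workers) {
        ExecutorService pool = Executors.newFixedThreadPool(4); // pool size is an arbitrary choice
        try {
            List<String> results = new ArrayList<>();
            for (Future<String> future : pool.invokeAll(workers)) {
                results.add(future.get());
            }
            return results;
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("Interrupted while waiting for workers", e);
        } catch (ExecutionException e) {
            throw new IllegalStateException("A worker failed", e.getCause());
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) {
        List<String> files = List.of("A.java", "B.java", "C.java");
        runAll(buildWorkers(files)).forEach(System.out::println);
    }
}
```

    Because the number of workers grows quadratically with the number of files, a bounded pool such as the one assumed here keeps thread usage fixed while the pairs are processed.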
    • Constructor Detail

      • PairwiseDetector

        public PairwiseDetector(java.lang.String displayName,
                                java.lang.Class<T> typeArgumentClass,
                                PreProcessingStrategy... preProcessingStrategies)
        Creates an IDetector implementation that automatically builds a worker for each possible pair of the source files passed in.
        Parameters:
        displayName - user-facing display name for the detector
        typeArgumentClass - class object for the generic type of this detector's worker
        preProcessingStrategies - preprocessing strategies to use for this detector; any number may be supplied (varargs)
      • PairwiseDetector

        public PairwiseDetector(java.lang.String displayName,
                                java.lang.String description,
                                java.lang.Class<T> typeArgumentClass,
                                PreProcessingStrategy... preProcessingStrategies)
        Creates an IDetector implementation that automatically builds a worker for each possible pair of the source files passed in.
        Parameters:
        displayName - user-facing display name for the detector
        description - user-facing description for the detector
        typeArgumentClass - class object for the generic type of this detector's worker
        preProcessingStrategies - preprocessing strategies to use for this detector; any number may be supplied (varargs)
    • Method Detail

      • getAbstractPairwiseDetectorWorker

        public T getAbstractPairwiseDetectorWorker(ModelDataItem file1Data,
                                                   ModelDataItem file2Data)
        Creates a new instance of the worker for this implementation.
        Parameters:
        file1Data - ModelDataItem for file 1
        file2Data - ModelDataItem for file 2
        Returns:
        the new worker instance
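
        The constructors take a Class<T> for the worker type, and this method returns a fresh T for two files. One common way to connect those two facts is reflective instantiation. The sketch below shows that pattern only and is an assumption about the approach, not a description of this class's actual implementation. FileData, Worker, NameWorker, Factory, load and describe are hypothetical stand-ins rather than library types, and the sketch assumes the worker class has an accessible no-argument constructor.

```java
public class WorkerFactorySketch {

    /** Hypothetical stand-in for ModelDataItem. */
    static final class FileData {
        final String name;

        FileData(String name) {
            this.name = name;
        }
    }

    /** Hypothetical worker base type; subclasses need a public no-argument constructor. */
    public abstract static class Worker {
        FileData file1;
        FileData file2;

        /** Hands the two files to the worker after it has been constructed. */
        void load(FileData first, FileData second) {
            this.file1 = first;
            this.file2 = second;
        }

        abstract String describe();
    }

    /** Hypothetical concrete worker. */
    public static class NameWorker extends Worker {
        public NameWorker() {
        }

        @Override
        String describe() {
            return file1.name + " vs " + file2.name;
        }
    }

    /** Keeps the worker's Class object so that new instances can be created on demand. */
    static final class Factory<T extends Worker> {
        private final Class<T> workerClass;

        Factory(Class<T> workerClass) {
            this.workerClass = workerClass;
        }

        /** Creates a fresh worker for the given pair of files. */
        T newWorker(FileData first, FileData second) {
            try {
                T worker = workerClass.getDeclaredConstructor().newInstance();
                worker.load(first, second);
                return worker;
            } catch (ReflectiveOperationException e) {
                throw new IllegalStateException(
                        "Worker class needs an accessible no-argument constructor", e);
            }
        }
    }

    public static void main(String[] args) {
        Factory<NameWorker> factory = new Factory<>(NameWorker.class);
        NameWorker worker = factory.newWorker(new FileData("A.java"), new FileData("B.java"));
        System.out.println(worker.describe());
    }
}
```

        Passing the Class object explicitly is needed in this pattern because Java erases generic type parameters at runtime, so a plain T cannot be instantiated directly.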