Package | Description |
---|---|
marytts.htsengine | |
marytts.signalproc.adaptation |
Packages for speaking style and speaker identity adaptation in Mary TTS
supporting various voice conversion algorithms.
|
marytts.signalproc.analysis |
A collection of analysis algorithms for signal processing.
Important classes are as follows: LpcAnalyser: Linear prediction analysis using autocorrelation appraoch and Durbin recursion LsfAnalyser: Computation of line spectral frequencies (LSFs, or line spectral pairs - LSPs) based on LpcAnalyser EnergyAnalyser: Energy contour estimation with voice activity detection support F0TrackerAutocorrelationHeuristic: An autocorrelation based f0 analysis algorithm extended with heuristic post-processing to reduce voiced/unvoiced errors and f0 doubling/halving problems. |
marytts.signalproc.display | |
marytts.signalproc.effects |
A wrapper package for audio effects in Mary TTS.
|
marytts.signalproc.filter |
Various classes that support filter and filterbank design and filtering operations.
|
marytts.signalproc.process | |
marytts.unitselection.concat | |
marytts.util.data |
Various relatively generic utilities for data generation and buffering.
|
marytts.util.data.audio |
Various relatively generic utilities for audio input/output.
|
marytts.util.data.text |
Various relatively generic utilities for text input/output.
|
Modifier and Type | Class and Description |
---|---|
protected class |
HTSVocoder.HTSVocoderDataProducer |
Modifier and Type | Field and Description |
---|---|
protected DoubleDataSource |
FdpsolaAdapter.input |
Modifier and Type | Class and Description |
---|---|
class |
Signal2EnergyConverter |
Modifier and Type | Method and Description |
---|---|
protected DoubleDataSource |
F0TrackerAutocorrelationDP.preprocess(DoubleDataSource signal) |
protected abstract DoubleDataSource |
F0Tracker.preprocess(DoubleDataSource signal) |
Modifier and Type | Method and Description |
---|---|
F0Tracker.F0Contour |
F0Tracker.analyse(DoubleDataSource signal,
int samplingRate) |
protected FrameBasedAnalyser |
F0TrackerAutocorrelationDP.getCandidateEstimator(DoubleDataSource preprocessedSignal,
int samplingRate) |
protected abstract FrameBasedAnalyser |
F0Tracker.getCandidateEstimator(DoubleDataSource preprocessedSignal,
int samplingRate) |
void |
F0TrackerAutocorrelationHeuristic.pitchAnalyze(DoubleDataSource signal)
Analyse the f0 contour of the given audio signal.
|
protected DoubleDataSource |
F0TrackerAutocorrelationDP.preprocess(DoubleDataSource signal) |
protected abstract DoubleDataSource |
F0Tracker.preprocess(DoubleDataSource signal) |
Constructor and Description |
---|
EnergyAnalyser_dB(DoubleDataSource signal,
int framelength,
int samplingRate) |
EnergyAnalyser_dB(DoubleDataSource signal,
int framelength,
int frameShift,
int samplingRate) |
EnergyAnalyser_dB(DoubleDataSource signal,
int framelength,
int frameShift,
int samplingRate,
int maxSize) |
EnergyAnalyser(DoubleDataSource signal,
int framelength,
int samplingRate) |
EnergyAnalyser(DoubleDataSource signal,
int framelength,
int frameShift,
int samplingRate) |
EnergyAnalyser(DoubleDataSource signal,
int framelength,
int frameShift,
int samplingRate,
int maxSize) |
F0Tracker.CandidateEstimator(DoubleDataSource signal,
Window window,
int frameShift,
int samplingRate,
int nCandidates) |
F0TrackerAutocorrelationDP.CandidateEstimator(DoubleDataSource signal,
Window window,
int frameShift,
int samplingRate)
Track the F0 contour, using the Autocorrelation method.
|
F0TrackerAutocorrelationDP.CandidateEstimator(DoubleDataSource signal,
Window window,
int frameShift,
int samplingRate,
int minF0,
int maxF0)
Track the F0 contour, using the Autocorrelation method.
|
FrameBasedAnalyser(DoubleDataSource signal,
Window window,
int frameShift,
int samplingRate)
Initialise a FrameBasedAnalyser.
|
LpcAnalyser(DoubleDataSource signal,
int framelength,
int samplingRate) |
LpcAnalyser(DoubleDataSource signal,
int framelength,
int frameShift,
int samplingRate) |
LpcAnalyser(DoubleDataSource signal,
int framelength,
int frameShift,
int samplingRate,
int order,
int windowType) |
LpcAnalyser(DoubleDataSource signal,
int framelength,
int frameShift,
int samplingRate,
int order,
int windowType,
float preCoef) |
LpcAnalyser(DoubleDataSource signal,
Window window,
int frameShift,
int samplingRate) |
LpcAnalyser(DoubleDataSource signal,
Window window,
int frameShift,
int samplingRate,
int order) |
LpcAnalyser(DoubleDataSource signal,
Window window,
int frameShift,
int samplingRate,
int order,
float preCoef) |
LpcPitchSynchronousAnalyser(DoubleDataSource signal,
DoubleDataSource pitchmarks,
int windowType,
int samplingRate)
Initialise a PitchLPCAnalyser.
|
LpcPitchSynchronousAnalyser(DoubleDataSource signal,
DoubleDataSource pitchmarks,
int windowType,
int samplingRate,
int framePeriods,
int shiftPeriods)
Create a new PitchLPCAnalyser with a configurable number of pitch periods per frame and pitch periods to shift by.
|
PitchFrameAnalyser(DoubleDataSource signal,
DoubleDataSource pitchmarks,
int windowType,
int samplingRate)
Initialise a PitchFrameAnalyser.
|
PitchFrameAnalyser(DoubleDataSource signal,
DoubleDataSource pitchmarks,
int windowType,
int samplingRate,
int framePeriods,
int shiftPeriods)
Create a new PitchFrameAnalyser with a configurable number of pitch periods per frame and pitch periods to shift by.
|
ShortTermAutocorrelationAnalyser(DoubleDataSource signal,
Window window,
int frameShift,
int samplingRate) |
ShortTermCepstrumAnalyser(DoubleDataSource signal,
int fftSize,
int invFftSize,
Window window,
int frameShift,
int samplingRate)
Initialise a FrameBasedAnalyser.
|
ShortTermLogSpectrumAnalyser(DoubleDataSource signal,
int fftSize,
Window window,
int frameShift,
int samplingRate)
Initialise a FrameBasedAnalyser.
|
ShortTermPhaseSpectrumAnalyser(DoubleDataSource signal,
int fftSize,
Window window,
int frameShift,
int samplingRate)
Initialise a FrameBasedAnalyser.
|
ShortTermSpectrumAnalyser(DoubleDataSource signal,
int fftSize,
Window window,
int frameShift,
int samplingRate)
Initialise a FrameBasedAnalyser.
|
Signal2EnergyConverter(DoubleDataSource signal) |
Modifier and Type | Method and Description |
---|---|
protected void |
SilenceMarker.initialise(DoubleDataSource signal,
int samplingRate,
int width,
int height) |
protected void |
F0Graph.initialise(DoubleDataSource signal,
int samplingRate,
int width,
int height) |
protected void |
EnergyHistogram.initialise(DoubleDataSource signal,
int samplingRate,
int width,
int height) |
protected void |
EnergyGraph.initialise(DoubleDataSource signal,
int samplingRate,
int width,
int height) |
Modifier and Type | Class and Description |
---|---|
class |
FIRFilter.FIROutput |
Modifier and Type | Method and Description |
---|---|
DoubleDataSource |
RecursiveFilter.apply(DoubleDataSource signal)
Apply this filter to the given input signal.
|
DoubleDataSource |
FIRFilter.apply(DoubleDataSource signal)
Apply this filter to the given input signal.
|
Modifier and Type | Method and Description |
---|---|
DoubleDataSource |
RecursiveFilter.apply(DoubleDataSource signal)
Apply this filter to the given input signal.
|
DoubleDataSource |
FIRFilter.apply(DoubleDataSource signal)
Apply this filter to the given input signal.
|
Constructor and Description |
---|
FIRFilter.FIROutput(DoubleDataSource inputSource) |
Modifier and Type | Class and Description |
---|---|
class |
FrameOverlapAddSource
Compute the overlap-add of a framewise-processed input signal, with optional time stretching (in subclasses).
|
class |
FramewiseMerger
A class to merge two audio signals, using pitch-synchronous frames.
|
class |
NaiveVocoder
A naive overlap-add time stretching algorithm without any phase correction; used only for demonstrating the artefacts that
arise from not correcting phase.
|
class |
PhaseVocoder
A phase vocoder implementation for time stretching.
|
class |
Robotiser |
Modifier and Type | Field and Description |
---|---|
protected DoubleDataSource |
PsolaFrameProvider.input |
protected DoubleDataSource |
FDPSOLAProcessor.input |
protected DoubleDataSource |
FramewiseMerger.labelTimes |
protected DoubleDataSource |
FramewiseMerger.otherLabelTimes |
protected DoubleDataSource |
PitchFrameProvider.pitchmarks |
protected DoubleDataSource |
FrameProvider.signal |
Modifier and Type | Method and Description |
---|---|
DoubleDataSource |
EnergyNormaliser.apply(DoubleDataSource signal) |
Modifier and Type | Method and Description |
---|---|
DoubleDataSource |
EnergyNormaliser.apply(DoubleDataSource signal) |
static double |
EnergyNormaliser.determineAveragePower(DoubleDataSource signal)
For a given audio signal, determine the average power.
|
protected void |
FrameOverlapAddSource.initialise(DoubleDataSource inputSource,
int windowType,
boolean applySynthesisWindow,
int frameLength,
int samplingRate,
InlineDataProcessor processor)
To be called by constructor in order to set up this frame overlap add source.
|
Constructor and Description |
---|
EnergyNormaliser(DoubleDataSource reference)
Adapt the amplitudes of a signal such that the average power is the same as the one in the reference.
|
FrameOverlapAddSource(DoubleDataSource inputSource,
int windowType,
boolean applySynthesisWindow,
int frameLength,
int samplingRate,
InlineDataProcessor processor) |
FrameOverlapAddSource(DoubleDataSource inputSource,
int frameLength,
int samplingRate,
InlineDataProcessor processor) |
FrameProvider(DoubleDataSource signal,
InlineDataProcessor processor,
int frameLength,
int frameShift,
int samplingRate,
boolean stopWhenTouchingEnd)
Initialise a FrameProvider.
|
FramewiseMerger(DoubleDataSource inputSource,
DoubleDataSource pitchmarks,
int samplingRate,
DoubleDataSource labelTimes,
DoubleDataSource otherSource,
DoubleDataSource otherPitchmarks,
int otherSamplingRate,
DoubleDataSource otherLabelTimes,
InlineFrameMerger merger)
Create a new merger, creating audio by pitch-synchronous merging of audio frames from a source (aka the "signal") and a
target (aka the "other"), linearly mapping the corresponding times between the two sources.
|
FramewiseMerger(DoubleDataSource inputSource,
int frameLength,
int samplingRate,
DoubleDataSource labelTimes,
DoubleDataSource otherSource,
int otherSamplingRate,
DoubleDataSource otherLabelTimes,
InlineFrameMerger merger)
Create a new merger, creating audio by merging of audio frames at a fixed frame rate, from a source (aka the "signal") and
a target (aka the "other"), linearly mapping the corresponding times between the two sources.
|
NaiveVocoder(DoubleDataSource inputSource,
int samplingRate,
double rateChangeFactor) |
PhaseVocoder(DoubleDataSource inputSource,
int samplingRate,
double rateChangeFactor) |
PitchFrameProvider(DoubleDataSource signal,
DoubleDataSource pitchmarks,
InlineDataProcessor processor,
int samplingRate)
Create a new PitchFrameProvider providing one period at a time.
|
PitchFrameProvider(DoubleDataSource signal,
DoubleDataSource pitchmarks,
InlineDataProcessor processor,
int samplingRate,
int framePeriods,
int shiftPeriods)
Create a new PitchFrameProvider with a configurable number of pitch periods per frame and pitch periods to shift by.
|
PsolaFrameProvider(DoubleDataSource inputSource,
double fixedWindowSizeInSeconds,
double fixedSkipSizeInSeconds,
int fs,
int totalFrames) |
PsolaFrameProvider(DoubleDataSource inputSource,
PitchMarks pm,
int fs,
int psPeriods) |
Robotiser(DoubleDataSource inputSource,
int samplingRate) |
Robotiser(DoubleDataSource inputSource,
int samplingRate,
float amount) |
Modifier and Type | Class and Description |
---|---|
class |
DatagramOverlapDoubleDataSource |
Modifier and Type | Class and Description |
---|---|
class |
BaseDoubleDataSource |
class |
BlockwiseDoubleDataSource |
class |
BufferedDoubleDataSource |
class |
DatagramDoubleDataSource |
class |
MixerDoubleDataSource
Create one DoubleDataSource from a parallel stream of DoubleDataSources.
|
class |
NoiseDoubleDataSource |
class |
ProducingDoubleDataSource |
class |
SequenceDoubleDataSource
Create one DoubleDataSource from a sequence of DoubleDataSources.
|
class |
SilenceDoubleDataSource |
Modifier and Type | Field and Description |
---|---|
protected DoubleDataSource |
BaseDoubleDataSource.inputSource |
protected DoubleDataSource[] |
MixerDoubleDataSource.sources |
Constructor and Description |
---|
BaseDoubleDataSource(DoubleDataSource inputSource) |
BlockwiseDoubleDataSource(DoubleDataSource inputSource,
int blockSize) |
BufferedDoubleDataSource(DoubleDataSource inputSource) |
BufferedDoubleDataSource(DoubleDataSource inputSource,
InlineDataProcessor dataProcessor) |
MixerDoubleDataSource(DoubleDataSource[] inputSources) |
SequenceDoubleDataSource(DoubleDataSource[] inputSources) |
Modifier and Type | Class and Description |
---|---|
class |
AudioDoubleDataSource |
Modifier and Type | Field and Description |
---|---|
protected DoubleDataSource |
DDSAudioInputStream.source |
Constructor and Description |
---|
DDSAudioInputStream(DoubleDataSource source,
AudioFormat format)
From the given DoubleDataSource, create an AudioInputStream of the given audio format.
|
Modifier and Type | Class and Description |
---|---|
class |
ESTTextfileDoubleDataSource |
class |
LabelfileDoubleDataSource |
class |
PraatTextfileDoubleDataSource |
class |
SnackTextfileDoubleDataSource |
class |
TextReaderDoubleDataSource |
Copyright © 2000–2016 DFKI GmbH. All rights reserved.