FDPSOLAProcessor (MaryTTS 5.2 API)

java.lang.Object
- marytts.signalproc.process.VocalTractModifier
- - marytts.signalproc.process.FDPSOLAProcessor

All Implemented Interfaces: InlineDataProcessor

public class FDPSOLAProcessor
extends VocalTractModifier

Field Summary

Fields
Modifier and Type	Field and Description
`protected boolean`	`bBroke`
`protected boolean`	`bLastFrame`
`boolean`	`bSilent`
`protected boolean`	`bWarp`
`protected LEDataInputStream`	`din`
`protected LEDataOutputStream`	`dout`
`protected double[]`	`f0s`
`protected double[]`	`frm`
`protected double`	`frmEn`
`protected int`	`frmSize`
`protected double[]`	`frmy`
`protected double`	`frmyEn`
`protected static int`	`FROM_CODE`
`protected static int`	`FROM_FILE`
`protected static int`	`FROM_TARGET`
`protected double`	`gain`
`protected int`	`halfWin`
`protected ComplexArray`	`hy`
`protected DoubleDataSource`	`input`
`protected AudioInputStream`	`inputAudio`
`protected int`	`inputFrameIndex`
`protected double[]`	`inputVT`
`protected boolean`	`isWavFileOutput`
`protected double`	`localDurDiff`
`protected double`	`localDurDiffSaved`
`protected int`	`lpOrder`
`protected static double`	`MAX_PSCALE`
`protected static double`	`MAX_TSCALE`
`protected int`	`maxFrmSize`
`protected int`	`maxNewFrmSize`
`protected static double`	`MIN_PSCALE`
`protected static double`	`MIN_TSCALE`
`protected VoiceModificationParametersPreprocessor`	`modParams`
`protected int`	`newFftSize`
`protected int`	`newFrmSize`
`protected int`	`newMaxFreq`
`protected int`	`newPeriod`
`protected int`	`newSkipSize`
`protected double[]`	`newVScales`
`protected double`	`nextAdd`
`protected static int`	`NUM_PITCH_SYNC_PERIODS`
`protected int`	`numfrm`
`protected int`	`numfrmFixed`
`protected int`	`numPeriods`
`protected int`	`origLen`
`protected double[]`	`outBuff`
`protected int`	`outBuffLen`
`protected int`	`outBuffStart`
`protected DDSAudioInputStream`	`outputAudio`
`protected String`	`outputFile`
`protected PitchMarks`	`pm`
`protected PsolaFrameProvider`	`psFrm`
`protected double[]`	`py2`
`protected int`	`repeatSkipCount`
`protected double`	`ssFixedInSeconds`
`protected double`	`sumLocalDurDiffs`
`protected int`	`synthFrameInd`
`protected int`	`synthFrmInd`
`protected int`	`synthSt`
`protected int`	`synthTotal`
`protected String`	`tempOutBinaryFile`
`protected double[]`	`tmpvsc`
`protected int`	`totalWrittenToFile`
`protected double`	`tscaleSingle`
`static int`	`TTS_MODIFICATION`
`static int`	`WAVEFORM_MODIFICATION`
`protected double[]`	`wgt`
`protected double[]`	`wgty`
`protected DynamicWindow`	`windowIn`
`protected DynamicWindow`	`windowOut`
`protected double`	`wsFixedInSeconds`
`protected double[]`	`wSynthBuff`
`protected double[]`	`ySynthBuff`
`protected int`	`ySynthInd`

Fields inherited from class marytts.signalproc.process.VocalTractModifier
fftSize, fs, h, maxFreq, p, tmpCount, vtSpectrum

Constructor Summary

Constructors
Constructor and Description
`FDPSOLAProcessor()`
`FDPSOLAProcessor(String strInputFile, String strPitchFile, String strOutputFile, double[] pscales, double[] tscales, double[] escales, double[] vscales)`
`FDPSOLAProcessor(String strInputFile, String strPitchFile, String strOutputFile, double[] pscales, double[] tscales, double[] escales, double[] vscales, boolean isFixedRate)`

Method Summary

Methods
Modifier and Type	Method and Description
`void`	`convertToWav(AudioFormat audioformat)`
`void`	`fdpsolaOnline()`
`double[]`	`getScalesFromTextFile(String strScaleFile)`
`protected void`	`init(int initialisationType)`
`protected void`	`init(int initialisationType, String strInputFile, String strPitchFile, String strOutputFile, double[] pscales, double[] tscales, double[] escales, double[] vscales, boolean isFixedRate)`
`static void`	`main(String[] args)`
`static void`	`mainParametric(String inputWavFile, double[] pscales, double[] tscales, double[] escales, double[] vscales)`
`DDSAudioInputStream`	`process(Datagram[][] datagrams, Datagram[] rightContexts, AudioFormat audioformat, boolean[][] voicings, double[][] pitchScales, double[][] timeScales)`
`DDSAudioInputStream`	`process(double[] x, int[] pitchMarks, AudioFormat audioformat, boolean[] voicings, double[] pitchScales, double[] timeScales)`
`double[]`	`processDatagram(Datagram[] datagrams, Datagram rightContext, AudioFormat audioformat, boolean[] voicings, double[] pitchScales, double[] timeScales, boolean bLastDatagram)`
`DDSAudioInputStream`	`processDecrufted(Datagram[][] datagrams, Datagram[] rightContexts, AudioFormat audioformat, boolean[][] voicings, double[][] pitchScales, double[][] timeScales)` Functionally equivalent to `process(marytts.util.data.Datagram[][], marytts.util.data.Datagram[], javax.sound.sampled.AudioFormat, boolean[][], double[][], double[][])`, but with most of the cruft removed, which should make it easier to modify.
`double[]`	`processFrame(double[] frmIn, boolean isVoiced, double pscale, double tscale, double escale, double vscale, boolean isLastInputFrame, int currentPeriod, int inputFrameSize)`
`double[]`	`writeFinal()`

Methods inherited from class marytts.signalproc.process.VocalTractModifier
applyInline, initialise, initialise, processSpectrum

Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

Field Detail

WAVEFORM_MODIFICATION
```
public static int WAVEFORM_MODIFICATION
```

TTS_MODIFICATION
```
public static int TTS_MODIFICATION
```

input
```
protected DoubleDataSource input
```

inputAudio
```
protected AudioInputStream inputAudio
```

outputAudio
```
protected DDSAudioInputStream outputAudio
```

modParams
```
protected VoiceModificationParametersPreprocessor modParams
```

numfrm
```
protected int numfrm
```

numfrmFixed
```
protected int numfrmFixed
```

lpOrder
```
protected int lpOrder
```

outputFile
```
protected String outputFile
```

tempOutBinaryFile
```
protected String tempOutBinaryFile
```

origLen
```
protected int origLen
```

pm
```
protected PitchMarks pm
```

f0s
```
protected double[] f0s
```

psFrm
```
protected PsolaFrameProvider psFrm
```

wsFixedInSeconds
```
protected double wsFixedInSeconds
```

ssFixedInSeconds
```
protected double ssFixedInSeconds
```

numPeriods
```
protected int numPeriods
```

NUM_PITCH_SYNC_PERIODS
```
protected static int NUM_PITCH_SYNC_PERIODS
```

FROM_CODE
```
protected static int FROM_CODE
```

FROM_FILE
```
protected static int FROM_FILE
```

FROM_TARGET
```
protected static int FROM_TARGET
```

bSilent
```
public boolean bSilent
```

dout
```
protected LEDataOutputStream dout
```

din
```
protected LEDataInputStream din
```

windowIn
```
protected DynamicWindow windowIn
```

windowOut
```
protected DynamicWindow windowOut
```

wgt
```
protected double[] wgt
```

wgty
```
protected double[] wgty
```

frmSize
```
protected int frmSize
```

newFrmSize
```
protected int newFrmSize
```

newPeriod
```
protected int newPeriod
```

synthFrmInd
```
protected int synthFrmInd
```

localDurDiff
```
protected double localDurDiff
```

repeatSkipCount
```
protected int repeatSkipCount
```

localDurDiffSaved
```
protected double localDurDiffSaved
```

sumLocalDurDiffs
```
protected double sumLocalDurDiffs
```

nextAdd
```
protected double nextAdd
```

synthSt
```
protected int synthSt
```

synthTotal
```
protected int synthTotal
```

maxFrmSize
```
protected int maxFrmSize
```

maxNewFrmSize
```
protected int maxNewFrmSize
```

synthFrameInd
```
protected int synthFrameInd
```

bLastFrame
```
protected boolean bLastFrame
```

bBroke
```
protected boolean bBroke
```

newFftSize
```
protected int newFftSize
```

newMaxFreq
```
protected int newMaxFreq
```

outBuffLen
```
protected int outBuffLen
```

outBuff
```
protected double[] outBuff
```

outBuffStart
```
protected int outBuffStart
```

totalWrittenToFile
```
protected int totalWrittenToFile
```

ySynthBuff
```
protected double[] ySynthBuff
```

wSynthBuff
```
protected double[] wSynthBuff
```

ySynthInd
```
protected int ySynthInd
```

frm
```
protected double[] frm
```

bWarp
```
protected boolean bWarp
```

inputVT
```
protected double[] inputVT
```

py2
```
protected double[] py2
```

hy
```
protected ComplexArray hy
```

frmy
```
protected double[] frmy
```

frmEn
```
protected double frmEn
```

frmyEn
```
protected double frmyEn
```

gain
```
protected double gain
```

newSkipSize
```
protected int newSkipSize
```

halfWin
```
protected int halfWin
```

newVScales
```
protected double[] newVScales
```

tmpvsc
```
protected double[] tmpvsc
```

isWavFileOutput
```
protected boolean isWavFileOutput
```

inputFrameIndex
```
protected int inputFrameIndex
```

MIN_PSCALE
```
protected static double MIN_PSCALE
```

MAX_PSCALE
```
protected static double MAX_PSCALE
```

MIN_TSCALE
```
protected static double MIN_TSCALE
```

MAX_TSCALE
```
protected static double MAX_TSCALE
```
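
The names `MIN_PSCALE`, `MAX_PSCALE`, `MIN_TSCALE` and `MAX_TSCALE` suggest lower and upper limits for pitch and time scale factors, although this page gives neither their values nor how the class applies them. If you want to guard your own inputs before passing them in, a clamping helper might look like the sketch below. The class name `ScaleLimits`, its methods, and the bounds `0.1` and `5.0` are all assumptions made for illustration; they are not part of MaryTTS.

```java
import java.util.Arrays;

/** Hypothetical helper, not part of MaryTTS: clamps scale factors to a closed range. */
final class ScaleLimits {

    // Illustrative bounds only; the real MIN_/MAX_ values are not given on this page.
    static final double HYPOTHETICAL_MIN_SCALE = 0.1;
    static final double HYPOTHETICAL_MAX_SCALE = 5.0;

    /** Returns scale limited to the range [min, max]. */
    static double clamp(double scale, double min, double max) {
        if (min > max) {
            throw new IllegalArgumentException("min must not exceed max");
        }
        return Math.max(min, Math.min(max, scale));
    }

    /** Returns a new array with every element clamped; the input array is left unchanged. */
    static double[] clampAll(double[] scales, double min, double max) {
        double[] clamped = new double[scales.length];
        for (int i = 0; i < scales.length; i++) {
            clamped[i] = clamp(scales[i], min, max);
        }
        return clamped;
    }

    public static void main(String[] args) {
        double[] requested = {0.01, 1.5, 10.0};
        double[] safe = clampAll(requested, HYPOTHETICAL_MIN_SCALE, HYPOTHETICAL_MAX_SCALE);
        System.out.println(Arrays.toString(safe));
    }
}
```

Returning a new array rather than modifying the input keeps the caller's requested values available for logging or comparison.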

tscaleSingle
```
protected double tscaleSingle
```

Constructor Detail

FDPSOLAProcessor
```
public FDPSOLAProcessor(String strInputFile,
                String strPitchFile,
                String strOutputFile,
                double[] pscales,
                double[] tscales,
                double[] escales,
                double[] vscales)
                 throws UnsupportedAudioFileException,
                        IOException
```
Throws: UnsupportedAudioFileException, IOException

FDPSOLAProcessor
```
public FDPSOLAProcessor(String strInputFile,
                String strPitchFile,
                String strOutputFile,
                double[] pscales,
                double[] tscales,
                double[] escales,
                double[] vscales,
                boolean isFixedRate)
                 throws UnsupportedAudioFileException,
                        IOException
```
Throws: UnsupportedAudioFileException, IOException

FDPSOLAProcessor
```
public FDPSOLAProcessor()
```

Method Detail

init
```
protected void init(int initialisationType)
```

init
```
protected void init(int initialisationType,
        String strInputFile,
        String strPitchFile,
        String strOutputFile,
        double[] pscales,
        double[] tscales,
        double[] escales,
        double[] vscales,
        boolean isFixedRate)
```

processDecrufted
```
public DDSAudioInputStream processDecrufted(Datagram[][] datagrams,
                                   Datagram[] rightContexts,
                                   AudioFormat audioformat,
                                   boolean[][] voicings,
                                   double[][] pitchScales,
                                   double[][] timeScales)
                                     throws IOException
```
Functionally equivalent to `process(marytts.util.data.Datagram[][], marytts.util.data.Datagram[], javax.sound.sampled.AudioFormat, boolean[][], double[][], double[][])`, but with most of the cruft removed, which should make it easier to modify.

Parameters:
datagrams - array of Datagram arrays, one element per SelectedUnit
rightContexts - array of Datagrams, one element per SelectedUnit
audioformat - audioformat
voicings - array of boolean arrays, matching datagrams
pitchScales - array of double arrays, matching datagrams, pitch modification factors
timeScales - array of double arrays, matching datagrams, duration modification factors

Returns:
modified audio as a DoubleDataSource audio stream

Throws:
IOException - if frames cannot be processed
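
The parameter list above says that `voicings`, `pitchScales` and `timeScales` each match the shape of `datagrams`: one inner array per selected unit. The sketch below shows one way to build such jagged arrays when the same factor should apply everywhere. It uses plain Java only; the class `ScaleArrays`, its methods, and the `framesPerUnit` array (standing in for the inner lengths of a real `Datagram[][]`) are hypothetical names introduced for illustration and are not part of MaryTTS.

```java
import java.util.Arrays;

/** Hypothetical helper, not part of MaryTTS: builds jagged scale arrays of a given shape. */
final class ScaleArrays {

    /**
     * Returns a jagged array whose row i has framesPerUnit[i] entries,
     * with every entry set to the same factor.
     */
    static double[][] uniformScales(int[] framesPerUnit, double factor) {
        double[][] scales = new double[framesPerUnit.length][];
        for (int i = 0; i < framesPerUnit.length; i++) {
            scales[i] = new double[framesPerUnit[i]];
            Arrays.fill(scales[i], factor);
        }
        return scales;
    }

    /** Returns true if scales has one row per unit and each row has the expected length. */
    static boolean matchesShape(double[][] scales, int[] framesPerUnit) {
        if (scales.length != framesPerUnit.length) {
            return false;
        }
        for (int i = 0; i < scales.length; i++) {
            if (scales[i].length != framesPerUnit[i]) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        // Assumed shape: three units holding 3, 1 and 4 datagrams respectively.
        int[] framesPerUnit = {3, 1, 4};
        double[][] pitchScales = uniformScales(framesPerUnit, 1.2);
        double[][] timeScales = uniformScales(framesPerUnit, 0.9);
        System.out.println(Arrays.deepToString(pitchScales));
        System.out.println(matchesShape(timeScales, framesPerUnit));
    }
}
```

With a real `Datagram[][] datagrams`, the inner lengths would come from `datagrams[i].length` instead of a hand-written `framesPerUnit` array.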

process
```
public DDSAudioInputStream process(Datagram[][] datagrams,
                          Datagram[] rightContexts,
                          AudioFormat audioformat,
                          boolean[][] voicings,
                          double[][] pitchScales,
                          double[][] timeScales)
```

process
```
public DDSAudioInputStream process(double[] x,
                          int[] pitchMarks,
                          AudioFormat audioformat,
                          boolean[] voicings,
                          double[] pitchScales,
                          double[] timeScales)
```

processDatagram
```
public double[] processDatagram(Datagram[] datagrams,
                       Datagram rightContext,
                       AudioFormat audioformat,
                       boolean[] voicings,
                       double[] pitchScales,
                       double[] timeScales,
                       boolean bLastDatagram)
```

getScalesFromTextFile
```
public double[] getScalesFromTextFile(String strScaleFile)
```

fdpsolaOnline
```
public void fdpsolaOnline()
                   throws IOException
```
Throws: IOException

processFrame
```
public double[] processFrame(double[] frmIn,
                    boolean isVoiced,
                    double pscale,
                    double tscale,
                    double escale,
                    double vscale,
                    boolean isLastInputFrame,
                    int currentPeriod,
                    int inputFrameSize)
                      throws IOException
```
Throws: IOException

writeFinal
```
public double[] writeFinal()
                    throws IOException
```
Throws: IOException

convertToWav
```
public void convertToWav(AudioFormat audioformat)
                  throws IOException
```
Throws: IOException

mainParametric
```
public static void mainParametric(String inputWavFile,
                  double[] pscales,
                  double[] tscales,
                  double[] escales,
                  double[] vscales)
                           throws UnsupportedAudioFileException,
                                  IOException
```
Throws: UnsupportedAudioFileException, IOException

main
```
public static void main(String[] args)
                 throws Exception
```
Throws: Exception
