SAIL Publications

Sorted by DateSorted by First Author Last NameClassified by Research Category

Classified by Research Category

text-independentmultipitch analysisacoustic correlatesunderwater acoustic communicationsperceptionmovie event detectionaudio resynthesisdata selectionjoint coding-classificationconcept classificationfeature level fusioninformation change rate (ICR)Supervised i-vectormarine phenomenapitch contour analysisKullback–Leibler distance and divergencesentence segmentationvisionflexible MRI systemLanguage identificationLempel Ziv 78rapid vocal tract shapingsignal reconstructionspoken dialogue systemsarticulatory acousticsfuzzy logicmatrix completion algorithmsmagnetic resonance imaginglanguage learningGlottal source estimationexpressive synthesishumming databaseautomatic assessmentspherical cubic interpolationspeaker identificationautomatic speech recognitionenvironmental soundsreal-time MRILittle CHIMPclustering error rate (CER)user interface human factorsutterance clusteringbehavioral signal processingGaussian Mixture ModelGenderSpeaker stateOceanspolygonal shapesmovie skimmingSimplified i-vectormultiresolution analysisSurveyGlobal level cuesPCAactorRFCacting styleslattice enrichmentSimplified Supervised i-vectorlarge marine expansespronunciation evaluationspeaker model constructionParalinguisticsvocal tractaudio clusteringspeaker adaptationprosodic boundaryprominence detectionAutism diagnostic observation schedule (ADOS)user interactiondyadic wavelet transformRandom accessDetectionspeech productionintelligent sensorsemotionsilence detectionaudio classificationhead motionEngagementonomatopoeia based audio descriptionsauditory gistaffective stateemotion resynthesisSignal processingfacial motionhead motion synthesislattice posterioraudiovisual integrationmaximum entropy modelMarkov chain Monte Carlo (MCMC) methodtexture synthesispathological speechincremental Gaussian mixture cluster modelingelicitation techniquesreading assessmentlexical rulesaudio-visual databasecouple’s therapydialogueGlottal flow derivativevoice sourcechildren's speechsparse representationsmatrix algebraemotional perceptionacoustic adaptationdata-drivencontent based audio retrievalspeech recognitionrepeating patternsspeechToBIaudio information retrievalAgespeech synthesisTrendsaudio ontologypronunciation modelingfricativespsychologyacoustic correlates of prosodyvirtual microphonescustom upper-airway coilsibilantsspeech acousticmusic indexingApproximation methodsacoustic confidence scoressyllablefeature vector selectionBIC-based stopping criterionuncertainty quantificationChallengespeech understandingCognitive and motor loadAutismreasoningmultichannel audioBSPtranslationagentgamesmotion capturedemosematics of emotionsemotion space conceptspeech actarticulatory featuresSpeaker verificationregularizationcategorical prosody modelsturbulenceSVCconstrained reconstructiondata acquisitionprincipal component analysisdata-dependent partitionsuser interfaceaudio indexingmatching pursuitGMM supervectorsOOV detectionfeedbackrecognition of speech variationAutism Diagnosisacoustic discrimination measuresmachine learningchildren’s speech recognitionTILTdialogmultiple instance learningstressSparse matricesattention modelfacial animationdecision level fusionspeech-to-speechAutism diagnostic interview (ADI-R)I-vectoremotion recognitionarticulatory movementsretrievallanguage model adaptationBayes’ decision approachSea measurementsMel frequency filtersHMM based transcriptionpronunciation verificationmutual information estimationsemi-supervised learningmovie content analysisdiscriminative modelingdatabase searchingspeaker diarizationTIMITaudio databaseTVpronunciation variationarticulatory stroketarget trackingknowledge engineeringanimation synthesisvocal tract normalizationLocal level cuespronunciationWireless sensor networkstalking face detectioncontent-based video indexingquestion turnvideo signal processingspoken dialog systemsBayesian information criterion (BIC)formant scalingscreeningnon-parametric mutual information estimationacoustic modelingmusic databasere-ranking N-best listsaudio-video analysisexpressive animationtree-structured bases and wavelet packets (WPs)embodied conversational agentaffective statesAffectsensor networksmulti-sliceaudio representationmicachildreninferencemulti-pass recognitiontree grammarsequence modelshort-segment speaker identificationpolyphony music signaldialogue modelingHMMdialog systemsfield reconstructionprosodic language modelanalysis of articulatory measurementsnatural languageagglomerative hierarchical clusteringstress detectionWavelet packetsnonnative speechmusic information retrievalletter-soundsvocal entrainmentpiecewise polynomial approximationlexical featuresJittervideo content analysisbehavioral informaticsemotional salienceinter-cluster distance measurepart of speech (POS)objective and subjective measuresdiscourse contextnatural language processingrobustnessspiral readoutscouple therapyexpression of emotionsynthesisemotional speechauditory saliency maphuman factorsunderwater acoustic communicationuser response to errortalking avatars driven by speechHMMsHidden Markov Modelportrayal of emotionsmultimodal analysissparse approximationfacial emotion expressiondisfluency detectionspoken language processingspeaker normalizationlocalized search algorithm (LSA)vehicle kinematicsautomatic evaluationspoken languagequery by humminguncertaintyanalysis of audio clustersJoint communication and detectiondistortion measuresaccented Englishsmart roomdyadic interactiontract variablessplit lexiconevaluation of human-computer dialog systemsaffective recognitionchild-computer interactionquery by exampleletter-namesUnderwater communicationdiagnosisphase constraintrelative entropycritical articulationunsupervised adaptationbspselectionunsupervised speaker indexinghuman interactionsupport vector machinessample speaker models (SSM)total variationsimilarity measureprosodic cuesfeature extractionemotion theoriesspeech animationBayesian reasoningemotional assessmentbehavioral signal processing (BSP)user modelingmutual informationacoustic source localizationvocal tract area functionsuser typeintonationcontour trackingClassifier decision fusionquery-by-hummingspeaker clusteringauditory scene recognitionShimmerprominencepitch accentprosody recognitionhidden Markov models (HMMs)machine mediated communicationword boundaryuniversal gender models (UGM)LZ-78information theoryscalable encodingbackground modelingselective agglomerative hierarchical clustering (SAHC)music fingerprintstatistical methodspitch stylizationmotion capture systemcomplexity regularizationvocal tract shapingWeb 2.0 applicationsvoice onset timespontaneous speechAutomatic literacy assessmentn-gram modelshuman-computer interaction (HCI)neutral speechhuman annotationsyntaxAdaptive sensingUnderwater acousticsmultipulse excitationsignal representation for classificationunstructured audio classificationbasis selectioncanonical correlation analysisauditory scene analysismaximum entropy modelingMcGurk effectmelody searchexpressive speechgradient vector flowgeneric modelspattern searchemotional databasesaudio-visual emotion perceptioncross-lingual interactionsminimum probability of error signal representationchild-adult vs. child-machine interactionsexploration-exploitationauditory attentionspeech to speech translationreverse lookupgeneralized likelihood ratio (GLR)syllable recognitionMel-frequency cepstral coefficient (MFCC)accentfilter bank selectionaudio retrievalspoken name recognitionagglomerative hierarchical clustering (AHC)Mel filter bank (MFB)compressed sensing MRIBayesian networkminimum cost tree pruningsensitivity encodingchildren’s speechnoise classificationIntoxication detectiondiscriminant analysis for tensor representationprominent syllable detectionchild engagementpitch period estimationfeature selectionactuated sensor networksrecognition for synthesisvideo event detectiondynamic Bayesian networkuser-centereddialog act taggingagglomerative hierarchical speaker clusteringmultimodalitytongue shape analysisparalinguistic feedbackBehavior signal processinginter-evaluator agreementinformation retrievalenriched latticesimage segmentationreal-time magnetic resonance imaging (MRI)musicuniversal background model (UBM)approximation theorydistributed speech recognitionacoustic featuresemotional speech analysisLatent Perceptual Indexingspeech analysislistsemotionsspeech coarticulationmaximum a posteriori (MAP)linear discriminant analysis (LDA)speech processingemotional speech recognitionfront-end featuresneural networkcompressed sensingclusteringupper airwaychildren's read speechinformation fusionmagnetic resonance images (MRI)data source variationdata representationprosodyphonological classificationnoise source modelscategorizationautomatic reading assessmentHierarchical featurescortical representationhuman behaviorboundary toneobservational studiesspanspeechlinkscareaaaannotation modelingUnspecified


text-independent

multipitch analysis

acoustic correlates

underwater acoustic communications

perception

movie event detection

audio resynthesis

data selection

joint coding-classification

concept classification

feature level fusion

information change rate (ICR)

Supervised i-vector

marine phenomena

pitch contour analysis

Kullback–Leibler distance and divergence

sentence segmentation

vision

flexible MRI system

Language identification

Lempel Ziv 78

rapid vocal tract shaping

signal reconstruction

spoken dialogue systems

articulatory acoustics

fuzzy logic

matrix completion algorithms

magnetic resonance imaging

language learning

Glottal source estimation

expressive synthesis

humming database

automatic assessment

spherical cubic interpolation

speaker identification

automatic speech recognition

environmental sounds

real-time MRI

Little CHIMP

clustering error rate (CER)

user interface human factors

utterance clustering

behavioral signal processing

Gaussian Mixture Model

Gender

Speaker state

Oceans

polygonal shapes

movie skimming

Simplified i-vector

multiresolution analysis

Survey

Global level cues

PCA

actor

RFC

acting styles

lattice enrichment

Simplified Supervised i-vector

large marine expanses

pronunciation evaluation

speaker model construction

Paralinguistics

vocal tract

audio clustering

speaker adaptation

prosodic boundary

prominence detection

Autism diagnostic observation schedule (ADOS)

user interaction

dyadic wavelet transform

Random access

Detection

speech production

intelligent sensors

emotion

silence detection

audio classification

head motion

Engagement

onomatopoeia based audio descriptions

auditory gist

affective state

emotion resynthesis

Signal processing

facial motion

head motion synthesis

lattice posterior

audiovisual integration

maximum entropy model

Markov chain Monte Carlo (MCMC) method

texture synthesis

pathological speech

incremental Gaussian mixture cluster modeling

elicitation techniques

reading assessment

lexical rules

audio-visual database

couple’s therapy

dialogue

Glottal flow derivative

voice source

children's speech

sparse representations

matrix algebra

emotional perception

acoustic adaptation

data-driven

content based audio retrieval

speech recognition

repeating patterns

speech

ToBI

audio information retrieval

Age

speech synthesis

Trends

audio ontology

pronunciation modeling

fricatives

psychology

acoustic correlates of prosody

virtual microphones

custom upper-airway coil

sibilants

speech acoustic

music indexing

Approximation methods

acoustic confidence scores

syllable

feature vector selection

BIC-based stopping criterion

uncertainty quantification

Challenge

speech understanding

Cognitive and motor load

Autism

reasoning

multichannel audio

BSP

translation

agent

games

motion capture

demo

sematics of emotions

emotion space concept

speech act

articulatory features

Speaker verification

regularization

categorical prosody models

turbulence

SVC

constrained reconstruction

data acquisition

principal component analysis

data-dependent partitions

user interface

audio indexing

matching pursuit

GMM supervectors

OOV detection

feedback

recognition of speech variation

Autism Diagnosis

acoustic discrimination measures

machine learning

children’s speech recognition

TILT

dialog

multiple instance learning

stress

Sparse matrices

attention model

facial animation

decision level fusion

speech-to-speech

Autism diagnostic interview (ADI-R)

I-vector

emotion recognition

articulatory movements

retrieval

language model adaptation

Bayes’ decision approach

Sea measurements

Mel frequency filters

HMM based transcription

pronunciation verification

mutual information estimation

semi-supervised learning

movie content analysis

discriminative modeling

database searching

speaker diarization

TIMIT

audio database

TV

pronunciation variation

articulatory stroke

target tracking

knowledge engineering

animation synthesis

vocal tract normalization

Local level cues

pronunciation

Wireless sensor networks

talking face detection

content-based video indexing

question turn

video signal processing

spoken dialog systems

Bayesian information criterion (BIC)

formant scaling

screening

non-parametric mutual information estimation

acoustic modeling

music database

re-ranking N-best lists

audio-video analysis

expressive animation

tree-structured bases and wavelet packets (WPs)

embodied conversational agent

affective states

Affect

sensor networks

multi-slice

audio representation

mica

children

inference

multi-pass recognition

tree grammar

sequence model

short-segment speaker identification

polyphony music signal

dialogue modeling

HMM

dialog systems

field reconstruction

prosodic language model

analysis of articulatory measurements

natural language

agglomerative hierarchical clustering

stress detection

Wavelet packets

nonnative speech

music information retrieval

letter-sounds

vocal entrainment

piecewise polynomial approximation

lexical features

Jitter

video content analysis

behavioral informatics

emotional salience

inter-cluster distance measure

part of speech (POS)

objective and subjective measures

discourse context

natural language processing

robustness

spiral readouts

couple therapy

expression of emotion

synthesis

emotional speech

auditory saliency map

human factors

underwater acoustic communication

user response to error

talking avatars driven by speech

HMMs

Hidden Markov Model

portrayal of emotions

multimodal analysis

sparse approximation

facial emotion expression

disfluency detection

spoken language processing

speaker normalization

localized search algorithm (LSA)

vehicle kinematics

automatic evaluation

spoken language

query by humming

uncertainty

analysis of audio clusters

Joint communication and detection

distortion measures

accented English

smart room

dyadic interaction

tract variables

split lexicon

evaluation of human-computer dialog systems

affective recognition

child-computer interaction

query by example

letter-names

Underwater communication

diagnosis

phase constraint

relative entropy

critical articulation

unsupervised adaptation

bsp

selection

unsupervised speaker indexing

human interaction

support vector machines

sample speaker models (SSM)

total variation

similarity measure

prosodic cues

feature extraction

emotion theories

speech animation

Bayesian reasoning

emotional assessment

behavioral signal processing (BSP)

user modeling

mutual information

acoustic source localization

vocal tract area functions

user type

intonation

contour tracking

Classifier decision fusion

query-by-humming

speaker clustering

auditory scene recognition

Shimmer

prominence

pitch accent

prosody recognition

hidden Markov models (HMMs)

machine mediated communication

word boundary

universal gender models (UGM)

LZ-78

information theory

scalable encoding

background modeling

selective agglomerative hierarchical clustering (SAHC)

music fingerprint

statistical methods

pitch stylization

motion capture system

complexity regularization

vocal tract shaping

Web 2.0 applications

voice onset time

spontaneous speech

Automatic literacy assessment

n-gram models

human-computer interaction (HCI)

neutral speech

human annotation

syntax

Adaptive sensing

Underwater acoustics

multipulse excitation

signal representation for classification

unstructured audio classification

basis selection

canonical correlation analysis

auditory scene analysis

maximum entropy modeling

McGurk effect

melody search

expressive speech

gradient vector flow

generic models

pattern search

emotional databases

audio-visual emotion perception

cross-lingual interactions

minimum probability of error signal representation

child-adult vs. child-machine interactions

exploration-exploitation

auditory attention

speech to speech translation

reverse lookup

generalized likelihood ratio (GLR)

syllable recognition

Mel-frequency cepstral coefficient (MFCC)

accent

filter bank selection

audio retrieval

spoken name recognition

agglomerative hierarchical clustering (AHC)

Mel filter bank (MFB)

compressed sensing MRI

Bayesian network

minimum cost tree pruning

sensitivity encoding

children’s speech

noise classification

Intoxication detection

discriminant analysis for tensor representation

prominent syllable detection

child engagement

pitch period estimation

feature selection

actuated sensor networks

recognition for synthesis

video event detection

dynamic Bayesian network

user-centered

dialog act tagging

agglomerative hierarchical speaker clustering

multimodality

tongue shape analysis

paralinguistic feedback

Behavior signal processing

inter-evaluator agreement

information retrieval

enriched lattices

image segmentation

real-time magnetic resonance imaging (MRI)

music

universal background model (UBM)

approximation theory

distributed speech recognition

acoustic features

emotional speech analysis

Latent Perceptual Indexing

speech analysis

lists

emotions

speech coarticulation

maximum a posteriori (MAP)

linear discriminant analysis (LDA)

speech processing

emotional speech recognition

front-end features

neural network

compressed sensing

clustering

upper airway

children's read speech

information fusion

magnetic resonance images (MRI)

data source variation

data representation

prosody

phonological classification

noise source models

categorization

automatic reading assessment

Hierarchical features

cortical representation

human behavior

boundary tone

observational studies

span

speechlinks

care

aaa

annotation modeling

Unspecified


Generated by bib2html.pl (written by Patrick Riley ) on Fri Jun 16, 2017 23:16:45