The Structure of Speech
Processing speech is complex:
Environment can interfere with speech signal
Variability in speech signal
No one-to-one correspondence between the acoustic stimuls and the speech sounds we hear.
How do recognize sounds in a way so that we perceive hearing a stable set of phonemes?
Articulatory Phonetics
the study of the pronounciation of speech sounds
For Consonants:
Place of articulation
Bilabial: lips ([b], [p], [m])
Alveolar: alveolar ridge ([d], [t], [s], [z], [n], [l])
Velar: back of the mouth ([k], [g], [ng])
Manner of articulation
Fricatives: obstruction of airflow, but some airflow ([f], [s], [z])
Stops: obstruction of airflow and then release ([b], [p], [k])
Affricate: Stop followed by fricative ([ch], [j])
Voicing
Voiced vs voiceless: (z/s, b/p, v/f)
Vowels
Articulatory gestures are mainly associated with the position of the tongue
Part of tongue (pit, putt, put)
Height of tongue (pit, pet, pat)
Acoustic Phonetics: Studying the acoustic signature of speech sounds
Problem: No parallel between phonetic segments and letters
How do we study acoustic energy of speech sounds?
Spectograms: representations of sound frequency by time
Interpreting spectograms
Formants: Bands representing sounds occuring at particular frequency levels
Aspects of Formants
Formant Transitions: Changes in the formant frequency
Steady state: No changes in formant frequency
Acoustic properties of the speech signal
Parallel transmission
The speech signal is continuous
Context-conditioned variation
The spectographic appearance of a phone is related to the context and not to that particular phone in other contexts
Coarticulation: We are producing more than one speech sound at the same time
The problem of invariance: There is no one-to-one correspondence between acoustic cues and perceptual events
The perception of isolated speech segments
Categorical perception
Can we discriminate between two sounds/colors/etc. that are very similar?
In order to process speech, we need to impose categorical identification on the incoming sounds.
Test of categorical perception: /ba/ vs. /pa/
Lisker & Abramson (1960)
Synthesized speech sounds
Varied VOT from -0.15 to +0.15 secs
Does categorical perception hold for nonspeech sounds as well?
Mattingly, Liberman, Syrdal & Halwes (1971)
Synthesized speech syllables:
Only presented the second formant, or reversed the sound, or removed the transition.
Found no categorical perception.
Suggests that the formant transitions are important for categorical perception
What about vowels?
Vowel perception is continuous and noncategorical
Why? longer duration of cues
The role of attentional processes
Mann, Madden, Russell, & Liberman (1981).
Binaural procedure
Base presented to one ear, transitions presnted to other ear.
The conditions to process speech in this phonetic mode depend on attention and not just the acoustic cues
The perception of continuous speech
Acoustic structure varies greatly as we speak.
Pollack and Picket (1964)
Processing words out of context
Words were identified correctly only 47% of the time
Acoustic information alone may not be enough to identify speech sounds. We need context
The role of Semantic and Syntactic Features
Millar & Isard (1963).
Present three different types of sentences
1) grammatical
2) anamalous, preserving grammatical word order
3) ungramatical
1) Accidents kill motorists on the highways
2) Accidents carry honey between the house
3) Around accidents country honey the shoot
Illusions
Miller (1956) Processing sounds through low-pass filter
Pooh kluss free soub eatwull size
Two plus three should equal five
Warren (1970)
Phonemic Restoration
The state governors met with their respective legi*latures convening in the capital city.
Warren & Warren (1970)
It was found that the *eel was on the axle
It was found that the *eel was on the show
It was found that the *eel was on the orange
It was found that the *eel was on the table
Acoustic Phonetics lab review
da ta ba pa dee dew day daw doe
A) Is there a definable break between the consonant and the vowel? (i.e. can you select a portion of the consonant so that you can hear the consonant, but not the vowel?)
Coarticulation:
We produce more than one speech sound at a time
B) Identify the difference in Voice Onset Time (VOT) between "da" and "ta" and between "ba" and "pa".
C) How do the first and second formats differ across the sounds "dee, dew, day, daw, and doe"? Is there any feature of the "d" sound that is consistent across the sounds?
Context conditioned variation
: the spectographic appearance depends on the speech context
2) My dog is lying under the bed
A) Can you identify the breaks between words? How much time elapses between words?
B) How does the "d" differ in "dog", "under", and "bed"?
Problem of Invariance: no one to one correspondence acoustics and phonetics
The Motor Theory of Speech Perception
Perception proceeds by reference to production
Place of articulation cued by the eyes
Manner of articulation cued by the ear
Transformation of acoustic signal to intended phonetic gesture is done in an automatic manner
A solution to the problem of invariance.
Evidence for the Motor theory:
What is the link between perception and production?
Teaching students foreign languages: Articulating new sounds help students hear them.
The McGurk Effect: Using articulatory knowledge during speech perception
speaker's lips say [ga], sounds says [ba], is perceived as [da]
The TRACE model of Speech Perception
(McClelland and Elman)three different interacting processing levels
Feature level: nasality, acuteness, diffuseness etc.
Phonemes: set of all phonemes in english
words: set of words known

The role of activation in the TRACE model
Activation of higher order units can also activate lower units
inhibition occurs between units at the same level
Advantage of the model:
Top down and bottom up processing
Uses context to help figure out the word
Coarticulation between words:
Foolish Capes
Christmas Capes
people hear a /t/ with /sh/ but a /k/ with /s/.
Articulatory differences influence perception of the initial phoneme of the second word
Elman and McClelland presented
FooliX capes
ChristmaX capes.
Found coarticulatory effects that people still heard the /k/ or the /t/.
Results similar to their model, indicates that there is top-down processing and the activation of word units influenced their interpretation of the t or k.
Perception of Written Language
Why did writing develop so late?
From a developmental point of view, recognizing printed word is the central problem of reading.
Questions:
How long does it take to identify a word?
Is identifying words effortful and the rest of the reading process automatic?
Are words identified by accessing the sound and then the meaning?
Are letters in words processed serially or are words processed as wholes?
How much does context affect the process of word identification?
How long does it take to identify a word?
How would you do an experiment to identify a word?
Response-time method
Display a word: How long for it to be said?
400 msecs w/practice
Categorization task.
Is this an animal? "Dog", "Bacteria", "Rose", "Stone"
For common instances, it takes about 700 msecs, but longer for "rose" and "bacteria"
Lexical Decision Task
Is this a word?
Simpler and faster than categorization
Accessing some knowledge.
Brief Presentation Methods
Give subjects only a short time to process the word
Flash a word for only 60msec
Masking
Estimates from Reading Text
A college student reads about 300 words per minutes (5 words/sec=~200 msecs per word
Problems with reading studies
Physiological Studies
Evoked potentials: Well defined peaks in response to a word stimulus at 150-200msecs
Summary: How long does it take to identify a word?
Reaction Time studies: 400 msecs
Brief Presentation studies: > 60 msecs
Reading studies: 200 msecs
Physiological studies: 150- 200 msecs
What factors cause variation in identifying a word?
Coat vs. Cove:
Lexical decision task there is a difference of 100msecs
Naming times, the difference is only 30 msecs
The term "word identification" can mean a lot of things
Is word processing automatic?
What is taking up the time for doing the reading?
What do I mean by automatic?
unaware of the process
not under conscious control
takes no processing capacity
Identifying the meanings of words is a rapid process for the skilled reader
Is processing words unconscious?
Priming study (Meyer & Schvaneveldt, 1971)
DOG #### CAT
FAN #### CAT
Lexical Decision 30 to 50 msecs faster for CAT if DOG is shown than if FAN is shown
Is processing words automatic?
Stroop effect. the name and meaning of words are processed even when the reader does not want to process the word.
Does word identification take processing capacity?
Word identification takes only about 60 msecs of processing time,
Reading takes about 200 msecs per word
Identification is not a major part of the mental processing needed for reading
Are letters processed serially or are words processed as wholes?
Healy (1980) T-circling task
Evidence for reading units larger than letters
Number of missed ts ___ /40 = Percent errors on t
Number of missed thes _____/8 = Percent error in the
What percent of all the errors were made on the word the?
Number of missed thes / Number of missed ts
Why do we get the t-circling effect?
High-frequency words
Related effects:
t circling in scrambled text
proofreaders missing silent "e"
circling "f" in "of"
The Word Superiority Effect
Do we process words serially letter-by-letter?
Cattell (1886)
Presented words or letters very briefly
Subjects were able to report words better than letters.
But, he
did not control for guessing
did not control for persistence of iconic memory
Reicher (1969)
Pattern mask to control the stimulus presentation time.
Changed the task to control for guessing
Subjects were better at picking out letters if they were in the context of words than if they were alone or if they were in a non-word
Is there a word superiority effect for psuedowords (mave)?
Are words then just visual templates?
Is there a detector for a particular pattern for each word?
dog, DOG, dOg, dog, dog, dog, dog, dog, dog dog, dog,
We can't have a template that would match each possible typeface that we would encounter.
We are not always processing words as "whole words"
A model of word recognition
McClelland & Rumelhart (1981) Interactive Activation model
Detectors at three levels
Processing occurs in parallel
There is interaction among the levels, with activation flowing from detectors to other detectors

Eyemovement studies of reading.
Leisure reading: about 280 words per minute
What can our eye movements tell us about reading?
Eye Tracking device
Presentation on a computer screen
Position of eye is monitored
Eye position is mapped on to position of text on screen
Fixations
Rest periods
150-500 msecs
Saccades
Rapid eye movements taking 20-35 msecs
Ballistic movement
Typically 7 to 9 character spaces with each saccade
No visual information during saccade
Regressions
Saccades that move backwards
10-15 percent of all saccades
Visual Acuity
Foveal area subtends about 2 degrees of visual angle around fixation point
Parafoveal area subtends 10 degrees of visual angle arounnd fixation point
Span of fixation
What is the size of the area from which a reader picks up information?
Moving Window technique (Rayner, 1975)
1) How large does a window have to be to allow normal
reading?
31 Characters (15 on each side)
2) What peripheral information is being used by the
reader?
Manipulate
similarity of word shape
word boundaries
semantic relatedness
Measure fixation duration
Subjects show greater fixation durations with more visual and semantic inconsitencies
The captain granted the pass in the afternoon
The captain guarded the pass in the afternoon
The captain gnarbed the pass in the afternoon
The captain pmarved the pass in the afternoon
We use 7-12 character spaces for visual information (word shape, word boundaries)
We use 1-6 character spaces for semantic information
Conclusion: We do use information in our periphery, although we can use it more for visual information than for semantic informationAt what level are we processing a sentence?
The player caught the baseball
Syntactic, Semantic, Pragmatic levels
How do the levels interact?
Syntactic Processing
Parsing: Assigning linguistic categories to the surface structure of a sentence
The player caught the baseball
S -> NP+VP
NP -> det + (adj) + N
VP->V+NP
Parsing as problem solving or decision making.
How do we deal with syntactic ambiguity
They are flying planes
John drove down the street in the car
Yet, we don't make many problems parsing. Why?
Three possible ways of parsing
Construct all logically possible syntactic structures
Keep one structure, but if it doesn't work, go back and try another
Hold information in short-term memory until there is enough evidence to rule out all but one structure
What are the problems with each approach?
Immediacy principle vs. Wait and see
Immediacy: When we see a word, we access, its meaning and fit it into the parse tree
Wait and See: We wait on interpreting a word until it is clear where it should fit
They are eating apples
However, immediacy can lead to errors in parsing
The florist sent the flowers . . .
. . .was very pleased
The florist who was sent the flowers was very pleased
Parsing strategies
What type of decisions are being made?
Late Closure strategy
The reader attaches new items to the phrase or clause currently being processed
Tom said that Bill had taken the cleaning out yesterday
1) Since Jay always jogs a mile seems like a short distance to him
2) Since Jay always jogs a mile this seems like a short distance to him
Minimal Attachment strategy
We prefer to attach new items into the phrase marker being constructed using the fewest syntactic nodes consistent with the rules of the language.
Ernie kissed Marcie and her sister ...
1) The girl knew the answer by heart.
2) The girl knew the answer was wrong.
We tend to take the answer as the direct object of the verb knew, but have to reinterpret in sentence 2 because the answer is part of a new clause
Evidence for Late Closure and Minimal Attachment strategies
1) Since Jay always jogs a mile seems like a short distance to him
2) Since Jay always jogs a mile this seems like a short distance to him
Word by word processing
Eyemovement studies
Longer fixations on disambiguating words, or the succeeding word, or regressions from those words.
Question: Do we deal with syntactic ambiguity in the same way as lexical ambiguity?
Do lexical (semantic) features interact with parsing?
Tyler and Marslen-Wilson (1977)
1) If you walk too near the runway, landing planes is...
2) If you walk too near the runway, landing planes are...
3) If you've been trained as a pilot, landing planes is ...
4) If you've been trained as a pilot, landing planes are...
Latencies are shorter for appropriate words rather than inappropriate words
Evidence for context helping to assign structural meaning to the sentence
Comprehending sentences in context
Parsing produces a syntactic structure. This permits a basis for determining the literal meaning of the sentence.
But what does a sentence mean?
Can you close the door?
We are fighting an uphill battle
He flew through his homework
Encyclopedias are goldmines
How do we use communicative context to understand sentences?
Conventions
Linguistic communication takes place within a context of shared assumptions about communication
Grice's maxims
conventions provide ground rules for successful conversation.
Joe: Are you going to the big dance tonight?
Sue: Didn't you hear that Billy Smith will be there?
Grices maxims:
All speakers adhere to the "cooperative principle" by following four maxims
Maxim of Quantity: Make your contribution as informative as is required, but not more so, for the current purpose of the exchange.
Maxim of Quality: Do not say what you belive to be false or that for which you lack adequate evidence.
Maxim of Relation: Say only what is relevant for the current purposes of the conversation
Maxim of Manner: Be brief, but avoid ambiguity and obscurity of expression.
Violating the conventions:
Steve: Jack is meeting a woman for dinner tonight.
Susan: Does his wife know?
Steve: Of course she does. The woman he is meeting is
his wife.
John went fishing. His boat sank.
What happens when you violate a maxim?
Speech acts.
Utterances that do not communcate much information but serve as an action.
I promise, ...... I'm sorry ... I'm happy for you
Types of speech acts:
Constatives: Expressing beliefs to form beliefs in the
listener
I deny that.
Directives: Expressing interest in the listener's future
action:
I asked you to do it.
Commissives: speaker is obligated to do something:
I promise to take out the garbage.
Acknowledgements: expresses feelings for the listener
I'm happy for you
Indirect speech acts
Communicative meaning does not correspond to the intended meaning
Question the ability of the listener
Can you close the window?
Question the listener's willingness
Will you close the window?
Indicate the reason why such an action needs to be done
It's getting windy in here
But ...a speech act has to meet certain conditions for it to be judged as appropriate
How do we comprehend speech acts?
Searle's stages:
1) listener extracts the literal meaning of the sentence
2) listener decides whether the literal meaning is what the speaker intended
3) Listener computes indrect meaning based on communicative conventions and direct speech act.
Testing Searle's model
Clark and Lucy (1975)
Used negation on the literal and the non-literal meaning
Subject's task matching pictures to sentences
1) Can you open the door?
postive in direct and indirect meaning
2) Must you open the door?
positive direct negative indirect
3) Why not open the door?
negative direct positive indirect
Positive requests had faster times than negative requests and requests in negative form to even longer.
Evidence for computing literal meaning of the sentence first.
Gibbs (1979)
Processing indirect speech acts in context
Must you open the window?
Mrs. Smith was watering her garden one afternoon. She saw the housepainter was pushing a window open. She didn't understand why he needed to have it open. A bit worried she went over and poiltey asked. "Must you open the window?"
One morning John felt too sick to go to school. The night before he and his friends got very drunk. Because of this he caught a bad cold. He was lying in bed when his mother stormed in. When she started to open the window John groaned, "Must you open the window?"
Then flash:
Need you open the window?
Do not open the window.
Indirect requests took longer in isolation but...
Subjects faster at identifying indirect requests in context than literal meaning.
Evidence that we don't compute a direct meaning if there is enough context.
Metaphor
We are fighting an uphill battle
He flew through his homework
He is drifting aimlessly
He is tied up at work
Arguments: = war
He attacked every week point in my argument
His criticisms were right on target.
He shot down all of my arguments.
If you use that strategy, he'll wipe you out.
Time:= money
You're wasting time
How do you spend your time?
I've invested a lot of time in her.
I can't spare the time.
How often do we use metaphors?
MacNeil/Lehrer News Hour: one metaphor every 25 words
What constitutes a metaphor?
Billboards are warts on the landscape
Topic/Tenor=Billboards
Vehicle= warts
Ground= Implied similarity between the tenor and vehicle
We use the tenor and vehicle to infer the ground
What is the tenor, vehicle and ground of these metaphors?
We hit a roadblock in our work
The thought slipped my mind like a squirrel behind a tree.
Oranges are the baseballs of the fruit lover.
He's as slow as a thumbtack.
Dogs are animals
Why do we use metaphors rather than literal statements?
Words are discrete while human experience is continuous.
Metaphors give us a way of describing information that would otherwise be hard to express
How do we comprehend metaphors?
Verbrugge and MCarrell (1977) Retention of information from metaphors.
Present metaphors, test recall of sentence by giving tenor, vehicle or ground.
Skyscrapers are the giraffes of the city
Give: "skyscrapers", or "giraffes", or "are very tall compared to surrounding things"
Results. Ground just as effective, as tenor and vehicle in recall.
We identify the underlying similarity of relations between tenor and vehicle.
Do we process the literal meaning of the metaphor?
With no context, metaphors take longer to process
With context, metaphors are processed just as fast as normal sentences
Do we sometimes not process the metaphorical meaning?
Subjects decided if a sentence is true.
All jobs are jails
Found that when metaphor was available in sentences that was false, response times were longer.
Indicates we can not ignore metaphors even when they are irrelevant to our task.
How do we process metaphors?
We retrievel lexical representation.
Determine what category they apply to and try to apply it
The pianist is a butcher
The surgeon is a butcher
Sermons are like sleeping pills
My husband is a teddy bear
Summary:
For both metaphors and speech acts, we comprehend the sentences directly, not interpretting the literal meaning.
Memory for sentences
What is remembered after exposure to a sentence?
Meaning vs. Surface Form
Strohner and Nelson 1974. Children aged two to three interpret sentences:
1) The cat chased the mouse.
2) The mouse was chased by the cat.
3) The mouse chased the cat.
4) The cat was chased by the mouse.
All interpret as "cat chased mouse"
By age 4 or 5 children interpret all four sentences correctly.... although sentences 3 and 4 take longer to process.
Does this happen for adults as well?
Task: Paraphrase the sentence
1) John dressed and had a bath.
2) John finished and wrote the article on the weekend.
3) Don't print that or I won't sue you.
Results in "If you print that I'll sue you
60 percent of people normalized perverse sentences
If asked if there was any difference between what they said and the original sentence 53% said no difference.
47% of the time they justified it by saying that they knew what the sentence said, they just reworded it so that it was less clumsy.
A reality principle. "What is described in the discourse should be sensible"
Recognition memory for sentences
Fillenbaum (1966) Present long list of unrelated sentences
Present: 1) The window is not closed
Which did you see?
a) The window is closed
b) The window is not open
c) The window is open
Open and closed are "contradictories", does it work for "contraries"
The man was tall
The man was not short.
Maybe it is an effect of the experimental context
Wanner (1974) gave test on the instructions themselves
When you score the results, do nothing to your correct answer, but mark carefully those answers which are wrong.
Recognition test on
your correct vs. correct your
or
mark carefully vs. carefully mark
Subjects 100% recognize on your correct
Subjects 50% recognize on mark carefully.
People retain meaning and not surface form, even if they don't know if they are going to be tested on the sentence
Time course of retention of sentence information
How long do we hold surface and meaning?
Sachs (1967)
Played tape recorded passages.
Interrupt at various intervals.
Variables:
Retention interval
Type of sentence
Original
2 that changed wording but not meaning
1 changed wording and meaning
Results:
If tested immediately: retained sentence wording and meaning
If tested 40 syllables (12.5 secs) later: wording lost meaning stays
If tested 80 syllables later: meaning still remains
Replicated for Visual presentations and American Sign Language
How do we retain information in the real world?
Keenan, MacWhinney and Mayhew (1977) retention of surface and meaning in natural conversations
Recorded researchers lunch discussions
Tested recognition memory
Interactional content : Jokes, Figures of speech, insults
Some utterances convey information to the listener, others convey attitude of the speaker toward the listener. These are high in interactional content
Results:
High retention of both surface and meaning for sentences with high interactional content
No retention of surface and low retention of meaning for sentences with low interactional content
Second test.
Presented sentences in a context free environment. No effect for interactional content.
Conclusion. It is the pragmatic factors of the conversational context that results in the retention.
The role of inferences in sentence memory.
Inference: A proposition, concept, drawn by the reader
Inferences are drawn through the process of elaboration
Example of inferences:
1) Last Christmas Emily went to a lot of parties.
This Christmas she got very drunk again.
2) Last Christmas Emily became absolutely smashed.
This Christmas she got very drunk again.
Second sentence takes longer to read in first passage.
Inferences and false recognition errors
What is from the text, what is an inference?
Bransford, Barclay and Franks (1972)
1) Three turtles rested beside a floating log, and a fish swam beneath them.
2) Three turtles rested on a floating log, and a fish swam beneath them.
Surprise memory test several minutes later.
Three turtles rested on a floating log, and a fish swam beneath it.
Three turtles rested beside a floating log, and a fish swam beneath it.
Subjects who had seen the on were more likely to falsely recognize the test sentence.
John was trying to fix the bird house. He was looking for the nail when his father came out to watch him and to help him do the work.
False recognition on:
John was using the hammer to fix the bird house when his father came out to watch him and to help him do the work.
Speed of processing of inferences
All the mugger wanted was to seal the woman's money. But when she screamed, he stabbed her with his weapon in an attempt to quiet her. He looked to see if anyone had seen him. He threw the knife into the bushes, took her money, and ran away.
People infer knife before getting to the last sentence,
reading time of last sentence the same if the second sentence had knife or weapon.
But, if the second sentence is:
But when she screamed, he assaulted her with his (knife/weapon) in an attempt to quiet her.
Last sentence takes longer to read when knife is not explicitly mentioned in the second sentence.
Conclusion: We draw elaborative inferences only when the context is clear
Inferences in advertising
My doctor said Mylanta
4 out of 5 dentists recommend dentine for their patients who chew gum
Breck Shampoo, now with vitamin E!
My opponent voted against the child safety law act.
Harris (1977) are statements were true, false, or indeterminate
"Wouldn't it be great", asks the mother, "if you could make him coldproof? Well you can't. Nothing can do that (boy sneezes). But there is something that you can do that may help. Have him gargle with Gargoil Antiseptic. Gargoil can't promise to keep him cold-free, but it may help him fight off colds. During the cold-catching season, have him gargle twice a day with full strength Gargoil. Watch his diet, see he gets plenty of sleep, and there's a good chance he'll have fewer colds, milder colds this year."
Test : Gargling with Gargoil Antiseptic helps prevent colds.
Subjects can't distinguish between claims that are asserted vs. those that were implied.
Improving sensitivity to advertisements.
Harris (1977) some improvement in performance when subjects are explicitly warned about implied claims.
Bruno (1977): 20 minute training session, improved performance for both adults and school children.
How do we represent the gist in memory?
Propositions: Unit of meaning representing a phrase or clasue
a set of arguments joined by a predicate
The teacher taught the student
It was the teacher who taught the student
The student was taught by the teacher
taught(teacher, student)
The dog ate the large bone and then wagged his tail
p1: ate (dog, p2)
p2: (large, bone)
p3: and (p1, p4)
p4: wagged(his, tail)
How do we process information into propositions?
Comprehension and memory of connected discourse
Connected discourse: spoken and written: stories, class
lectures, manuals, textbooks ...
How is discourse organized and how does that affect our comprehension?
Discourse coherence and comprehension strategies
What is important for comprehension of connected discourse?
It is more than just understanding the meaning of each sentence
Carlos arranged to take golf lessons from the local professional. His dog, a cocker spaniel, was expecting pups again. Andrea had the car washed for the big wedding. She expected Carlos to help her move into her new apartment.
John bought a cake at the bake shop. The cake was chocolate with white frosting, and it read "Happy Birthday, Joan" in red letters. John was particularly pleased with the lettering. He brought it over to Greg's house, and together they worked on the rest of the details.
What makes the second paragraph easier to understand?
John bought a cake at the bake shop. The birthday card was signed by all of the employees. The party went on until after midnight.
Jeff went fishing. The boat sank, but he caught a big one.
Local/microstructure vs. global/macrostructure
Microstructure: relationship between individual sentences
Macrostructure: Relationship of sentences to our knowledge
Cohesion
Coherence is based on semantic relationships between sentences
"The range of possibilities that exist for linking something with what has gone before" (Halliday & Hassan, 1976)
Order these sentences:
A) He had eaten at McDonald's yesterday
B) Where should he eat?
C) Maybe Pizza Hut would deliver.
D) George needed to find some food.
E) However, he was tired of burgers.
What strategies did you use?
Categories of Cohesion
Reference, Substitution, Ellipsis, Conjuntion, Lexical
Anaphoric vs. cataphoric reference
Anaphoric reference: Current expresssion relates to an expression made earlier
cataphoric reference: Expression refers to future expression
This is how you do it. You let the herbs dry and then grind them up in a food processor.
I'm going to let you in on a secret. I hid the treasure on top of Deadman's bluff.
Strategies to establish coherence
Clark and Haviland: Distinction between given and new information
Given information: Information the author/speaker assumes the reader/listener already knows
New information: Information the reader/listener is assumed not to know.
Goal of giving information to someone is to provide both new information and old information.
It was Steve who robbed the bank.
It was the bank that Steve robbed.
The given/new strategy
1) Identify the given and new information in the current sentence
2) Find an antecedent in memory for the given information
3) Attach the new information to this spot in memory
Experimental approach: How long does it take to understand a sentence given what was said before?
Direct Matching Strategy
We got some beer out of the trunk.
The beer was warm.
Zak hopped into the waiting car and sped around the corner. He swerved to avoid the parked car and smashed into a building.
Zak hopped into the waiting car and sped around the corner. The old car lost a whell and smashed into the building.
Direct matches made on concepts not on words. (Overlap of arguments in the propositions)
p1: took (we, beer, p2)
p2: (from trunk)
p3: was(beer, warm)
Bridging inferences
We got the picnic supplies out of the trunk.
The beer was warm.
Last christmas John went to a lot of parties.
This Christmas he got very drunk again.
vs.
Last Christmas John became absolutely smashed.
This Christmas he got very drunk again.
Reinstating old information
Is the information in the background or the foreground?
I am trying to find a black dog.
He is short and has a dog tag on his neck that says Fred.
Yesterday that dog bit a little girl.
She was scared but she wasn't really hurt.
Yesterday a black dog bit a little girl
It got away and we are still trying to find it.
He is short and has a dog tag on his neck that says Fred.
She was scared but she wasn't really hurt.
Reinstatements: If information is no longer in the foreground, the reader must do a reinstatement.
Reinstatment requires retrieval of information from long term memory.
Comprehension time increases when subjects have to make reinstatements.
Coherence summary
If there is direct overlap we use: Direct match
Else
If there is some way to use semantic knowledge to find a relationship between current sentence and previous sentence by making a bridging inference
Else
Go back and search in memory for appropriate antecedentes by doing a reinstatement search
Memory for Discourse
How/What do we remember from reading a text?
A series of violent, bloody enounters between police and the Black Panther Party members punctuated the early summer days of 1969. Soon after, a group of Black students I teach at California State College, Los Angleles, who were members of the Panther Party, began to complain of continuous harassment by law enforcement officers. Among their many grievances, they complained about receiving so many traffic citations that some were in danger of losing their driving privileges. During one lengthy discussion, we realized that all of them drove automobiles with Panther Party signs glued to their bumpers. This is a report of a study that I undertook to assess the seriousness of their charges and to determine whether we were hearing the voice of paranoia or reality.
The van Dijk and Kintsch (1983) / Kintsch (1988) model of text comprehension
As text is read, it is processed into a set of propositions.
Propositions are connected to each other based on overlap of arguments in propositions (repetition rule)
Because of limitations of Short Term Memory, only a few propositions will be processed at a time.
More centrally connected propositions will receive the most activation
These propositions will remain in short term memory longer
Will tend to be transferred to long term memory and integrated with previous knowledge
Will tend to be recalled the best
Key points:
Coherence created through the generation of connections between propositions
The levels effect Evidence for the hierarchical organization of memory.
Propositions at higher levels are recalled better
Limits on STM Text processing involves a limited capacity system with the goal of integrating new textual information with previous knowledge
The psychological validity of propositions
Reading times
Kintsch and Keenan created sentences that were about the same word length, but had different numbers of propositions
1) Cleopatra's downfall lay in her foolish trust in the fickle political figures of the Roman world.
2) Romulus, the legendary founder of Rome, took the women of the Sabine by force.
Increased processing time with increased numbers of propositions to process
Cued recall studies
Words from the same proposition are more effective recall cues than words from different propositions
The mausoleum that enshrined the czar overlooked the square.
overlooked will be a better recall cue for mausoleum than for czar in spite of its physical proximity
The representation of propositions in memory
McKoon & Ratcliff
Priming task after reading a short text
Early French settlements in North America were strung so thinly along the major waterways that land ownership was not a problem. The Frenchmen were fur traders, and, by necessity, the fur traders were nomads. Towns were few, forts and trading posts were many. Little wonder that the successful fur trader learned to live, act and think like an Indian. Circulation among the Indians was vital to the economic survival of the traders.
Use one proposition as a prime for a second proposition
Task: Is the second sentence true?
1) Circulation among the Indians was vital
The fur traders were nomads.
2) Land ownership was not a problem.
The fur traders were nomads
Sentences that were close in discourse stucture produce the greatest priming
Amount of priming is based on the number of intervening propositions, not on the number of intervening words
Inferences and Discourse representation
Inferences: generation of information that is not explicitly in the text
John got the picnic supplies out of the car.
The beer was warm.
Sara ate the chicken.
She was sick all night.
Under what conditions do we draw inferences?
McKoon & Ratcliff (1992)
We draw inferences only under two conditions:
1) When the inference is necessary to make the text locally coherent
2) When the information upon which the inference is based is easily activated
Stated explicitly in the text
Part of the reader's general knowledge
Representation of inferences
Are inferences added in to the reader's text representation in the same way as explicit information from the text?
Kintsch (1974)
Subjects read either:
1) A carelessly discarded burning cigarette started a fire.
The fire destroyed many acres of virgin forest.
2) A burning cigarette was carelessly discarded.
The fire destroyed many acres of virgin forest
Verification task: True or false A discarded cigarette started a fire
No difference between explicit and implicit propositions after 15 minutes.
Evidence for two representations:
1) short-term verbatim representation
Only contains explicit information mentioned in the text
2) long-term propositional representation
Both explicit and implicit information from the text stored in the same structure
Does this work for pictures as well?
Bagget (1975) picture stories
Same effect although longer delay (72 hrs) before implicit information is stored .
The role of the situation model
A representation of our general understanding of what the text refers to.
General background knowledge, Mental maps, Mental pictures
Information is easier to be learned when we can integrate that information into our situation model.
Example: Bransford and Johnson (1973)
If the balloons popped the sound wouldn't be able to carry since everything would be too far away from the correct floor. A closed window would also prevent the sound from carrying, since most buildings tend to be well insulated. Since the whole operation depends on a steady flow of electricity, a break in the middle of the wire would also cause problems. Of course, the fellow could shouut, but the human voice is not loud enough to carry that far. An additional problem is that a string could break on the instrument. Then there could be no accompaniment to the message. It is clear that the best situation would involve less distance. Then there would be fewer potential problems. With face to face contact, the least number of things could go wrong.
No picture: 26% recall of ideas from text,
very low rating of comprehensibility
With picture: 57% recall of ideas from text
middle rating of comprehensibility
Goal for comprehension process of reading: relate propositions currently being read to propositions that are already stored in memory
This requires working memory
Does reading ability depend on our working memory capacity?
Daneman and Carpenter complex reading span task
Subjects are read aloud a series of sentences and recall the final word of each sentence
Number of sentences presented varied from 2 to 5
Combining processing function with a memory/storage function
Different people have different reading spans
Significant correlation between reading span and
comprehension test on questions
pronoun reference tests
Verbal SAT scores
Inferences and working memory capacity
Subjects provide Think aloud protocol while reading a passage
High memory span subjects produce inferences at the end of passage
Low memory span subjects produce inferences in the middle of the passage
What does this say about how we process text?
For low memory span subjects, difficulty of retaining information forces them to make inferences, even if they are wrong.
High memory span subjects can maintain multiple interpretations active in working memory and are able to make the correct interpretation when it is necessary.
Schemata and Discourse processing
How does the global structure of the discourse affect comprehension?
Genres: A type of discourse that has a characteristic structure
Class lectures, fairy tales, political speeches, News articles, Research articles, cop movies, romance novels
APA format:
Title page/Abstract
Introduction
Method
Subjects
Materials
Procedure
Results
Discussion
References
Appendices
Why does psychology use a single format?
Narrative Stories:
Introduction of Characters/Goal of main character
Introduction of Setting
Character runs into some obstacles
Character overcomes obstacles
Schemata and Discourse processing
Schema: a structure in memory the specifies the general or expected arrangement of a body of information
Restaurant schema
List 20 actions or events that occur commonly in eating at a restaurant.
Large agreement between subjects
All subjects mention:
Look at menu
order food
pay bill
Subjects also group the actions into similar components
Bartlett (1932) War of the Ghosts
Recall over time: How does the reader's schemata affect recall?
Process of "normalization" in recall: Stories that are incosistent with a reader's schemata are distorted in the direction of the schemata
Voss baseball study:
Subjects listen to fictitious baseball game
High knowledge subjects have better recall of events that are important for the game
No difference between high and low knowledge subjects in recall information not related to the game's goal structure (weather, name of players)
Activation of an appropriate schema
If a text is written so that we can not activate the appropriate schema, recall will be poor.
The procedure is actually quite simple. First you arrange things into different groups. Of course, one pile may be sufficient depending on how much there is to do. If you have to go somewhere else due to lack of facilities, that is the next step; otherwise you are pretty well set. It is important not to overdo things. That is, it is better to do too few things at once than too many. In the short run this may not seem important, but complications can easily arise. A mistake can be expensive as well. At first the whole procedure will seem complicated. Soon, however, it will become just another facet of life. It is difficult to foresee any end to the necessity for this task in the immediate future, but then one never can tell. After the procedure is completed, one arranges the materials into different groups again. Then they can be put into their appropriate places. Eventually they will be used once more, and the whole cycle will then have to be repeated. However, that is part of life.
Do we recall the details based on the activated schema?
Pichert and Anderson (1977)
Text describing the interior of the house.
Read the story from the perspective of:
buying a house
a burglar
Recall the story
now recall the story from the other perspective
After first recall, subjects recall more relevant information for their particular schema
After second recall, subjects able to recall previously unrecalled propositions from the second perspective
We are able to revive information by using a different schema
Conclusion:
Schemata have a directive effect on comprehension
Schemata help organize learning and affect recall of information
Story comprehension
How do we define a schema?
Story grammar: A schema that identifies the typical arrangement of events in a story
Mandler's rules for a story grammar
Like phrase-structure rules
Tree structure representation of stories
Application of story grammars to children's stories
All the animals at the zoo receive very good care. A zookeeper stays up each night to watch over them. One night the lions start making a lot of noise. One by one they started to roar and growl fiercely. The keeper was worred that they were sick or hurt. He wanted to see if something was wrong with them. He carried his bag of medicine into the lions' pen. The keeper fully checked over each one of the lions. He soon discovered that they were not sick at all. Thy were just trying to keep the other animals up. The zookeeper finally got them to go back to sleep. Soon things were quiet, and the zoo was calm again. Later on the elephants all got into a water fight. They used their trunks to toss water at each other. The keeper was mad because they were making a mess. He wanted them all to be quiet and peaceful again. He ran after the frisky elephants with a big stick. He shouted angrily at them to stop the water fight. The paid no attention to him and just ran faster. They squirted lots of water all over the poor man. They made an awful muddy mess in the elephant yard. The zookeeper worked for three days cleaning it up.
How do we represent and process episodes?
Episodes tend to be recalled in an an all-or-none fashion (chunks)
Length of episodes does not affect their recall
The time course of reading episodes
Haberlandt, Berian and Sandson (1980)
Sentence by sentence reading task, recording sentence reading time
Reading times were longer at the beginnings and ends of episodes
What processing is going on?
Begining of episodes
Identifying new topic of discourse
Forming expectations
End of episodes
Summarize the gist of the episode
Deciding what propositions to hold over
What is retained best from a story?
Attempts and outcomes are retained
Goals, reactions, emotions tend to be lost
Emphasis on what happenend more than what a character was thinking
Does the story grammar hold true for other cultures as well?
Mandler tested stories in Liberia
Wide range of literacy and education
Results show that recall patterns are similar despite different cultures and different degrees of experience with stories
Indicates that story grammar may be a universal schema
What do all these psychological studies say about teaching and learning?
Strategies for better learning from text
Strategies for improving texts
Speedreading
What do you have to do to learn from a text?
Key issues:
Learning surface structure vs. deep representation of text
Identifying the important information in atext
Connecting the discourse into a coherent body of information
Activating the appropriate schema (if you have it) and connecting it to the information being processed
Active processing vs. passive processing
Relate what we read/hear to what we already know
Palinscar and Brown (1984)
Trained junior high students who were poor readers
Trained to formulate questions that would be answered by the most important parts of the text
Students learned to identify main themes in texts
Post-test scores improved from 30% to 80%, no improvement for control group
Long-term retention of these reading skills
Questioning the author (Beck)
Why did the author write this?
What is his/her point?
How would I summarize this section?
Where is he/she going with it?
Hilighting texts
Does using hilight pens help?
Students read texts with or without hilight pens
Students who did not hilight had higher comprehension
It may help later search, but it did not foster active processing
There are hilighting strategies that will help, though
We need to process information at a deep semantic level in order to retain it
Identifying the main points
What is remembered from a lecture?
Kintsch & Bates (1977)
Students remembered recalled main points, but also recalled just as many details and jokes.
Problem for students: What are the main points of a lecture?
How do we identify the main points?
Explicit text signals can aid in comprehension of main points (Meyer, Brant & Bluth)
Revised texts to improve signals
Explicit: A problem of vital concern is the preventio of oil spills from supertankers
Implicit: Prevention is needed of oil spills from supertankers
Explicit text improved comprehension of poor comprehenders
No difference in comprehension for good comprehenders
What about just leaving in the main points and taking the details out?
Reder & Anderson (1980)
Improved retention for condensed versions of texts
Discourse coherence
Goal in text comprehension: connect discourse into a well connected structure
Two ways of looking at coherence.
Improving reader's strategies
Improving the text's coherence
Improving reader's strategies
Reader's goal: create a well linked set of propositions
Look for coherence discourse markers at the global and local level
Local:
Sara went home. She was very tired
While this was the case, other people were not as pleased....
Global
A related point is...
In summary...
Topic headers
Draw inferences
A cigarette was carelessly discarded. The fire caused a lot of damage.
The plane was equiped for IFR flight. It should have managed to get through the storm.
Improving the text's coherence
Revising the coherence of the text at the local and global level
Study by McNamara, Kintsch, Songer & Kintsch
One text on heart disease revised into four versions improving local coherence and/or global coherence
Improving local coherence:
Replace pronouns with noun phrases (replace it with heart)
Adding sentence connectors (however, therefore, because, so that)
Replacing words to increase argument overlap (repeat words across sentences)
Improving global coherence
Add topic headers (Congenital Heart Disease, Acquired Heart Disease)
Add macropropositions linking each paragraph to the rest of the text and to the topic. (There are many kinds of heart disease, some of which are present at birth and some of which are acquired later)
|
Low Local Coherence |
High Local Coherence |
|
|
Low Global Coherence |
||
|
High Global Coherence |
General result: Higher levels of coherence in text result in improved comprehension.
Why? Less cognitive strain on reader since the topic is unfamiliar
Second manipulation.
1/2 the students have high knowledge about heart disease
1/2 the students have low knowledge about heart disease
Results:
High knowledge students benefit most from low coherence texts
Low knowledge students benefit most from high coherence texts
Why? Force active processing for high knowledge students
Related applications
Britton & Gulgoz (1988)
Revised version of texts for Air Force textbooks
Revisions done based on propositional analysis to determine breakdown in coherence.
Results: Propositional analysis of a text can identify places where coherence can break down. Revising these places in the text can improve a reader's comprehension.
Building global structures
How does the current text fit in with your understanding of the topic?
Can you induce the author's schema?
Approaches:
Try to summarize portions of the text
Summarize each paragraph/section. How does it fit in with what you know?
Tailoring comprehension to how you will use the information
Try to encode/study the information in a similar manner to the way it will be used.
Mannes & Kintsch (1987)
Students provided with an outline of a topic that was either:
consistent with the text
inconsistent with the text
After reading the text, comprehension was evaluated
Students with the consistent outline performed better on memory for the text
Students with the inconsistent outline performed better on inference verification and problem solving
Why?
Inconsistent outline forced more active processing (deeper understanding)
Consistent outline permitted easier memorization of text but not deeper learning.
Review:
Strategies for improving your recall of a text.
Active processing
Identifying the main points
Connecting the propositions in the discourse
Building global structures
Tailoring the comprehension activities to the test
Strategies for improving a text to make it easier to comprehend
Revising local coherence
Revising global coherence
Tailoring the text to the reader's background knowledge/schema