Comprehension, Coherence and Strategies in Hypertext and Linear Text

Comprehension, Coherence and Strategies in Hypertext and Linear Text

Peter W. Foltz

Dept. of Psychology
New Mexico State University
Las Cruces, NM, 88003

Foltz, P.W. (1996) Comprehension, Coherence and Strategies in Hpyertext and Linear text. In Rouet, J.-F., Levonen, J.J., Dillon, A.P. & Spiro, R.J. (Eds.) Hypertext and Cognition. Hillsdale, NJ: Lawrence Erlbaum Associates.


Hypertext presents a new way to read on-line text that differs from reading standard linear text. Text is typically presented in a linear form, in which there is a single way to progress through the text, starting at the beginning and reading to the end. However, in hypertext, information can be represented in a semantic network in which multiple related sections of the text are connected to each other. A user may then browse through the sections of the text, jumping from one text section to another. This permits a reader to choose a path through the text that will be most relevant to his or her interests.

The features in hypertext supply flexibility to the reader when compared to reading linear text such as books. Clearly some of this flexibility does exist in books (e.g. table of contents and indexes), but it is not as widely used or exploited. Hypertext permits readers to use these features automatically rather than requiring readers to manually refer to them as needed. This provides additional control to the reader in determining the order that the text is to be read, and allows the reader to read the text as if it were specifically tailored to the reader's background and interests. This flexibility does promise an advantage of personalization and eases the burden of finding information, However, is this flexibility actually good or useful to the reader?

The concept of using the associative paths in hypertext to retrieve and read information has caused great excitement. The promise of universally available hypertexts has been touted as "a seamless and reunited computer world" (advertisement for the Xanadu Hypertext ), having "the potential to become a significant application area; equaling or perhaps exceeding that of word processing, spreadsheets and general database applications" (Begoray, 1990, p. 121), and "because hypertext has the power to change the way we understand and experience texts, it offers radical promises and challenges to students, teachers and theorists of literature" (Landow, 1989, p. 174). Using associative retrieval paths is similar to the way retrieval is performed from human memory, and this may be part of the appeal of hypertext to researchers and developers when they state that hypertext systems will improve a user's ability to find and use information.

While a variety of hypertexts have been developed over the past 20 years, it is often not clear whether there are strong advantages for hypertext. Research in hypertext has often failed to show any significant advantages for reading a hypertext compared to the equivalent text in linear form. In addition, the effectiveness of various features that can be used in hypertexts can vary greatly depending on the domain and content of the text and the goals of the reader. Up to this point, no standards or definitive rules exist on how to develop an effective hypertext. However, because hypertext encompasses such domains as user-interface design, psychology, education, and information retrieval, theory from these domains can be applied to hypertext in order to aid in the understanding and development of effective hypertexts.

Over the last 40 years, a large body of research has developed which focuses on linear text comprehension, both from a theoretical and an evaluative standpoint. Text research has permitted predictions of comprehension based on such factors as the structure of the text, the background knowledge of the reader, and the reader's abilities. This chapter examines the evaluation of hypertext from the perspective of text comprehension.

Approaches to Developing Hypertext

Research in Hypertext

The development of hypertext systems has created research into how to design and use hypertexts. Much of this research has focused on computer science and interface issues rather than the cognitive aspects of hypertexts. Some of the major areas of hypertext research include: the development of the underlying representations of information and connections (e.g., Botafogo & Shneiderman, 1991; Furuta & Stotts, 1989), methods of connecting, structuring and retrieving the information (e.g., Croft & Turtle, 1989; Crouch, Crouch, & Andreas, 1989), designing hypertexts for supporting argumentation (e.g., Conklin & Begeman, 1989; Fischer, McCall, & Morch, 1989), studying information seeking in hypertexts (e.g., Marchionini & Shneiderman, 1987; Weyer, 1982), and the role of rhetoric and writing in hypertexts (e.g., Bolter, 1991; Britton & Glynn, 1989).

While there have been a variety of areas of research on hypertext, one of the primary goals of the research has been to evaluate hypertexts and hypertext features in order to bring about improvements. From this research, we can then get an idea as to how well hypertexts have succeeded and what hypertext features are the most effective.

Evaluation of Hypertexts

Research comparing hypertext to linear text provides some measure as to what situations will be best suited for the presentation of textual information using hypertext. While there have been claims of the superiority of hypertext over linear text (e.g., Martin, 1990; Nelson, 1967), research into this area has not been uniformly successful in showing an advantage for hypertext. The chapter by Rouet and Levonen in this volume discuss the details of these studies. Overall, the results of the studies vary greatly. A large part of the variability may be due to the different tasks, hypertext systems and the types of texts used. Therefore, the results indicate that there are several factors that must be considered in order to determine whether a hypertext or linear text would work best.

Because hypertexts have incorporated many new features that are not found in linear text, such as linking, the ability to search for information, guided tours and overview maps, it is often not clear which of the features may be effective in improving the users' ability to read and comprehend the text. Without information about what features work best, it is difficult to know what features to incorporate into the design of a new hypertext. For this reason, there have been studies comparing the various hypertext features.

One factor that may affect reading hypertext is the resolution of the screen. Work by Gould (Gould & Grischkowsky, 1984; Gould, Alfaro, Fonn, Haupt, Minuto, & Salaun, 1987) found that reading time is significantly slowed when reading a text on a computer screen as compared to reading the same text on paper. However, when a high resolution screen with anti-aliased fonts are used, the reading speeds are equivalent. Thus, studies comparing linear text on paper to hypertexts on screens may find some reading differences merely due to the resolution of the text rather than any factors of the actual structure of the text. For this reason, research that makes comparisons of hypertext and linear texts should ensure that they both have equivalent resolution of the text.

With the multiple paths that are possible when reading a hypertext, there is a greater navigation load on the reader than with linear text. In order to simplify the reader's task then, additional navigation features must be provided. In a linear text, a table of contents and an index are the two typical navigational features. Since linear text is often organized hierarchically, the table of contents serves as an outline with pointers to pages for each entry. However, in a hypertext, the structure may be much more complicated. For that reason, a common feature in a hypertext is a map. A map permits a reader to see a representation of the text's structure along with the relationships between the different text sections. Monk, Walsh & Dix (1988) found that with a map, subjects were much faster at being able to answer questions in a hypertext than with no map. However, in a similar study, Hammond & Allinson (1989) found no differences in answer accuracy or task time whether or not a map was provided. Nevertheless, they found that with a map, subjects did tend to visit more of the text nodes than the subjects who did not have a map. There were large differences in the style of navigation based on the subject's task. Subjects who were instructed to just explore the information space tended to use a guided tour facility, while subjects with the more directed tasks of answering particular questions used mostly the map and index to help navigate.

Thus, we see that there are varying results as to whether certain features aid the user of a hypertext. In addition, factors such as the user's goals and background knowledge can influence whether a particular feature is useful. Nielsen (1989) performed a meta-analysis of 92 benchmark measures that had been taken from usability issues that had been tested in hypertext research. In all of the measures he examined, he found that there were actually very few results which had large effects. From this analysis, he concluded that there are actually very few studies that have shown the real world impact of hypertext systems. In addition, the lack of strong effects in this area of research could mostly be explained by large individual differences among users, tasks and texts. Thus, while there has been a lot of research in hypertext, few concrete results exist to provide strong evidence on how well hypertexts will work or guidelines on how to create effective hypertexts.

Guidelines for Developing Hypertexts

Another source of how to develop hypertexts comes from development guidelines. Several books and articles have been published containing guidelines and rules for hypertext development (Nielsen, 1990; Martin, 1990). These guidelines concentrate on issues such as how much text should be contained in a node, what hypertext features to use, and how the information should be structured. However, most of these guidelines do not provide concrete design rules based on theory, but instead present abstract rules based more on common sense[1]. The fact that the rules are fairly abstract is not surprising given that writing a hypertext is as difficult, if not more difficult, than writing linear text. With linear text, we do have some guidelines, but they still are somewhat abstract, and effective writing depends more on large amounts of practice than just being able to follow a set of writing rules. In addition, the style of writing will depend greatly on the points the author wants to express and the domain in which the author is writing.

Theory-Based Approaches to Hypertext Design and Evaluation

The studies described above shed some light on what features may be used to develop a good hypertext system, however, they are very insular, only examining particular features with particular texts. Without an ability to generalize outside of the texts and tasks, with every new feature or text developed, another evaluation study would need to be performed to determine its effectiveness. An alternative approach is to use a theoretical background to drive the design of the hypertext. This then would permit comparisons of features through using the theory to make the performance predictions. Theory-based approaches have been used in the study of human-computer interactions (e.g. Card, Moran & Newell, 1984; Kieras & Polson, 1985; Lewis & Polson, 1990). These approaches use a theoretical model of the user and system in order to derive predictions of usability. They typically involve modeling such factors as the users' knowledge, possible states of the computer, and possible actions a user can take, and have had some success at predicting usability of systems and of particular features. While theory-based design has not been fully incorporated into the majority of software designers, its success for designing interfaces suggests that it can similarly be used for the design of hypertext systems.

There have been a few theoretical approaches to studying and designing hypertext. While not based on some of the strict modeling criteria used in the modeling mentioned above, these approaches do use theoretical bases or guidelines for their development and testing. One approach has been to use user-interface guidelines, such as dialog rules, to determine how a hypertext interface should be designed (Hardman & Sharrat, 1989; Shneiderman, 1987). This approach permits pieces of the interface of a hypertext to be developed based on proven guidelines. A second approach has been to use "formative design evaluation" (Egan, Remde, Gomez, Landauer, Eberhardt & Lochbaum, 1989, Landauer, Egan, Remde, Lesk, Lochbaum, & Ketchum, 1992). In formative design evaluation, development and changes of features are based on psychological guidelines. Behavioral studies are then performed on the system in order to determine what features can be improved and in what ways. This iterative design method allows a comparison of the features and quantitative measures of the improvement from previous versions. A third approach has been to develop hypertexts from the point of view of information retrieval theory. In this approach, models from information retrieval have been applied in order to determine how to structure the information. These methods include, probabilistic models of retrieval (Croft & Turtle, 1989), hierarchical clustering (Crouch, Crouch & Andreas, 1989), and petri nets (Furuta & Stotts, 1989) in order to connect the nodes of information. A more cognitively-based approach is Cognitive Flexibility Theory (Spiro & Jehng, 1990). Cognitive Flexibility Theory represents the textual information in multiple perspectives that permit the reader to learn and use the information flexibly.

While these approaches have helped improve the design of hypertexts, one area that has been neglected is to consider the text in terms of its discourse cues. Charney (1987, 1994) has suggested this area as a possible area for improving hypertexts. However, thus far, there has been little theoretical research in this area. This may be partially due to the fact that up to this point, developers of hypertexts tend to be primarily computer scientists and not psychologists with skills and background in issues of text comprehension. Thus, a different theoretical approach to hypertext is to examine it in terms of text comprehension and discourse cues.

Text Comprehension Research and Hypertext

Over the past 30 years, there has been a large amount of research in text comprehension, primarily in the fields of psychology and education. The goals of text comprehension research are to understand what factors of the reader and the text influence the ease of comprehending a text and to make some predictions as to how easy a text will be to comprehend. Through modeling both the text and the reader's knowledge and abilities, researchers have been able to both develop better texts and a better understanding of the human comprehension processes.

Research in text comprehension has examined a variety of factors that influence comprehension. These factors include: the role of coherence in a text, the role of the readers' background knowledge, the role of the narrative schema of the text, and the role of the reader's cognitive abilities. In addition, studies have examined the strategies readers use when going through a text and the role these strategies can play for comprehending a text. Not only do these factors play a major role in linear text comprehension, they also play an equally important role in the comprehending of hypertexts.

Predictions of Comprehension

Modeling Comprehension

Hypertext differs from linear text in some fundamental ways. A hypertext provides more flexibility to the reader in choosing where to go in the text. A hypertext also provides the reader with more methods to employ in order to find the relevant information in the text and to move through the different sections of the text. However, aside from these differences, the primary goal of both hypertexts and linear texts is to convey information in a coherent form to a reader. In this manner, the reader should be able to extract the relevant information from the text that should reflect the intended goals of both the author and the reader. For this reason, we can examine how the results from research in text comprehension can have implications to the design and evaluation of hypertexts.

Researchers in the field of text comprehension have used user models to predict what information will be learned from a linear text. One primary approach has been to examine comprehension using the Kintsch model of text comprehension (Kintsch, 1988; van Dijk & Kintsch, 1983). The Kintsch model has been used for predicting the comprehension of text based on such factors as what features will be remembered from the text (van Dijk & Kintsch, 1983), the role of background knowledge (Britton & Gulgoz, 1991; van Dijk & Kintsch, 1983), the role of coherence and readability (Kintsch & Vipond, 1979), and also goal planning in such domains as computer mail systems (Mannes & Kintsch, 1991) and the UNIX operating system (Doane, Kintsch & Polson, 1990).

When reading a linear text, processing occurs at many levels. These levels range from the low level processes of recognizing individual words up to high level processes of deriving the gist of the information in the text. These processes work together simultaneously to extract meaning from the text. Meaning, however, is represented at different levels.

In the Kintsch model, the reader's memory for text is represented at 3 levels, a surface representation of the words and sentences, the meaning of the text (textbase) and a general representation of what is described by the text incorporating outside background knowledge (situation model). As text is read, the text is incorporated into the readers' representation of the information. Information from the surface representation of the text is quickly lost (e.g., Bransford & Franks, 1971). However, some of the abstracted information from the surface structure of the text is incorporated into the textbase, represented as propositions. Propositions serve as semantic primitives representing the information acquired (e.g. Kintsch, 1974; Fodor, Fodor & Garrett, 1975).

The propositions are connected to each other in the textbase through semantic coherence relations. In semantic coherence, constituents of the text will be coherent if they share some form of semantic relatedness in the discourse. The amount of coherence is therefore represented by the amount of shared meaning and referential relations. These coherence relations are based on standard rhetorical devices in the text, such as causality, use of pronouns, and word repetition. The semantic coherence can be represented both at the local level of individual propositions in the textbase and at the global level of the macrostructure of the text.

The mental representation of these interconnected propositions in the textbase (coherence graph) takes the form of a hierarchical structure with higher level concepts represented as super-ordinate propositions which are connected to lower level concepts represented as subordinate propositions (Kintsch & van Dijk, 1978). Recall of propositions follows this hierarchy, with propositions from the upper part of the tree being more likely to be recalled than lower level propositions (Britton, Meyer, Hodge & Glynn, 1980; Kintsch & Keenan, 1973; Meyer, 1973). At a higher level, readers generate a macrostructure or gist of the text. The macrostructure is the result of the readers' inferential processes, with readers forming hypotheses of the overall meaning of paragraphs, chapters and whole books. In this manner, the macrostructure incorporates the reader's background knowledge with the text in forming these hypotheses. The resulting macrostructure of the text is similar to the textbase in that it is also represented as a hierarchical structure with higher level concepts represented at the top.

The Role of Coherence

The process of incorporating propositions into the textbase is a process of maintaining coherence. Propositions that have overlapping arguments, and thus are semantically related, create coherence. However, if the current proposition being processed does not share arguments with propositions in short-term memory, then a bridging inference must be made by the reader in order to maintain coherence (e.g., Kintsch & van Dijk, 1978). When a reader makes a bridging inference, the reader must use knowledge from his or her situation model in order to fill in the missing information. Writers often assume readers will have the appropriate background knowledge to make the proper bridging inferences, and knowledgeable readers can make these inferences automatically. However, if a reader does not have the proper background knowledge, these inferences consume additional resources of the reader, typically resulting in lower comprehension. Thus, the amount of coherence in the text, can be used to make predictions of comprehension (e.g., Kintsch & Vipond, 1979; Miller & Kintsch, 1980). Coherence of a text can be assessed by converting a text into its propositions and then calculating the number of arguments that overlap from one proposition to the next. The greater amount of propositional overlap corresponds to a greater amount of coherence.

One application of this predictive power has been to improve the comprehensibility of texts. Britton & Gulgoz (1991) used the Kintsch model to identify locations in a text where a reader would have to make these bridging inferences in order to maintain coherence of the text. The chosen text was on the topic of the air war in Vietnam and so undergraduates had very little background knowledge to make the correct bridging inferences at these incoherent locations. They created a revised version of the text in which they inserted into the text the inferences that would be needed to be made at the various locations. They found that undergraduates reading the revised text had significantly better recall and a better mental representation of the text than those who had read the original version.

In a linear text, a writer typically maintains a set of coherent arguments through the text. At the local level, a writer makes words and sentences flow together through common referents. At a global level, a writer similarly makes paragraphs and sections flow from one to the other in a coherent manner. This aids the reader in structuring the information in the text to fit into the knowledge structures of what has been read previously. If there is little global coherence between sections, then the user must make bridging inferences in order to maintain coherence (e.g., Kintsch & Vipond, 1979). For readers without appropriate background knowledge, these inferences can consume the resources of the reader, typically resulting in lower comprehension.

In a hypertext, at any text section there are usually a variety of other sections to which a reader can jump. However, it may not be possible for a writer to anticipate all the possible places to which a reader may jump and therefore, it may also not be possible to maintain good macrocoherence for all possible links. Although the writer of the hypertext may code all the links, it would still be difficult to write each section so that it would cohere well with every possible section to which a reader could jump. Jumps which are not coherent could result in additional processing load for the reader, as the reader generates the necessary inferences to incorporate the textual information from the new node into what has been previously read. Thus, a many of the possible links in a hypertext may cause difficulty in the reader's comprehension.

One exception to this problem of hypertext coherence may be found in argument-based hypertext systems (e.g., Conklin & Begeman, 1989; Fischer, McCall, & Morch, 1989). These systems take into account the role of coherence through only allowing jumps between nodes in which a coherent argument has been previously set between the two. These coherent links are created through careful hand-coding of all possible links. While argument-based hypertext involves a lot more hand crafting in order to create only these coherent links, it avoids the problems of readers jumping to nodes using links which may cause an incoherent transition.

The Role of Background Knowledge

As a text is read, there is a large cognitive load on the reader as the reader is decoding the text and incorporating the textual information into his or her knowledge base. The key to incorporating the information into the reader's knowledge base is partly dependent on the amount of background knowledge of the reader. A reader's background knowledge permits the information to be incorporated into pre-existing knowledge structures. So readers who do not have an adequate amount of background knowledge on the subject of a text will have lower comprehension of the text (Spilich, Vesonder, Chiesi & Voss, 1979; Voss, Vesonder & Spilich, 1980). The background knowledge permits the reader to provide a coherence to the text, permitting better bridging inferences between non-coherent sections and also permitting additional elaborative inferences. In addition, there is evidence that the background knowledge influences the processing of information for developing at the situational model level, but not at the propositional level (Fincher-Kiefer, Post, Greene & Voss, 1988). Thus, additional background knowledge of the text makes it easier for developing relevant macropropositions, resulting in a better representation of the text for the reader.

The amount of background knowledge can also differentially affect the readers of a hypertext. As in linear text, readers with background knowledge on the domain of the text will be better at encoding information from the text than those without the background knowledge. The high knowledge readers will have the correct conceptual structures in which to integrate the new information and therefore the reader will tend to have better recall of the text.

However, due to the flexibility of hypertexts, readers with little background knowledge may have additional difficulties when compared to readers of equivalent linear texts. Readers with background knowledge will already have the correct conceptual structures for the domain. The hypertext structure may therefore be very familiar for them. However, a reader with little knowledge of the domain of the text will not be familiar with the structure of the hypertext. Since one of the concepts of hypertext is to permit more flexibility for the reader in choosing where to go, a low knowledge reader may not be able to accurately choose the relevant text sections. Thus, low knowledge readers may have additional problems of navigating through the hypertext structure. This problem may not be as evident in linear texts, since the linear text provides a single (default) path to read through the text. Low knowledge readers can always take this path, even if they are having trouble comprehending the text.

The Role of Narrative Schema

Similar to the background knowledge, the narrative schema of a text can aid in the comprehension of that text. A knowledgeable reader can use the narrative schema to provide a structure used in which to organize the text (e.g., Black & Bower, 1979; van Dijk & Kintsch, 1983; Dillon, 1991). Placing a text in a well known narrative schema can improve the comprehension of that text (e.g., Poulsen, Kintsch, Kintsch, & Premack, 1979).

In linear texts, there are a variety of common narrative schemata employed. Most readers are familiar with them and can use their knowledge of a schema in order to help structure and integrate the textual information. However, a narrative schema is primarily dependent on a familiar organization of the presentation of the information. With the novel structures found in hypertexts, much of any familiar narrative schema will not be evident. This can cause difficulties for the readers of a hypertext because they can not effectively organize the textual information into their situation model. While readers of hypertexts may not currently be able to rely on a familiar narrative schema, this may change in the future. As hypertexts become more accepted and widespread, writers of hypertext may develop standard rhetorical styles. Readers who are then familiar with those rhetorical styles can use that knowledge to help in their structuring of the information in an effective manner.

The Role of Readers Abilities

With the large amount of both high-level and low-level processing that must take place in order to comprehend a text, one can expect differences in comprehension based on the reader's abilities. Skilled readers tend to have better skills at exploiting context cues and other textual constraints. They are able to make better hypotheses about the meaning of words (Perfetti & Roth, 1981), and are more responsive to the rhetorical structure of the text (Meyer, Brandt & Bluth, 1980). On the other hand, poor readers' decoding skills are not as effective and instead they compensate through using context-dependent hypothesis testing. Thus, skilled readers are able to use parallel automatic processing to form better hypotheses about the meaning of the text as they read through it and are not as dependent on the contextual cues of the text. In this manner, if contextual cues are missing or are confusing, then the performance of poor readers will be degraded to an even greater extent.

Skilled readers of linear text are more responsive to the rhetorical structure of the text and possess better decoding skills. In contrast, poor readers must rely more on the context to help in decoding. Hypertext may provide fewer context cues than equivalent linear texts. This could be the case in a hypertext in which a reader is not given a lot of information about the structure of the text or about where to go in the text. In a linear text, a poor reader could rely on information from the linear structure for this context, but the same context may not be as evident in the hypertext. On the other hand, a well-structured hypertext that provides a map of its structure may provide additional context cues that would not be present in a linear text. In such a case, a poor reader may find the hypertext to be more of an advantage than the equivalent linear text.

Hypertexts also cause additional processing load by making the reader responsible for navigating the text. Skilled readers, who can process the text automatically will not have as much interference from the controlled processing task of having to make choices of where to go as they read. Readers with poor reading skills are using a lot more controlled processing for their reading processing and thus will likely have a greater amount of interference from the additional task of navigating the text. This will not let them generate as many hypotheses about the text as they read it, making it harder to integrate the information.

Thus, the abilities of the reader may interact with the type of text format and how that text format is implemented. A well structured hypertext which does not impose a large navigational load on the reader may be an advantage for poor readers. On the other hand, poor readers may have great difficulty with a less well structured hypertext when compared to the equivalent linear text.

Readers Strategies in Hypertext and Linear Text

While much of the results described above are based on tasks in which a reader reads through a text in a single linear order, readers also use various strategies for reading through a linear text. Not only do the reader's abilities and the static characteristics of the text determine the comprehension, but also the structural, syntactic and semantic signals in the text can affect comprehension. These signals provide evidence for the macro-relevance of the individual sections of text . A variety of studies have examined the role of different types of text signals on comprehension, including studies of titles (Bransford & Johnson, 1972), frequency of mention (Perfetti & Goldman, 1974), enumeration (Lorch & Chen, 1986), and initial sentences (Kieras, 1981).

While these signals can affect comprehension, they also determine what a reader looks for in a text and what the reader ignores. A reader may look through a text and find relevant sections based on these signals and skip over sections that do not seem to be as relevant. In this way, a linear text may not always be read linearly. There has been some research on the strategies used in reading a text. However, reading strategies have not been investigated as much as some of the comprehension processes in reading, since there is a lot less control of the conditions and it is harder to determine what information a subject has read and the subjects' motivations for choosing the particular information from the text.

Goldman and Saul (1990) identified a number of strategies used by subjects reading text passages. These strategies were identified both at the global level of the text and at the more local levels. Subjects read individual sentences and could go backward and forward through the sentences. At the global level, they identified three approaches: Once through, in which subjects read straight through a text, Review, in which subjects went to the passage end and then reviewed sentences, and the Regress approach, in which they went back to previous sentences throughout the text. They found that subjects almost always used more than one approach to reading a passage. Thus, readers employed a great deal of flexibility in reading the passages. However, their approaches for getting through the text were done in such a manner as to establish both local and global discourse coherence. From these results, they proposed a model of strategy competition for reading strategies. In their model, the choices of where to go in the text follow a set of procedural rules. These procedural rules work at both the global and local level to maintain coherence of the text, to react to the textual features that serve as cues to the coherence relations and to make strategy choices when coherence can not be established.

Thus, reading is not always a static process that proceeds in a single order. Readers can employ a variety of strategies for proceeding through a text. These strategies may be determined by a variety of factors, including, the reader's knowledge for the domain, the reader's goals, the reader's skills, and the characteristics of the text. However, the reader must still work to maintain the overriding goal of proceeding through the text in a coherent manner.

The examination of strategies in hypertext have primarily concentrated on the search strategies of readers when hypertexts have been used as information retrieval systems. Since hypertexts provide additional navigational flexibility to readers, one could expect that readers would employ a variety of strategies. As evident from the Goldman and Saul (1990) study, readers use a variety of strategies in order to maintain coherence through a linear text. It is expected that readers of hypertexts would do the same, since they must still maintain coherence in order to develop a coherent mental representation of the text.

Part of the goal of developing hypertexts is to have mechanisms to simplify the navigational strategies of readers. Hypertexts were developed so that readers could get to the relevant sections in a more efficient manner than linear texts. For this reason, it is expected that the strategies of the reader can depend greatly on what mechanisms are implemented in the hypertext. A hypertext based primarily on search will tend to structure a reader's strategies to using search. However, these mechanism must also match the goals of the reader and these goals can influence the choice of strategies. Hypertext typically permits a reader to see more of the structure of the text and therefore can make it somewhat easier to find the relevant places in the text. Thus, the reading strategies in hypertext could be expanded from those found in reading a linear text and could depend on the mechanisms provided in the hypertext and the goals of the reader.

Two Experiments Applying Text Comprehension to Hypertext

Overall, there are a variety of factors that can cause differences in readers' comprehension and strategies of a text written in hypertext and linear form. Foltz (1993) ran two experiments that focused on two of these factors, the coherence of the text and how the goals of the reader affected reading strategies. The first experiment examined the comprehension and strategies of readers using either a linear text or one of two hypertext environments. The second experiment used verbal reports to investigate readers' strategies in hypertext.

A Comparison of Linear Text and Hypertext

In the first experiment, readers' comprehension and strategies were measured when using either a linear text or one of two hypertext versions of a chapter from an undergraduate level economics textbook. The goals of the readers were manipulated so that half of the readers read the chapter for general knowledge, while the other half read the chapter in order to find certain specific pieces of information.

This design permitted several comparisons. The first comparison was whether there were differences in readers' comprehension and strategies between the hypertexts and the linear texts. It was hypothesized that since readers would have little background knowledge on the subject of the text, the lack of coherence in the hypertext may adversely affect the readers' comprehension when compared to the comprehension of the readers of the linear text. The second comparison was whether the goals of the reader interacted with the format of the text. More specifically, did readers use different strategies based on their goals, and did one format of the text afford better comprehension than another? It was hypothesized that hypertext may be more suited for tasks involving information search, so readers with specific goals, in which they must find the relevant information, may perform better on the hypertext, while readers with general reading goals would find an advantage for the linear text, since it presented a single coherent set of information on the text.

An undergraduate level economics textbook chapter was converted into a hypertext using guidelines from several works on hypertext (e.g., Martin, 1990; Nielsen, 1990; Shneiderman, 1987). This included using the authors' section titles to designate nodes and the authors' outline for generating a hierarchy. The resulting hierarchical hypertext contained 6018 words consisting of 26 nodes. In addition to the hierarchical structure, there were also 17 cross- links where the authors made references to other sections. The hypertext permitted readers to navigate through the text using buttons for following both the hierarchical structure and the cross links. In addition, there was a map of the hypertext structure, which permitted jumping to any point in the text.

As described above, one of the hypothesized problems with hypertext may be that the jumps from node to node may not present as coherent a chapter as that of the linear text, resulting in lower comprehension. In order to investigate this idea, a second, more coherent hypertext was also developed. This hypertext was exactly the same as the first one, but also provided additional coherence in the text by automatically modifying the text in order to make it more coherent. In order to determine when a non-coherent jump was made in the hypertext, a macro-propropositional analysis of the text was first performed. In this analysis, the text was reduced to a set of macro-propositions, describing the gist of each sentence. The analysis indicated where possible non-coherent jumps would occur while reading the hypertext and what missing pieces of information would be needed for any jump in order to make the transition more coherent. Then, whenever non-coherent jumps were made, the computer added in an additional first paragraph of the necessary missing macropropositions, in sentence form, in order to make the transition more coherent. An example of one of the hypertext nodes with the inserted coherence paragraph, labeled "General Background" is shown in Figure 1.

Figure 1. Node from coherent hypertext with context paragraph inserted.

In order to compare the hypertexts to a linear version of the text, an on-line linear version was created. This text permitted subjects to turn pages back and forth through the text and allowed them to go to the start and end of the chapter. A table of contents was also provided as the first page of the chapter. This table of contents was equivalent to the hierarchical representation presented in the hypertext map.

Undergraduate subjects with no background in economics read one of the three text formats. Half of the readers were told to find specific information in the text that was distributed across several sections, while the other half of the readers read the text for general knowledge. After reading the text, the subjects answered comprehension questions and wrote an essay on the chapter. Some of the questions required subjects to recall information from the text, while other questions required subjects to use what they had learned from the text and apply it to new examples. This permitted an evaluation of both what they remembered from the text and what they learned from the text.

The surprising results of the study were that very few performance differences existed between the readers of the hypertexts and the linear text. There were no differences between text formats for the amount of time it took subjects to read the text. The subjects' comprehension also was equivalent for the different text formats. There were no differences in the number of macropropositions subjects generated in their essay and no differences in the subjects' scores on the short answer essay and multiple choice test. However, subjects who used the hypertexts did score better on questions that assessed their knowledge of the structure of the text. There were also no significant performance differences between the two types of hypertexts.

Manipulating the goals of the reader did have an effect. Subjects searching for the specific information in the text were faster but showed lower comprehension on the general knowledge questions. However, these measures did not interact with type of text, indicating that different types of text format did not differentially affect the comprehension of readers with different reading goals. Thus, the hypertexts did not help the readers search for the information any more than the linear text.

These results are similar to other studies in finding few comprehension differences between hypertext and linear text. However, in order to understand the lack of differences, it is important to look at readers' strategies. The computer kept a log of each subjects' progress and section choices as they read through the text. This permitted an analysis of the strategies that the subjects used for getting through the text and a characterization of the order that they read the text. Any node or page on which a subjects spent more than five seconds was counted as being read by the subject. Thus, in the linear text, subjects could still make non-coherent transitions by quickly paging forward or backward. The order that the subjects read the text was then compared to a macro-propositional analysis of the Text in order to count the number of coherent transitions made by the subjects. A coherent transition was one that did not violate the macrostructure of the text (i.e., the context of the node they were reading was similar to the context of the node to which they jumped).

For all groups in the study, 80 to 90 percent of the transitions were made in a manner that was coherent with respect to the macrostructure of the text. There were no differences due to text format or readers goals. Although this is not a surprising result for readers of the linear text, it suggests that readers of the hypertext primarily made transitions within the same context and seldom used any features to move them into very different parts of the hypertext. This coherence measure also correlated with the number of macropropositions generated in the subjects' essays indicating that the more coherently the text was read, the better the essay. Thus, the readers' comprehension of the hypertext was better if the reader chose to go through it in a coherent manner. The fact that there were no group differences in the coherent transition measure suggests that the lack of comprehension differences between the three text formats could be due to ceiling effects. All subjects were seeing approximately the same text in about an equivalently coherent manner, and thus their comprehension would not differ greatly. These coherence results therefore indicate that subjects tended to use a very rational approach of getting through the hypertexts, trying to maintain coherence.

Verbal Reports of Hypertext Readers Strategies

The results from Experiment 1 showed that, although there were few differences in comprehension due to the three text formats, the reading strategies used by the subjects were all very similar. This indicates that subjects used heuristics to employ information from the text to guide them through the text in a coherent manner. While Experiment 1 showed evidence of these strategies, the experiment did not provide data that showed the reasons why the subjects used these strategies and what type of information from the text helped them to use these strategies. In the second experiment, verbal protocols (e.g. Ericsson & Simon, 1984) were collected about readers' strategies from six subjects as they read through the hypertext. In this experiment, two of the subjects read through the hypertext with instructions to find specific information, while the other four subjects read for general knowledge. Subjects were instructed to read the texts silently, but to describe out loud everything they were thinking as they read through the texts. If at any point the subjects were silent and were not reading, and also when they made any transition in the hypertext, the experimenter prompted them to describe what they were thinking. The results therefore provide a qualitative measure of subjects reasoning about their navigation strategies.

The verbal protocols indicated that subjects seldom wanted to stray from the hierarchical structure and they expressed interest in reading all the text in one area of the hypertext before moving on to other areas.

I think I'm done in government. I've covered all, covered that. Uhuh, I'm going to go to... how markets solve the basic economic problems. (Subject hg3001)

This strategy has somewhat of the same effect as the depth-first strategy. By reading one part of the hierarchy before moving on to other parts, the subject avoids moving out of the current context at any point. Indeed, if this strategy is applied recursively, the two strategies are equivalent.

Subjects with the general reading goals used reading strategies for covering the whole text in a coherent manner. However, when they made non-coherent transitions, they often expressed confusion and then tried to get back to where they had been previously. On the other hand, the subjects with specific reading goals tried to maintain the context around the specific information they needed to find.

Um, I was mostly looking at the general topics and trying not to skip, go all the way down to the, that in case there might be other topics to the left that were on the same kind of hierarchy, or whatever. I tried to look at, get an overview of those other three before I went over to there, in case they had something to do with that [....] I just think I needed an overview of the little things around it in order to get at the answers to that question. (subject hs3004)

Thus, it was not sufficient to find the relevant information in the text. In order for that information to make sense, the subject needed background context that could be provided by reading the text from the nodes that were around the relevant node. In addition, based on their time-stamped record, when subjects were instructed to read with goal of finding particular pieces of information, rather than using jumps in the hypertext to go down to the specific information, they tended to start at the upper level nodes above the desired information and then worked their way down to the specific information they had to read. They expressed that this would give them the appropriate background context.

The verbal protocols show that subjects used strategies for maintaining the coherence of the text. It is also important to know what information they used to help guide them to use these reading strategies. From both the verbal protocols and the time-stamped record of actions, subjects seemed to rely primarily on the map to guide their strategies. Subjects consulted the map an average of 8 times throughout the session. This indicates that they used the map often to help orient themselves. The subjects' protocols also showed that they relied on the map for guidance.

I'm thinking umm, I'm trying to decide whether to go back to the map of the chapter and work my over actually to keep working my way down the map. (Subject hs3004 )

In addition to the verbalizations about the map, subjects also appeared to use the titles of nodes in order to guide them. One subject identified the three nodes that were under the node titled, "The three functions of government", and decided he had better cover all three functions. Thus, while the map of the hierarchical structure appeared to be the primary piece of information used to guide the subjects, the titles could identify relationships between items in the hierarchy and allow the subject to navigate based on following the labels.

These results corroborate the results found in Experiment 1. The results show that reading a hypertext is not just a reading process, but also a process of problem solving. In order to understand the text, subjects developed heuristics for maintaining coherence with an unfamiliar text domain and text format The verbal protocols showed that the subjects were aware of their strategies to read the information around one area of the hierarchy before moving on to other areas. In this manner, their strategies allowed them to always stay within the same textual context.

Implications for Hypertext and Linear Text

Reading Strategies and Comprehension in Hypertext

Despite the differences in the text formats, in the above experiments, subjects used strategies to navigate the texts in a coherent manner. The similar approaches to reading the text resulted in similar amounts of text comprehension for the different text formats. For navigating the linear text, subjects merely relied on the linear order of the text for following the coherent path. For the hypertexts, subjects relied on additional constraints in the text such as the map and node titles which signaled the structure of the text. These signals served as guides for locating additional nodes that would flow coherently with the previously read nodes.

Manipulating the goals of the reader changed the readers' strategies. Readers with the specific goals saw fewer nodes and pages of the text and read for a shorter amount of time. However, although they were supposed to find information that was in separate places in the hierarchy, there were no differences in the number of coherent transitions made based on the readers' goals. This indicates that even when readers need to find specific information in an unfamiliar text, they still may need to have the appropriate background context. The verbal protocols from Experiment 2 confirmed that although subjects were aware of where the relevant nodes were, they read down to those nodes in order to put the relevant nodes in the appropriate context. In addition, the goal manipulation did not interact with the text format. Thus, even though subjects had different reading goals, they employed similar reading strategies on the different text formats.

The modified version of the hypertext that provided additional coherence showed no real differences from the original hypertext. Subjects may have ignored the added information that was provided to them by the coherent hypertext. However, based on the strategies used by the subjects in Experiment 1 and on the protocols in Experiment 2, the general background context provided by the coherent hypertext appears to be the type of information that the subjects needed to maintain coherence and to provide a context for what they were currently reading. Thus, subjects may have not noticed this information sufficiently or ignored when it appeared. It is also possible that not enough information was provided in these "background context" paragraphs in order to be efficiently used. However, because subjects tended to navigate the coherent hypertext in a coherent manner, this information was also seldom presented to them. Thus, there may not have been enough opportunities for them to use the provided background context to produce significant differences between the hypertexts. Although this research failed to show that the automatic coherence aided the readers, this type of approach may be of use to hypertexts. Given the right type of information in these background context paragraphs and proper training of the readers, readers would not have had to rely as much on following coherent paths. Instead they could used the provided background context to give them the necessary coherence when making jumps out of a particular context.

The strategies used by the subjects show evidence for the dominance of global comprehension in reading. The subjects used a rational approach to reading, maintaining an order of reading that was consistent with the macrostructure of the text. Even when the structure of the text had been modified from its linear form, they chose paths through the text that would flow coherently. This is consistent with the readers' goals to form a coherent macrostructure. Thus, readers of a hypertext are opportunistic. They look for the cues that will lead them to the most coherent path through the text. This behavior is also consistent with rules for coherent reading strategies set out in Goldman and Saul (1990).

With readers using rules to maintain coherence, this means that readers must develop a set of rules and then choose what rules to use for different situations. In this way, reading becomes an application of problem solving. The problem of navigating a hypertext coherently is similar to the problem of search in a problem space in an unfamiliar domain (e.g., Newell, 1980; Newell & Simon, 1972). The reader has a set of possible actions for getting through the text (problem space) and a set of search methods for finding the most effective paths through the text. Since the subjects in this experiment were all expert readers, they had fairly powerful search methods for discovering coherent relationships. However, the subjects were all in an unfamiliar text domain (economics), which put some limits on the ability of the search methods from guaranteeing that they would find the most coherent path.

This approach to navigating through the hypertext is also consistent with investigations of exploratory learning in computer systems, such as the CE+ model of Lewis and Polson (1990). Their model of behavior in unfamiliar computer domains uses a label-following, hill climbing search method in order to choose the relevant actions. In label-following (e.g., Engelbeck, 1986), the user's goals are matched with the possible actions, and an action is chosen through the degree of overlap of terms between the two. The label following heuristic is similar to the methods the hypertext readers used for determining where to make the next coherent transition. For finding specific information, subjects could match their reading goals to the possible nodes they could read to identify relevant nodes of the text and then work their way down to the appropriate nodes. For subjects with general reading goals, they could use the titles as guides for determining the relationships between nodes in order to determine what transitions would be the most coherent.

The Lewis and Polson research suggests that one of the keys to aiding in the problem solving is to design for successful guessing (see also, Norman 1986, 1988). This is a similar necessity for hypertexts. A reader of a hypertext must be able to guess whether following a particular link will both lead toward the relevant information and also be coherent with the current context. In order for a reader to make these successful guesses, this information needs to be evident through the structure of the text and the titles of the nodes.

Text Comprehension Theory and Hypertext

While one of the original goals of this research was to determine the effect of the text's coherence on readers of hypertexts and linear texts, the results suggest that the similar coherence strategies overshadowed the effects of the text formats. In linear text, coherence is a good predictor of comprehension (e.g., van Dijk & Kintsch, 1983; Kintsch & Vipond, 1979). In this research, comprehension was equivalent because the resulting coherence of the text formats were equivalent. Even when modifying the goals of the reader, the text was still read coherently, resulting in few differences in comprehension. Thus, due to a lack of variability in coherence, this research can not draw conclusions about how much the coherence can break down in a hypertext and thereby affect the comprehension.

Readers' strategies indicate the strong role coherence plays in reading a hypertext. In the development of hypertext, the coherence between linked nodes is not often considered. Hypertext guidelines tell creators to link related items, however, there are few guidelines that tell them to make the text of linked nodes cohere. It is typically assumed that because two nodes are linked by some common piece of information, the reader can then generate the correct inferences about the link and incorporate the new information into his or her representation of the text. The results of this study show that subjects avoided the cases of "loosely" linked information (e.g., cross-hierarchical links), and instead primarily made transitions to highly related nodes.

It should be noted that this research examined the strategies for readers who were not familiar with the domain of the text. The results could change greatly with readers who are knowledgeable of the domain. With more background knowledge, readers could perform better problem solving to choose better coherent paths through the text. These readers would have the correct knowledge to make informed guesses on where to go in the text. With the proper knowledge structures, they could also make non-coherent jumps without adverse affects because they would possess the appropriate background knowledge to make the necessary bridging inferences. Thus, although readers with little background knowledge of the text domain, may have to rely on coherent paths in order to read through the text, a knowledgeable reader may be better able to exploit some of the less coherent links in the hypertext.

Implications for the Design of Hypertexts

The results of these studies indicate that text comprehension theory can provide predictions for the design of hypertexts. Based on the theory, the primary design issues are to consider the type of text, how it will be used, and who will be using it. These issues can affect both what the reader can learn from the text and the type of strategies the reader will employ. In the case of these studies, the text was designed primarily for students to study concepts in economics. Students reading the text would have little background knowledge on the topic and would have the goals of either reading through the text or for searching for information in the text. Thus, these results can not be generalized to all hypertexts in all situations.

One of the primary considerations in the design of a hypertext should be to focus on the readers' strategies. These strategies can vary, depending both on the goals of the reader and the background knowledge of the reader. A reader who wants to find specific information in a hypertext needs to have simple methods to locate that information. However having the specific information without the appropriate background context can be useless. Thus, for hypertexts it is not just a matter of getting a reader to the correct place in the text, but also ensuring that the relevant information is represented in a meaningful context. The results from Experiment 2 indicated that subjects finding specific information needed the appropriate background context before they could go ahead and read the specific information.

For readers with extensive background knowledge, text comprehension theory predicts that this meaningful context may not be needed. However, for readers without the proper background knowledge, the additional information will be more valuable. For this reason, it would make sense to insert the appropriate context, or permit the novice reader to read through all the background context in order to get the relevant information. This concept would be somewhat similar to the idea of "guided tours" in hypertexts in which an author provides a single path through the text with the appropriate context (e.g., Marshall & Irish, 1989). It is also similar to the "training wheels" methodology for interfaces which kept novices on the correct path, only allowing them the use of a subset of the available functions (e.g., Carroll & Carrithers, 1984).

In addition to providing an appropriate context to readers, it is important to support familiar reading strategies. Readers will navigate through the text trying to find coherent paths. Thus, readers should be able to identify these paths. This can be done is several ways. Providing a clear structure of the text will aid the reader in finding these paths. Although a large text may have an enormous structure that can not be visualized as a whole, providing local information around the current node will provide both additional context for the reader and additional information as to what nodes will produce a coherent representation.

Along with having a clear structure, providing good labels for the nodes will also aid readers. Since novice readers tend to use a label-following strategy, labels that clearly indicate the role of a particular node, will help the reader in successful guessing of the appropriate coherent path (e.g., Lewis & Polson, 1990). Support for successful guessing in a hypertext should be similar to that of using a standard rhetorical structure in linear text. Readers who are familiar with the structure will be able to exploit it to their advantage. Those who are unfamiliar with it will have to expend cognitive effort in problem solving at the expense of the primary task of comprehending the information from the text.

One of the common tasks in developing hypertext is converting a linear text into a hypertext. While changing a text to a hypertext may provide a lot more flexibility to the readers, these conversions typically concentrate on restructuring the information rather than making large scale changes to the text. A converted linear text may still contain many of the ordering signals that were in the original text. These signals could include such text factors as enumeration and the mention of information that occurred previously in the linear text. These factors may cause the text to be read in the same linear fashion as the original text. The text may also have to be modified in order to improve its coherence. Restructuring a text will cause more possible ways to get to any particular node. The text in that node will have to be written in such a way that a jump into that node will be coherent. Thus, the conversion of a linear text to a well-written hypertext can involve an extensive amount of modification of the original text in order for the text to be more appropriate to being in a hypertext format.

One way around the problem of structuring when converting a text to hypertext is to keep the text in the linear form. SuperBook (Egan, et al., 1989, Landauer, et al., 1992) uses such an approach. The text is kept in its original form, but also enhanced through a variety of features to facilitate navigation. These features include a dynamic table of contents, a fisheye view of the current context, and search which posts the hits against the table of contents. This permits readers to get to the relevant places in the text efficiently but also permits them to read the text in the linear order.

One of the current difficulties in developing a hypertext is that hypertext writing environments are still in their infancy. Improved environments will help a writer see the constraints of the text as the text is developed. Such features will permit the writer to be better able to consider the coherence of the text from node and how the text should best be structured.

Thus, developing a hypertext is not just a matter of developing a text with all relevant interconnections. Instead, an effective hypertext must be based on a coherent structure. This structure must present an interface to the reader that will make the reader's strategies simple to be carried out.

Future Issues for Text and Hypertext Research

The studies described above show the importance that readers place on maintaining the coherence of the text as they navigate a hypertext, however there remain many research issues that still must be explored in hypertexts. These results, and many other studies indicate that hypertexts, as they are currently implemented, are not suitable for all reading applications.

While this research sought to identify the effect of differences in the amount of coherence in the hypertext and the linear text, the strategies of the readers created equivalent amounts of coherent transitions. Not all hypertexts would permit subjects to perform as many coherent transitions as the hypertext used in this research. Thus, it would be instrumental to evaluate hypertexts which may cause a greater amount of non-coherent transitions in order to determine how much the coherence affects the comprehension. An alternative is to examine hypertexts that have large amounts of coherence. Hypertexts based on rhetorical models, such as systems for argumentation, enforce the coherence to a large extent. An analysis of reading comprehension on such systems may find that they produce larger amounts of comprehension and also provide a more appropriate representation of arguments than the equivalent linear version.

Modeling Text and Hypertext Readability

Models of text comprehension (e.g., van Dijk & Kintsch, 1983) can be applied to hypertext as well as to linear text. In both cases the key is to be able to measure the appropriateness of the text's structures and the resulting human's representation of those structures. The primary difference between the current text comprehension models for linear text and those for the hypertext is that a model for hypertext must also take into account the reader's behavior as he or she navigates through the text. In this way, the model must take into account more than just the text, but also the reader's strategies for dealing with the computer interface (e.g., Polson & Lewis, 1990). A model of hypertext comprehension must consider both the information the reader gains from the text and how that information can affect the readers' choice of strategies for proceeding through the text. Thus, this involves both an understanding of the constraints of the text and also of the constraints of the user interface. Given a successful model of these factors, we can generate better guidelines on how to develop more readable hypertexts which support more successful reading strategies.

Where can Hypertext Succeed?

From many of the studies in the field of hypertext, hypertexts have often performed worse, or just as well as the equivalent linear text. This raises the question: What are the areas where a hypertext will be more successful than a linear text, and what sorts of features will the hypertext need for this success?

One of the remaining problems of paper texts is still the fact that their indexes are inadequate and unwieldy to use. A current success for hypertext is as a search engine. Although this blurs the distinction between information retrieval systems and hypertexts, the ability to search in through a text simplifies many tasks. Hypertexts, such as SuperBook, permit readers to find the relevant information in an efficient manner. Although current computer indexing methods still leave much room for improvement (e.g., Dumais, 1988), the ability to move from one piece of relevant information to another can not be matched in equivalent paper versions of large amounts of text.

A second area where there is a great deal of potential for hypertext systems is for representing textual information that is not as easily presented in linear form. Such domains as legal argumentation and design knowledge have such problems. Any piece of textual information can have many related arguments and counter-arguments. These can be represented graphically with links between the arguments. Because the arguments are designed to follow each other, the links are constrained to be coherent. Thus the reader of the hypertext may follow the links and still get a coherent representation of the information. While these systems may provide an advantage to the reader, they do however, put a large load on the writer. Unlike preparing a single coherent set of arguments, the writer must determine the relationship between all pieces of information and structure them in a much more complicated manner than the linear equivalent. In the future, better computerized methods based on models of the text may be able to do some of this structuring, simplifying the task of the writer.

A third potential area for success in hypertexts is suggested by the investigation in this research of providing dynamic coherence to the hypertext. The goal of dynamic coherence is to provide additional information to the reader so that the text automatically contains information that is more appropriate to the reader's representation of the text. Readers of a text will bring large differences in reading abilities and background knowledge to a text. Through modeling the reader's knowledge, abilities, and goals, hypertexts could be developed that perform dynamic structuring of the text. These hypertexts could include additional background knowledge to readers with little knowledge and make the text more coherent for readers with low reading abilities. For readers with general reading goals or trying to get an overview, the text could be presented as a single path through the text. For readers trying to find specific information, the text could provide search capabilities and appropriate background context of any relevant items.

While the concept of personalized text sounds like a wonderful idea, accurately gauging the readers background knowledge, abilities and goals can be difficult. A lot of this will depend on being able to develop appropriate user models. These models would need to couple information about what is contained in the text, what information a reader already knows, and what information the reader needs to know.

Success for hypertexts lies in exploiting the powers of both the computer and the writer to generate better personalized texts. However, in order to do this, we must first have accurate models of such factors as: what a reader knows, what a reader needs, what methods a reader can use, and what information is contained in the text. Thus, the future of hypertext will depend on improving both models of the user and models of the text.


The author thanks Walter Kintsch, Peter Polson, Anders Ericsson, Reid Hastie, and Gerhard Fischer for their advice and comments on this project. He also thanks Adrienne Lee, Thomas Landauer, Susan Dumais, and Dennis Egan for their help. Support was provided by grants from the Army Research Institute ARI project MDA 903-86-CO143, the National Institute of Health and Bell Communications Research.


Begoray, J. A. (1990). An introduction to hypermedia issues, systems and application areas. International Journal of Man-Machine Studies, 33, 121-147.

Black, J. B., & Bower, G. H. (1979). Episodes as chunks in narrative memory. Journal of Verbal Learning and Verbal Behavior, 18, 309-318.

Bolter, J. D. (1991). Writing Space: The computer, hypertext, and the history of writing. Hillsdale, NJ: Lawrence Erlbaum.

Botafogo, R. A., & Shneiderman, B. (1991). Identifying aggregates in hypertext structures. In Proceedings of Hypertext '91, (pp. 63-74). San Antonio, TX: ACM.

Bransford, J., & Johnson, M. K. (1972). Contextual prerequisites for understanding: Some investigations of comprehension and recall. Journal of Verbal Learning and Verbal Behavior, 11, 717-726.

Bransford, J. D., & Franks, J. J. (1971). The abstraction of linguistic ideas. Cognitive Psychology, 2, 331-350.

Britton, B. K., & Glynn, S. M. (1989). Computer Writing Environments: Theory, Research, and Design. Hillsdale, NJ: Lawrence Erlbaum.

Britton, B. K., & Gulgoz, S. (1991). Using Kintsch's computational model to improve instructional text: Effects of repairing inference calls on recall and cognitive structures. Journal of Educational Psychology, 83, 329-345.

Britton, B. K., Meyer, B. J., Hodge, M. H., & Glynn, S. M. (1980). Effects of the organization of text on memory: Tests of retrieval and response criterion hypotheses. Journal of Experimental Psychology: Human Learning and Memory, 6(5), 620-629.

Card, S., Moran, T. P., & Newell, A. (1983). The Psychology of Human-Computer Interaction. Hillsdale, NJ: Lawrence Erlbaum.

Carroll, J. M., & Carrithers, C. (1984). Training wheels in a user interface. Communications of the ACM, 27(8), 800-806.

Charney, D. (1987). Comprehending non-linear text: The role of discourse cues and reading strategies. In Proceedings of Hypertext '87, (pp. 109-120). Chapel Hill, NC: ACM.

Charney, D. (1994). The impact of hypertext on processes of reading and writing. In S. J. Hilligoss and C.L. Selfe (Eds.) Literacy and Computers.. New York: MLA.

Conklin, J., & Begeman, M. L. (1989). gIBIS: A tool for all reasons. Journal of the American Society for Information Science, 40(3), 200-213.

Croft, W. B., & Turtle, H. (1989). A retrieval model incorporating hypertext links. In Proceedings of Hypertext '89, (pp. 213-224). Pittsburgh: ACM.

Crouch, D. B., Crouch, C. J., & Andreas, G. (1989). The use of cluster hierarchies in hypertext information retrieval. In Proceedings of Hypertext '89, (pp. 225-238). Pittsburgh: ACM.

Dillon, A. (1991). Readers' models of text structures: The case of academic articles. International Journal of Man-Machine Studies, 35, 913-925.

Doane, S., Kintsch, W., & Polson, P. G. (1990). Action planning: Producing UNIX commands. In Proceedings of the 11th annual meeting of the Cognitive Science Society, 7 (pp. 30-57). Ann Arbor, Michigan: Erlbaum.

Dumais, S. T. (1988). Textual information retrieval. In M. Helander (Eds.), Handbook of Human-Computer Interaction. (pp. 673-700). New York: Elsevier.

Egan, D. E., Remde, J. R., Gomez, L. M., Landauer, T. K., Eberhardt, J., & Lochbaum, C. C. (1989). Formative design-evaluation of 'SuperBook'. Transactions on Information Systems, 7(1), 30-57.

Engelbeck, G. E. (1986) Exceptions to generalizations: Implications for formal models of human-computer interaction. Unpublished master's thesis, University of Colorado, Department of Psychology, Boulder, Co.

Ericsson, K. A., & Simon, H. A. (1984). Protocol analysis: Verbal reports as data. Cambridge, MA: MIT Press.

Fincher-Kiefer, R., Post, T. A., Greene, T. R., & Voss, J. F. (1988). On the role of prior knowledge and task demands in the processing of text. Journal of Memory and Language, 27, 416-428.

Fischer, G., McCall, R., & Morch, A. (1989). JANUS: Integrating hypertext with a knowledge-based design environment. In Proceedings of Hypertext '89, (pp. 105-118). Pittsburgh, PA: ACM.

Fodor, J. D., Fodor, J. A., & Garrett, M. F. (1975). The psychological unreality of semantic representations. Linguistic Inquiry, 6, 515-531.

Foltz, P. W. (1993). Readers' comprehension and strategies in linear text an hypertext. Unpublished Doctoral dissertation. University of Colorado, Boulder.

Furuta, R., & Stotts, P. D. (1989). Programming browsing semantics in Trellis. In Proceedings of Hypertext '89, (pp. 27-42). Pittsburgh, PA: ACM.

Goldman, S. R., & Saul, E. U. (1990). Flexibility in text processing: A strategy competition model. Learning and Individual Differences, 2(2), 181-219.

Gould, J. D., Alfaro, L., Fonn, R., Haupt, B., Minuto, A., & Salaun, J. (1987). Why reading was slower from CRT displays than from paper. In Proceedings of the ACM CHI + GI '87, (pp. 7-11). Toronto, Canada: ACM.

Gould, J. D., & Grischkowsky, N. (1984). Doing the same work with hard copy and with cathode ray tube (CRT) computer terminals. Human Factors, 26, 296-300.

Hammond, N., & Allinson, L. (1989). Extending hypertext for learning: An investigation of access and guidance tools. In People and Computers V, Nottingham, UK.

Hardman, L., & Sharrat, B. (1989). User-Centered hypertext design: The application of HCI design principles and guidelines. In Proceedings of Hypertext 2, York, UK

Kieras, D., & Polson, P. G. (1985). An approach to the formal analysis of user complexity. International Journal of Man-Machine Studies, 22, 365-394.

Kieras, D. E. (1981). The role of major referents and sentence topic in the construction of passage macrostructure. Discourse Processes, 4, 1-15.

Kintsch, W. (1974). The representation of meaning in memory. Hillsdale, NJ: Erlbaum.

Kintsch, W. (1988). The use of knowledge in discourse processing: A construction-integration model. Psychological Review, 95, 363-394.

Kintsch, W., & Keenan, J. M. (1973). Reading rate and retention as a function of the number of propositions in base structure of sentences. Cognitive Psychology, 5, 257-274.

Kintsch, W., & van Dijk, T. A. (1978). Toward a model of text comprehension and production. Psychological Review, 85, 363-394.

Kintsch, W., & Vipond, D. (1979). Reading comprehension and readability in educational practice and psychological theory. In L. G. Nilsson (Eds.), Perspectives on Memory Research. Hillsdale, NJ: Erlbaum.

Landauer, T. K., Egan, D., Remde, J., Lesk, M. J., Lochbaum, C. C., & Ketchum, D. (1993). Enhancing the usability of text through computer delivery and formative evaluation: The SuperBook project. In C. McKnight, A. Dillon & J. Richardson, (Eds.), Hypertext: A Psychological Perspective. New York: E. Horwood.

Landow, G. P. (1989). Hypertext in literary education, criticism, and scholarship. Computers and the Humanities, 23, 173-198.

Lewis, C. & Polson, P. G. (1990). Theory-based design for easily learned interfaces. HCI, 5, 191-220.

Lorch, R. F., & Chen, A. H. (1986). Effects of number signals on reading and recall. Journal of Educational Psychology, 78, 263-279.

Mannes, S., & Kintsch, W. (1991). Routine computing tasks: Planning as understanding. Cognitive Science, 15, 305-342.

Marchionini, G., & Shneiderman, B. (1987). Finding facts vs. browsing knowledge in hypertext systems. IEEE Computer, 21(1), 70-80.

Marshal, C. C., & Irish, P. M. (1989). Guided tours and on-line presentations: How authors make existing hypertext intelligible for readers. In Proceedings of Hypertext '89, (pp. 15-26). Pittsburgh, PA: ACM.

Martin, J. (1990). Hyperdocuments and How to Create Them. Englewood Cliffs, NJ: Prentice-Hall.

Meyer, B. J. F. (1973). The organization of prose and its effects on memory. Amsterdam: North Holland.

Meyer, B. J. F., Brandt, D. M., & Bluth, G. J. (1980). Use of top-level structure in text: Key for reading comprehension in ninth-grade students. Reading Research Quarterly, 16, 72-103.

Miller, J. R., & Kintsch, W. (1980). Readability and recall of short prose passages: A theoretical analysis. Journal of Experimental Psychology: Human Learning and Memory, 6(4), 335-354.

Monk, A. F., Walsh, P., & Dix, A. J. (1988). A comparison of hypertext, scrolling, and folding as mechanisms for program browsing. In D. M. J. &. R. Winder (Eds.), People and Computers IV. (pp. 421-435). Cambridge: Cambridge University Press.

Nelson, T. (1967). Getting it out of our system. In G. Schechter (Eds.), Information Retrieval: A critical review. Wash., D.C.: Thompson Books.

Newell, A. (1980). Reasoning, problem solving and decision processes: The problem space as a fundamental category. In R. Nickerson (Eds.), Attention and performance VIII. (pp. 693-718). Hillsdale, NJ: Lawrence Erlbaum.

Newell, A., & Simon, H. A. (1972). Human problem solving. Englewood Cliffs, NJ: Prentice-Hall.

Nielsen, J. (1989). The matters that really matter for hypertext usability. In Proceedings of Hypertext '89, (pp. 239-248). Pittsburgh, PA: ACM.

Nielsen, J. (1990). Hypertext and Hypermedia. San Diego, CA: Academic Press.

Norman, D. A. (1986). Cognitive engineering. In D. A. Norman & S. W. Draper (Eds.), User centered system design: New perspectives in human-computer interaction. (pp. 31-61). Hillsdale, NJ: Lawrence Erlbaum.

Norman, D. A. (1988). The psychology of everyday things. New York: Basic Books.

Perfetti, C. A., & Goldman, S. R. (1974). Thematization of sentence retrieval. Journal of Verbal Learning and Verbal Behavior, 13, 70-79.

Perfetti, C. A., & Roth, S. (1981). Some of the interactive processes in reading and their role in reading skill. In A. Lesgold & C. Perfetti (Eds.), Interactive processes in reading. Hillsdale, NJ: Lawrence Erlbaum.

Poulsen, D., Kintsch, E., Kintsch, W., & Premack, D. (1979). Children's comprehension and memory for stories. Journal of Experimental Child Psychology, 28, 379-403.

Shneiderman, B. (1987). User interface design and evaluation for an electronic encyclopedia. In G. Salvendy (Eds.), Cognitive Engineering in the Design of Human-Computer Interaction and Expert Systems. (pp. 207-223). New York: Elsevier.

Spilich, G. J., Vesonder, G. T., Chiesi, H. L., & Voss, J. F. (1979). Text processing of domain-related information for individuals with high and low domain knowledge. Journal of Verbal Learning and Verbal Behavior, 18, 275-290.

Spiro, R. J., & Jehng, J. C.. (1990). Cognitive Flexibility and hypertext: Theory and technology for the nonlinear and multidimensional traversal of complex subject matter. In D. Nix and R. J. Spiro (Eds.) Cognition, Education and Multimedia: Exploring Ideas in High Technology. Hillsdale, NJ: Lawrence Erlbaum

van Dijk, T. A., & Kintsch, W. (1983). Strategies of Discourse Comprehension. New York: Academic Press.

Voss, J. F., Vesonder, G. T., & Spilich, G. J. (1980). Text generation and recall by high-knowledge and low-knowledge subjects. Journal of Verbal Learning and Verbal Behavior, 19, 651-667.

Weyer, S. A. (1982). The design of a dynamic book for information search. International Journal of Man-Machine Studies, 17, 87-107.

[1] "For the short term, the best recommendation probably is to pay close attention to the authoring principles implicit in other writers' hypertext and try to emulate the principles you like." (Nielsen, 1990, p. 164)