- Open Access
Characteristics of Finnish and Swedish intensive care nursing narratives: a comparative analysis to support the development of clinical language technologies
© Allvin et al; licensee BioMed Central Ltd. 2011
- Published: 14 July 2011
Free text is helpful for entering information into electronic health records, but reusing it is a challenge. The need for language technology for processing Finnish and Swedish healthcare text is therefore evident; however, Finnish and Swedish are linguistically very dissimilar. In this paper we present a comparison of characteristics in Finnish and Swedish free-text nursing narratives from intensive care. This creates a framework for characterising and comparing clinical text and lays the groundwork for developing clinical language technologies.
Our material included daily nursing narratives from one intensive care unit in Finland and one in Sweden. Inclusion criteria for patients were an inpatient period of least five days and an age of at least 16 years. We performed a comparative analysis as part of a collaborative effort between Finnish- and Swedish-speaking healthcare and language technology professionals that included both qualitative and quantitative aspects. The qualitative analysis addressed the content and structure of three average-sized health records from each country. In the quantitative analysis 514 Finnish and 379 Swedish health records were studied using various language technology tools.
Although the two languages are not closely related, nursing narratives in Finland and Sweden had many properties in common. Both made use of specialised jargon and their content was very similar. However, many of these characteristics were challenging regarding development of language technology to support producing and using clinical documentation.
The way Finnish and Swedish intensive care nursing was documented, was not country or language dependent, but shared a common context, principles and structural features and even similar vocabulary elements. Technology solutions are therefore likely to be applicable to a wider range of natural languages, but they need linguistic tailoring.
The Finnish and Swedish data can be found at: http://www.dsv.su.se/hexanord/data/.
- Technology Acceptance Model
- Clinical Information System
- Language Technology
- Swedish Data
- Content Accessibility
The term clinical text stands for textual documents that are produced for clinical work which are often saved in clinical information systems [1, 2]. The primary purpose of clinical text is to serve patient care as a summary or hand-over note, but clinical texts are also written to fulfil legal requirements and for purposes of reimbursement, management and research. The author can be a physician, nurse, therapist, specialist, or other clinician responsible for patient care. The text may have been entered into the system in real time, in retrospect, or as a summary made by the bedside or elsewhere, by the author or by a secretary who transcribes a dictation, by a speech recognition software, or by another system that generates or synthesises text. Clinical text applies to texts documenting the entire care process, and the actual content may differ substantially depending on the purpose – for example, describing the patient’s socio-medical history and current health problems as opposed to detailing care plans or even evaluating care outcomes. Synonyms or related terms include case sheets, clinical data, clinical free text, clinical notes, clinical records, clinical reports, computer based patient records, digital patient records, discharge letters, discharge reports, discharge summaries, electronic health records, electronic patient records, health records, health reports, health text, medical records, medical reports, nursing discharge notes, nursing narratives, nursing notes, patient records, and patient’s chart.
In several countries clinical documents are regulated by law and standardised via national or international models. In Finland, the legislation  stipulates that to ensure good care, clinical documents must cover all necessary information and the documents must adequately detail the patient’s conditions, care, and recovery. The text in the documents must be explicit, comprehensive, and include only generally well-known, accepted concepts and abbreviations. Swedish legislation has a similar approach .
In both Finland and Sweden, there are national models for nursing narratives, that is, clinical text written by nurses. Both models originate in the care process of gathering information from the patient, setting goals for care, implementing nursing interventions, and evaluating the outcomes of care. In Finland, a national standardised documentation model has been implemented that is based on the Finnish care classification (assessment, interventions, and outcomes of care) . In Sweden, there is the VIPS (an acronym for the Swedish words for wellbeing, integrity, prevention, and security) model, which provides a structure for the documentation process with key words that reflect the nursing process .
In this paper we explore and compare the content and linguistic characteristics of nursing narratives from intensive care units (ICUs) with similar care systems but very different languages. Our analysis aims to support the development of clinical language technologies. The analysis is based on the technology acceptance model  with the hypothesis that perceived usefulness and ease-of-use are indicators of technology use. The analysis includes both a qualitative and a quantitative approach. The qualitative approach addresses document/technology usefulness by exploring the document content (i.e., what, when, why, from whom, to whom) and ease-of-use by analysing understandability and content accessibility. We extended this via the quantitative approach to problems in document accessibility and understandability. We performed the analysis with Finnish and Swedish data because of the differences between the two languages, but similarities between the two countries regarding healthcare and culture. We focused on ICUs – hospital units that provide 24/7 care for critically ill patients and focus on conditions that are life-threatening and require comprehensive care and constant monitoring – because of the similarity in ICU clinical decision-making processes between different nations and between different languages .
The criteria for intensive care admission, discharge, and triage are well defined in international guidelines [9, 10] which standardises clinical decision-making processes in different ICUs. We used daily nursing narratives for the analysis because they cover the entire inpatient period.
Our materials included daily nursing narratives from a Finnish and a Swedish ICU in university-affiliated hospitals . Our inclusion criteria for patients were an ICU inpatient period of at least five days and an age of at least 16 years. Finnish (Swedish) health records were written between January 2005 and August 2006 (January 2006 and May 2008). Our research was approved by ethics committees in both countries (Ethics Committee of the Hospital District of South West Finland, 2/2009 §66 and the Ethics Committee in Stockholm, 2009/1742-31/5).
We analysed the materials by using content analysis, a widely used method for textual data which consists of systematic content coding with the aim of identifying themes and patterns in the data; the words and phrases mentioned most often are seen as those reflecting important concerns in communication [12–14]. We considered the daily nursing narratives as categorised data in which the content labels of the analysis correspond to the content headings written by the nurses. We compared these labels and contents with the aim of understanding their frequencies, contextual use, clarity, and relationships (e.g., parallel headings, synonymous concepts, negated concepts, subject-object roles, time order). Looking at the vocabulary and n-grams of different sizes generated from the whole data set, we explored the richness and expressive variation in the language and analysed the extent to which this posed a problem for the current context of the data set.
The analysis included both a qualitative and a quantitative approach. In the qualitative approach three average-sized health records from each data set (an average size of 2,389 and 5,169 words for Finland and Sweden, respectively) were used. The analysis was performed manually by three native Finnish speakers fluent in Swedish and two Swedish native speakers, four of whom are licensed healthcare professionals. The quantitative approach used 514 Finnish and 379 Swedish health records. For the Finnish data, we used the FinTWOL morphological analyser with the FinCG disambiguator, and for the Swedish data we used the GTA, Granska Text Analyzer. When FinCG produced multiple alternatives (e.g., haavan [wound’s] → haapa [aspen] and haava [wound]) caused by highly inflective Finnish, we reduced the chances for sparse data by choosing only one alternative. The analysis was performed semi-automatically by a native Finnish speaker and a native Swedish speaker, both experts in clinical language technology development.
Special structural and contextual features of ICU nursing narratives.
Structural & contextual features
Finland (Finnish examples)
Sweden (Swedish examples)
Headings are used in 2 of 3 health records. Headings are used as subjects and subjects are missing.
Diuresis: occasionally profuse.
(Diureesi: ajoittain runsasta.)
Pupils move under eyelids but does not open eyes.
(Pupillit liikkuvat luomien alla, mutta ei avaa silmiään.)
Headings are used in all narratives. The structure of headings seems to be obligatory. The headings are used as subjects.
Circulation: Stable with inotropy.
(Cirkulation: Stabil med inotropi.)
Reacts only to pain stimulation during suctioning of intubation tube.
(Reagerar enbart vid smärtstimuli vid sugning i tuben.)
Present and past participles are typical but be , is and are are not used. The most common tense is perfect.
Consciousness remained unchanged.
(Tajunta pysynyt ennallaan.)
Blood pressure low.
Present and past participles are typical but be , is and are are not used. The most common tense is perfect.
Breathing: Ventilator parameters unchanged.
(Andning: Ventilator parametrarna oförändrade.)
Structure of sentences
Complete sentences are rare.
No spontaneous movements, rigidifies.
(Ei spontaania liikettä, jäykistelee.)
Complete sentences are rare.
Light sedation, looks up now and then.
(Lätt sederad, tittar upp ibland.).
Misspellings exist but the meaning is clear.
Misspellings exist but the meaning is clear.
The motther is informed.
(Mammman är informered.)
Subjects (a patient)
The word patient as a subject is infrequently mentioned. If this word is mentioned it is not abbreviated.
Oxygenates well or ventilates well.
(Happeutuu hyvin tai ventiloituu hyvin.)
The word patient is used more often as a subject or object than in Finnish narratives. It is also replaced with abbreviations Pat or Pt . Use of patient was 40 % more common than she / he .
Patient got a percutanous tracheostomy today.
(Patienten har fått en perkutan trakeostomi idag.)
Very worried about patient’s condition.
(Mycket oroliga över patientens tillstånd.)
Pt. wakes up when talked to and appears to be oriented.
(Pt. vakner på tilltal och upplevs som adekvat.)
Signs and abbreviations
Signs and abbreviations are common. They originate from Finnish, Swedish, English, Latin, or professional jargon.
The height of the drain rose from 10 -->20 mmHg.
(Dreneerausrajaa nostettu 10 -->20 mmHg.)
Got medicine -->good response.
(Sai lääkettä -->hyvä vaste.)
Signs and abbreviations are common. They originate from Swedish, English, Latin, or professional jargon.
(em. [eftermiddag, afternoon])
CVP [Central Venous Pressure]
EN [Enteral Nutrition]
TPN [Total Parenteral Nutrition]
pO2 [partial pressure of oxygen]
pCO2 [partial pressure of carbon dioxide]
MAP [Mean Arterial Pressure].
From the perspective of ease-of-use, analysts with ICU expertise considered the narratives to be clear and easy to understand. However, ICU-specific nonstandard abbreviations and acronyms were prevalent and some of them were unclear to analysts with less domain expertise. Consequently, narratives were difficult to understand for persons not working in specialised health care, especially for the patients and their relatives.
Using the documents was facilitated by content headings. Headings were used similarly in Finland and Sweden. Usually the content matched its heading; for example, Consciousness: Unchanged. Drain liquid brighter than yesterday. In the Swedish data, content headings were obligatory and nurses selected them from a pre-defined list. They wrote their observations under the heading that was the closest match; for example, they wrote body temperature under the heading circulation , and level of sedation under the heading sleep. In the Finnish data, reference resolution complicated content accessibility; nurses wrote headings freely and there were consequently numerous synonyms and closely related concepts; for example, haemodynamics – blood pressure – pulse . In addition, parts of the Finnish narratives were without headings. In that case, nurses either wrote their narratives in a story format with a clear plot or they started their notes with a word which can be considered as a heading (e.g. Diuresis occasionally profuse, Therapeutic hypothermia still ongoing or Haemodynamic variation).
In addition to abbreviated words and problems with headings, reference resolution in the vocabulary as well as numerous linguistic and grammatical mistakes made using the documents difficult. For example, automated text analysis and reasoning seemed problematic with these data, with almost all sentences having no subject and approximately half of the sentences containing no verbs. The missing subject or object was usually the patient or clinician.
Quantitative comparison of ICU nursing narratives.
514 (496 unique patients)
379 (333 unique patients)
Daily documents (i.e., daily notes about a patient)
5,915 (17,103 shifts)
Types (i.e., unique tokens) before/after FinCG/GTA
63,328 / 38,649
– / 41,883
Tokens per patient / Tokens per daily document
540 / 0
92 / 5
14,118 / 915
36,830 / 9,389
2,389 / 208
5,169 / 417
1,635 / 87
5,271 / 239
The number of bigrams
368,166 (275,205 after FinCG)
469,455 (344,127 after GTA)
The number of trigrams
745,407 (356,307 after FinCG)
1,064,944 (905,539 after GTA)
Proportion of pronouns after FinCG/GTA
Proportion of nouns after FinCG/GTA
Proportion of verbs after FinCG/GTA
The most common unigrams, bigrams and trigrams.
The most common unigrams after FinCG/GTA
The most common bigrams after FinCG/GTA
[night time, misspelled]
[to suction (liquid)]
The most common trigrams after FinCG/GTA
hapettua ja tuulettua
i samband med
[oxygenate and ventilate]
[in connection with]
ja tuulettua hyvin
slem att suga
[and ventilate well]
[to suction secretions]
ei yhteyden#otto yö
munhåla och svalg
[no contact night]
[oral cavity and throat]
ei olla tarvita
med god effect
[have no need]
[with good effect]
yhteyden#otto yö aika
att suga i
[contact night time]
[to suction in]
The most common pronouns, nouns and verbs.
The most common pronouns after FinCG/GTA
The most common nouns after FinCG/GTA
ml [ml, abbr]
The most common verbs after FinCG/GTA
gå [walk, leave]
The most common topics.
Approximate number of occurrences of each topic
Number of occurrences of each topic
The relations of the most common topics
To illustrate the need for domain-tailored technologies and resources, FinCG did not recognise 36 percent of the Finnish data (including punctuation). By tailoring the FinCG disambiguator with approximately 3,500 of the most common ICU terms, the method applicability improved substantially (see  and the references therein). The GTA handles unknown words differently than FinCG, but by comparing the ICU words with a general Swedish language corpus (PAROLE ), we estimated that 69 percent of the types were domain specific and thereby the need for domain-tailored methods was justified. Tailoring processes are likely to be similar for different languages and countries; words that were used for all patients and in all daily documents were very similar in both Finnish and Swedish data sets. These included the most common headings, temporal expressions, negations, and changes in observed patient state (e.g. increase, continue, begin). In these processes, which connect healthcare service providers, academic researchers, and commercial language and information systems providers, ensuring patient confidentiality is essential; the amount of protected health information was equal in the two data sets (1.5 person names per thousand words).
The most frequent tokens and types in a subset of the Finnish and Swedish data have been made publicly available .
In this paper we have presented a collaborative comparison of the content and linguistic characteristics in Finnish and Swedish nursing narratives taken from two national ICUs. There is a strong belief that capturing the clinical knowledge in such large-scale data sets could lead to improved safety and quality of care, promotion of clinical research and development of better language technology. However, although free text is helpful for entering information into clinical information systems, the complexity, variation and ambiguity of human languages make effective knowledge mining difficult.
Our results show that nonstandard headings, abbreviations, acronyms, and terminology complicate content accessibility. Similar results have been published for clinical text from US hospitals [20, 21], from Finnish surgical, neurological, maternity and paediatric wards , from a medical-surgical ward in Thailand , and from Norwegian medical and cardiopulmonary units . In addition, our results demonstrate that unclear and difficult-to-understand contents give rise to problems regarding document usefulness and ease-of-use. Previous studies have shown that both clinicians and patients have difficulties in interpreting clinical text, in particular abbreviations, medical terms and other professional jargon, and clinical reasoning [25, 11]. Finally, the differences between general languages and domain jargon have been discussed in general (computational) linguistics studies, and it has been shown that the language of different specific domains or genres exhibits a high degree of linguistic variation [26, 27].
The use of clinical text and knowledge mining can be supported by developing domain-tailored language technologies and resources that improve referential coherence in headings and vocabulary. International data standards, documentation models, and other standardisation resources include, for example, the HL7 Health Level Seven International Standards , NANDA Nursing Diagnostic Terminology , and SNOMED CT Systematized Nomenclature of Medicine – Clinical Terms . As examples of technologies, we refer the reader to software for linguistic and grammatical proofing (e.g. domain-tailored FinCG [17, 31]) and Clinical Finnish Parser , and methods for assigning headings automatically [17, 33, 34]. As examples of studies discussing the potential of language technologies to improve the clarity, understandability, and accessibility of clinical text for other languages, we refer the reader to studies  and  on English health sciences literature and clinical text, respectively.
However, the majority of content analyses and language technologies for clinical text consider only a monolingual level and do not compare other languages or countries with one another. Our paper explores and compares ICU nursing narratives in Finland and Sweden in both the Finnish and Swedish languages. Although the two languages are not closely related, nursing narratives in both languages have many characteristics in common, including similar content, structural features, and similar elements of vocabulary. We believe that this has implications for the design and development of common language technology solutions that support producing and using healthcare documentation in a better and more effective manner than is the case today. These common characteristics can also be interpreted as additional support for the similarities in clinical decision-making in ICUs (see ). To our knowledge, the 2007 study  is the only other paper comparing clinical text at a cross-lingual level (English, Japanese, Russian, Swedish) other than the conference version  of this paper.
Our study was limited to health records from only one ICU in each country, and these ICUs represented the highest level of intensive care. This may pose a problem regarding the representativeness of the data. The results of our study are not generalisable per se, but can be considered in Finnish and Swedish ICUs with similar care levels. Since there were many similarities between the Finnish and the Swedish ICUs, it is unlikely that different units with similar care levels within the countries have large differences. Finland and Sweden are closely related culturally but not linguistically. The cultural closeness might have affected the fact that the two different sets of text also seemed to be very similar in content and style.
The work presented in this paper represents merely a starting point and should be extended to other ICUs, clinics, languages, and countries. These extensions will enable us to analyse similarities and differences in clinical texts in a systematic way. We are also planning to carry out a more in-depth quantitative analysis by syntactic parsing of both sets of text. Moreover, we will study how to identify, normalise, and correct abbreviations and misspellings automatically by using various distance measures and concept-management techniques. We will also address the similarities and differences in clinical text written by various professional groups and at other hospital wards and healthcare units. Finally, we are eager to seek possibilities to incorporate laypeople’s information needs, and their interaction with healthcare providers, in our study.
In our study the way Finnish and Swedish intensive care nursing was documented was not country or language dependent, but shared several common contexts, principles, structural features and even similar vocabulary elements. For example, both Finnish and Swedish data showed a lack of subjects and a substantial amount of non-standard abbreviations. We are therefore convinced that language technology solutions are likely to be applicable to a wider range of natural languages and to be very useful in the clinical setting. However, the technologies still need linguistic tailoring, and for wider applicability, multi-lingual analyses are needed. The framework we have introduced for analysing and comparing clinical text is practical and applicable for similar studies.
We gratefully acknowledge Nordforsk and the Nordic Council of Ministers for the funding of our research network HEXAnord – HEalth teXt Analysis network in the Nordic and Baltic countries. We also thank NICTA – funded by the Australian Government as represented by the Department of Broadband, Communications and the Digital Economy, and the Australian Research Council through the ICT Centre of Excellence program, the Academy of Finland (decision 136653), and the Department of Information Technology and TUCS, University of Turku, Finland.
This article has been published as part of Journal of Biomedical Semantics Volume 2 Supplement 2, 2011: Proceedings of the Second Louhi Workshop on Text and Data Mining of Health Documents. The full contents of the supplement are available online at http://www.jbiomedsem.com/supplements/2/S3.
- McDonald CJ: The barriers to electronic medical record systems and how to overcome them. J Am Med Inform Assoc. 1997, 4: 213-221. 10.1136/jamia.1997.0040213.View ArticleGoogle Scholar
- Thoroddsen A, Saranto K, Ehrenberg A, Sermeus W: Models, standards and structures of nursing documentation in European countries. Stud Health Technol Inform. 2009, 146: 327-331.Google Scholar
- Statutes of Finland 298/2009. Helsinki: Ministry of Social Affairs and HealthGoogle Scholar
- Patientdatalagen [Patient Data Law] 2008:355. Stockholm: National Board of Health and WelfareGoogle Scholar
- Tanttu K, Ikonen H: Nationally standardized electronic nursing documentation in Finland by the year 2007. Stud Health Technol Inform. 2007, 122: 540-541.Google Scholar
- Ehrenberg A, Ehnfors M, Thorell-Ekstrand I: Nursing documentation in patient records: experience of the use of the VIPS model. J Adv Nurs. 1996, 24: 853-867. 10.1046/j.1365-2648.1996.26325.x.View ArticleGoogle Scholar
- Davis FD: Perceived usefulness, perceived ease of use, and user acceptance of information technology. MIS Quarterly. 1989, 13: 319-340. 10.2307/249008.View ArticleGoogle Scholar
- Lauri S, Salanterä S: Developing an instrument to measure and describe clinical decision making in different nursing fields. J Prof Nurs. 2002, 18: 93-100. 10.1053/jpnu.2002.32344.View ArticleGoogle Scholar
- Task Force of the American College of Critical Care Medicine, Society of Critical Care Medicine: Guidelines for intensive care unit admission, discharge, and triage. Crit Care Med. 1999, 27: 633-638. 10.1097/00003246-199903000-00048.View ArticleGoogle Scholar
- Haupt MT, Bekes CE, Brilli RJ, Carl LC, Gray AW, Jastremski MS, Naylor DF, Rudis M, Spevetz A, Wedel SK, Horst M, Task Force of the American College of Critical Care Medicine, Society of Critical Care Medicine: Guidelines on critical care services and personnel: Recommendations based on a system of categorization of three levels of care. Crit Care Med. 2003, 31: 2677-2683. 10.1097/01.CCM.0000094227.89800.93.View ArticleGoogle Scholar
- Dalianis H, Hassel M, Velupillai S: The Stockholm EPR Corpus – characteristics and some initial findings. Proceedings of 14th International Symposium for Health Information Management Research. 2009, Kalmar, SwedenGoogle Scholar
- Miles MB, Huberman AM: Qualitative data analysis: an expanded sourcebook. 1994, Thousand Oaks (CA): Sage Publications, 2Google Scholar
- Krippendorff K: Content analysis: an introduction to its methodology. 2004, Thousand Oaks (CA): Sage Publications, 2Google Scholar
- Hsieh H-F, Shannon SE: Three approaches to qualitative content analysis. Qual Health Res. 2005, 15: 1277-1288. 10.1177/1049732305276687.View ArticleGoogle Scholar
- FinTWOL morphological analyser with the FinCG disambiguator. [http://www.lingsoft.fi]
- Knutsson O, Bigert J, Kann V: A robust shallow parser for Swedish. Proceedings of the 14th Nordic Conference on Computational Linguistics. 2003, Reykjavik, IcelandGoogle Scholar
- Suominen H: Machine learning and clinical text: supporting health information flow. (PhD thesis). 2009, Turku: University of TurkuGoogle Scholar
- Gellerstam M, Cederholm Y, Rasmark T: The bank of Swedish. Proceedings of the 2nd International Conference on Language Resources: Conference on Computational Linguistic. 2000, Athens, GreeceGoogle Scholar
- The most frequent tokens and types in a subset of the Finnish and Swedish data. [http://www.dsv.su.se/hexanord/data/]
- Lovis C, Baud RH, Planche P: Power of expression in the electronic patient record: structured data or narrative text?. Int J Med Inform. 2000, 58-59: 101-110. 10.1016/S1386-5056(00)00079-4.View ArticleGoogle Scholar
- Hyun S, Bakken S: Toward the creation of an ontology for nursing document sections: mapping section headings to the LOINC semantic model. AMIA Annu Symp Proc. 2006, 364-368.Google Scholar
- Kärkkäinen O, Eriksson K: Evaluation of patient records as part of developing a nursing care classification. J Clin Nurs. 2003, 12: 198-205. 10.1046/j.1365-2702.2003.00727.x.View ArticleGoogle Scholar
- Cheevakasemsook A, Chapman Y, Francis K, Davies C: The study of nursing documentation complexities. Int J Nurs Pract. 2006, 12: 366-374. 10.1111/j.1440-172X.2006.00596.x.View ArticleGoogle Scholar
- Hellesø R: Information handling in the nursing discharge note. J Clin Nurs. 2006, 15: 11-21. 10.1111/j.1365-2702.2005.01235.x.View ArticleGoogle Scholar
- Allvin H: Patientjournalen som genre [Patient narratives as a genre]. (Bachelor Thesis). 2010, Stockholm: Stockholm UniversityGoogle Scholar
- Harris Z, Gottfried M, Ryckman T, Mattick JRP, Daladier A, Harris T, Harris S: The Form of Information in Science, Analysis of Immunology Sublanguage, volume 104 of Boston Studies in the Philosophy of Science. . 1989, Dordrecht (The Netherlands): Kluwer Academic PublisherGoogle Scholar
- Biber D: Using register-diversified corpora for general language studies. Comput Linguistics. 1993, 19: 219-241.Google Scholar
- HL7 Health Level Seven International Standards. [http://www.hl7.org]
- NANDA Nursing Diagnostic Terminology. [http://www.nanda.org]
- SNOMED CT Systematized Nomenclature of Medicine – Clinical Terms. [http://www.fmrc.org.au/snomed]
- Domain-tailored FinCG. [http://www.lingsoft.fi/?doc_id=505&lang=en]
- Clinical Finnish Parser. [http://bionlp.utu.fi/clinicalcorpus.html]
- Cho KJ, Taira RK, Kangarloo H: Automatic section segmentation of medical reports. AMIA Annu Symp Proc. 2003, 155-159.Google Scholar
- Jancsary J, Matiasek J: Revealing the structure of medical dictations with conditional random fields. Proceedings of the 2008 Conference of Empirical Methods in Natural Language Processing. 2008, Stroudsburg (PA): Association for Computational LinguisticsGoogle Scholar
- Kim H, Goryachev S, Rosemblat C, Browne A, Keselman A, Zeng-Treitler Q: Beyond surface characteristics: a new health text-specific readability measurement. AMIA Annual Symp. 2007, 11: 418-422.Google Scholar
- Pakhomov SVS, Coden A, Chute CG: Developing a corpus of clinical notes manually annotated for part-of-speech. Int J Med Inform. 2006, 75: 418-429. 10.1016/j.ijmedinf.2005.08.006.View ArticleGoogle Scholar
- Borin L, Grabar N, Hallett C, Hardcastle D, Toporowska Gronostaj M, Kokkinakis D, Williams S, Willis A: Empowering the patient with language technology. Semantic Mining. 2007, NoE 507505: Deliverable D27.2, [http://gup.ub.gu.se/gup/record/index.xsql?pubid=53590]Google Scholar
- Allvin H, Carlsson E, Dalianis H, Danielsson-Ojala R, Daudaravicius V, Hassel M, Kokkinakis D, Lundgren-Laine H, Nilsson G, Nytrø Ø, Salanterä S, Skeppstedt M, Suominen H, Velupillai S: Characteristics and analysis of Finnish and Swedish clinical intensive care nursing narratives. Proceedings of the NAACL HLT 2010 Second Louhi Workshop on Text and Data Mining of Health Documents. 2010, Los Angeles (CA): Association for Computational LinguisticsGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.