AskDefine | Define Romanic

Extensive Definition

The Romance languages (sometimes referred to as Romanic languages, or Neolatin languages) are a branch of the Indo-European language family comprising all the languages that descend from Latin, the language of ancient Rome. They have more than 700 million native speakers worldwide, mainly in the Americas, Europe, and Africa, as well as many smaller regions scattered throughout the world.
Romance languages have their roots in Vulgar Latin, the popular sociolect of Latin spoken by soldiers, settlers and merchants of the Empire, as distinguished from the Classical form of the language spoken by the Roman upper classes, the form in which the language was generally written. Between 350 BC and 150 AD, the expansion of the Empire, together with its administrative and educational policies, made Latin the dominant native language in continental Western Europe. Latin also exerted a strong influence in southeastern Britain, the Roman province of Africa, and the Balkans north of the Jireček Line.
During the Empire's decline, and after its fragmentation and collapse in the 5th century, dialects of Latin began to diverge within each local area at an accelerated rate, and eventually evolved into languages in their own right. The overseas empires established by Spain, Portugal and France from the 15th century onward spread their languages to the other continents, to such an extent that about 70% of all Romance speakers today live outside Europe.
Despite influences from pre-Roman languages and from later invasions, the phonology, morphology, lexicon, and syntax of all Romance languages are predominantly evolutions of Vulgar Latin. In particular, with only one or two exceptions, Romance languages have lost the declension system of Classical Latin and, as a result, have SVO sentence structure and make extensive use of prepositions.


The term "Romance" comes from the Vulgar Latin adverb romanice, derived from Romanicus: used, for instance, in the expression romanice loqui, "to speak in Roman" (that is, the Latin vernacular), contrasted with latine loqui, "to speak in Latin" (Medieval Latin, the conservative version of the language used in writing and formal contexts or as a lingua franca), and with barbarice loqui, "to speak in Barbarian" (the non-Latin languages of the peoples that conquered the Roman Empire). From this adverb the noun romance originated, which applied initially to anything written romanice, or "in the Roman vernacular".
The word romance with the modern sense of romance novel or love affair has the same origin. In the medieval literature of Western Europe, serious writing was usually in Latin, while popular tales, often focusing on love, were composed in the vernacular and came to be called "romances".


Lexical and grammatical similarities among the Romance languages, and between Latin and each of them, are apparent from the following examples:
As an alternative to lei, Italian has the pronoun ella, a cognate of the other words for "she", but it has become disused in most dialects.
Note that some of the lexical divergence above comes from different Romance languages using the same root word with different meanings (semantic change). Portuguese, for example, has the word fresta, which is a cognate of French fenêtre, Italian finestra, Romanian fereastra and so on, but now means "slit" as opposed to "window." Likewise, Portuguese also has the word cear, a cognate of Italian cenare and Spanish cenar, but uses it in the sense of "to have a late supper" in most dialects, while the preferred word for "to dine" is actually jantar (related to archaic Spanish yantar) because of semantic changes in the 19th century. Galician has both fiestra (from medieval fẽestra which is the ultimate origin of standard Portuguese fresta), and the less frequently used ventá and xanela.


Vulgar Latin

There is a lack of documentary evidence about Vulgar Latin for the purposes of comprehensive research, and the literature is often hard to interpret or generalise upon. Many of its speakers were soldiers, slaves, displaced peoples and forced resettlers, more likely to be natives of conquered lands than natives of Rome. It is believed that Vulgar Latin already had most of the features that are shared by all Romance languages, which distinguish them from Classical Latin, such as the almost complete loss of the Latin case system and its replacement by prepositions; the loss of the neuter gender, comparative inflections, and many verbal tenses; the use of articles; and the initial stages of the palatalization of the plosives c, g, and t. There are some modern languages, such as Finnish, which have similar, quite sharp, differences between their printed and spoken form. This perhaps suggests that the form of Vulgar Latin that evolved into the Romance languages was around during the time of the empire, and was spoken alongside the written Classical Latin, reserved for official and formal occasions.

Fall of the Roman Empire

During the political decline of the Roman Empire in the fifth century, there were large-scale migrations into the empire, and the Latin-speaking world was fragmented into several independent states. Central Europe and the Balkans were occupied by the Germanic and Slavic tribes, as well as by the Huns, which isolated the Vlachs from the rest of Latin Europe. Latin disappeared completely from southeastern Britain and the Roman province of Africa, where it had been spoken by much of the urban population. But the Germanic tribes that had penetrated Italy, Gaul, and Hispania eventually adopted Latin and the remnants of Roman culture, and so Latin remained the dominant language there.

Latent incubation

Between the fifth and tenth centuries, the dialects of spoken Vulgar Latin diverged in various parts of their domain, eventually becoming distinct languages. This evolution is poorly documented because the literary language, Medieval Latin, remained close to the older Classical Latin.

Recognition of the vernaculars

Between the 10th and 13th centuries, some local vernaculars developed a written form and began to supplant Latin in many of its roles. In some countries, such as Portugal, this transition was expedited by force of law; whereas in others, such as Italy, many prominent poets and writers used the vernacular of their own accord.

Uniformization and standardization

The invention of the printing press apparently slowed down the evolution of Romance languages from the 16th century on, and brought a tendency towards greater uniformity of standard languages within political boundaries, at the expense of other Romance languages and dialects less favored politically. In France, for instance, the dialect spoken in the region of Paris gradually spread to the entire country, and the Occitan of the south lost ground.

Current status

The Romance language most widely spoken natively today is Spanish, followed by Portuguese, French, Italian, Romanian and Catalan, all of which are official languages in at least one country. A few other languages have official status on a regional or otherwise limited level, for instance Friulian, Sardinian and Valdôtain in Italy; Romansh in Switzerland; and Galician in Spain. French, Italian, Portuguese, Spanish, and Romanian are also official languages of the European Union. Spanish, Portuguese, French, Italian, Romanian, and Catalan are the official languages of the Latin Union; and French and Spanish are two of the six official languages of the United Nations.
Outside Europe, French, Spanish and Portuguese are spoken and enjoy official status in various countries that emerged from their respective colonial empires. French is an official language of Canada, Haiti, many countries in Africa, and some in the Indian and Pacific Oceans, as well as France's current overseas possessions. Spanish is an official language of Mexico, much of South America, Central America and the Caribbean, and of Equatorial Guinea in Africa. Portuguese is the official language of Brazil, being the most spoken language in South America, and official in six African countries. Although Italy also had some colonial possessions, its language did not remain official after the end of the colonial domination, resulting in Italian being spoken only as a minority or secondary language by immigrant communities in North and South America and Australia or African countries like Libya, Eritrea and Somalia. Romania did not establish a colonial empire, but the language spread outside of Europe through emigration, notably in Western Asia; Romanian has flourished in Israel, where it is spoken by some 5% of the total population as mother tongue, and by many more as a secondary language, considering the large population of Romanian-born Jews who moved to Israel after World War II.
The total native speakers of Romance languages are divided as follows (with their ranking within the languages of the world in brackets):
The remaining Romance languages survive mostly as spoken languages for informal contact. National governments have historically viewed linguistic diversity as an economic, administrative or military liability, as well as a potential source of separatist movements; therefore, they have generally fought to eliminate it, by extensively promoting the use of the official language, restricting the use of the "other" languages in the media, characterizing them as mere "dialects", or even persecuting them.
In the late 20th and early 21st centuries, however, increased sensitivity to the rights of minorities has allowed some of these languages to start recovering their prestige and lost rights. Yet it is unclear whether these political changes will be enough to reverse the decline of minority Romance languages.

Classification and related languages

The classification of the Romance languages is inherently difficult, since most of the linguistic area can be considered a dialect continuum, and in some cases political biases can come into play. Nevertheless, according to SIL counts, 47 Romance languages and dialects are spoken in Europe. Along with Latin (which is not included among the Romance languages) and a few extinct languages of ancient Italy, they make up the Italic branch of the Indo-European family.
Note that Dalmatian is now generally grouped under Proto-Italian rather than Eastern Romance.

Proposed subfamilies

The main subfamilies that have been proposed by Ethnologue within the various classification schemes for Romance languages are:

Pidgins, creoles, and mixed languages

Some languages have developed from mixtures of a Romance language with another language. It is not always clear whether they should be classified as pidgins, creole languages, or mixed languages.

Auxiliary and constructed languages

Latin and the Romance languages have also served as the inspiration and basis of numerous auxiliary and constructed languages, such as Interlingua, its reformed version Modern Latin, Latino sine flexione, Occidental, Lingua Franca Nova, Ido and Esperanto, as well as languages created for artistic purposes only, such as Brithenig, Wenedyk and Talossan.

Linguistic features

Common Indo-European features

As members of the Indo-European family, Romance languages have a number of features that are shared with other members of this family, and in particular with English; but which set them apart from languages of other families, including:

Features inherited from Classical Latin

The Romance languages share a number of features that were inherited from Classical Latin, and collectively set them apart from most other Indo-European languages:
  • Word stress remains predominantly on the penultimate syllable in most languages, although there have been significant changes with respect to classical Latin. Stress patterns of closely related languages usually match each other. The notable exception is French, whose stress is fixed, falling predictably on the last syllable that does not contain a schwa.
  • They have two grammatical numbers, singular and plural (no dual).
  • In most languages, personal pronouns have different forms according to their grammatical function in a sentence, a remnant of the Latin case system; there is usually a form for the subject (inherited from the Latin nominative), another for the object (from the accusative or the dative), and a third set of personal pronouns used after prepositions or in stressed positions (see Prepositional pronoun and Disjunctive pronoun, for further information). Third person pronouns often have different forms for the direct object (accusative), the indirect object (dative), and the reflexive.
  • Most are null-subject languages. French is a notable exception.
  • Verbs have many conjugations, including in most languages:
  • Several tenses, especially of the indicative mood, have been preserved with little change in most languages, as shown in the following table for the Latin verb dīcere (to say), and its descendants.
1With the variant díser.
2Until the 18th century.
3With the disused variant dize.
  • The main tense and mood distinctions that were made in classical Latin are generally still present in the modern Romance languages, though many are now expressed through compound rather than simple verbs. The passive voice, which was mostly synthetic in classical Latin, has been completely replaced with compound forms.

Features inherited from Vulgar Latin

Romance languages also have a number of features that are not shared with Classical Latin. Most of these are thought to have been inherited from Vulgar Latin. Even though the Romance languages are all derived from Latin, they are arguably much closer to each other than to their common ancestor, owing to a core of common developments. The main difference is the loss of the case system of Classical Latin, an essential feature which allowed great freedom of word order, and has no counterpart in any Romance language except Romanian. In this regard, the distance between any modern Romance language and Latin is comparable to that between Modern English and Old English. While speakers of French, Italian or Spanish, for example, can quickly learn to see through the phonological changes reflected in spelling differences, and thus recognize many Latin words, they will often fail to understand the meaning of Latin sentences.
  • Vulgar Latin borrowed many words, often from Germanic languages, that replaced words from Classical Latin during the Migration Period, including some basic vocabulary. Notable examples are *blancus (white), which replaced Classical Latin albus in most major languages; *guerra (war), which replaced bellum; and the words for the cardinal directions, where cognates of English "north", "south", "east" and "west" replaced the Classical Latin words borealis (or septentrionalis), australis (or meridionalis), orientalis, and occidentalis, respectively, in the vernacular. (See History of French - The Franks.)
  • There are definite and indefinite articles, derived from Latin demonstratives and the numeral unus (one).
  • Nouns have only two grammatical genders, masculine and feminine. Most Latin neuter nouns became masculine nouns in Romance. However, in Romanian, one class of nouns—including the descendants of many Latin neuter nouns—behave like masculines in the singular and feminines in the plural (e.g. un deget "one finger" vs două degete "two fingers", cf. Latin digitum, pl. digita). The same phenomenon is observed non-productively in Italian (e.g. il dito "the finger" vs le dita "the fingers").
  • Apart from gender and number, nouns, adjectives and determiners are not inflected. Cases have generally been lost, though a trace of them survives in the personal pronouns. An exception is Romanian, which retains a combined genitive-dative case.
  • Adjectives generally follow the noun they modify.
  • Many Latin combining prefixes were incorporated in the lexicon as new roots and verb stems, e.g. Italian estrarre (to extract) from Latin ex- (out of) and trahere (to drag).
  • Many Latin constructions involving nominalized verbal forms (e.g. the use of accusative plus infinitive in indirect discourse and the use of the ablative absolute) were dropped in favor of constructions with subordinate clauses. Exceptions can be found in Italian, for example, Latin tempore permittente > Italian tempo permettendo; L. hoc facto > I. fatto ciò.
  • The normal clause structure is SVO, rather than SOV, and is much less flexible than in Latin.
  • Owing to sound changes which made it homophonous with the preterite, the Latin future indicative tense was dropped, and replaced with a periphrasis of the form infinitive + present tense of habēre (to have). Eventually, this structure was reanalysed as a new future tense.
  • In a similar process, an entirely new conditional form was created.
  • While the synthetic passive voice of classical Latin was abandoned in favour of periphrastic constructions, most of the active voice remained in use. However, several tenses have changed meaning, especially subjunctives. For example:
    • The Latin pluperfect indicative became a conditional in Catalan and Sicilian, and an imperfect subjunctive in Spanish.
    • The Latin pluperfect subjunctive developed into an imperfect subjunctive in all languages except Romansh, where it became a conditional, and Romanian, where it became a pluperfect indicative.
    • The Latin preterite subjunctive, together with the future perfect indicative, became a future subjunctive in Old Spanish, Portuguese, and Galician.
    • The Latin imperfect subjunctive became a personal infinitive in Portuguese and Galician.
  • Many Romance languages have two verbs "to be", derived from the Latin stare (mostly used for temporary states) and esse (mostly used for essential attributes). In French, however, stare and esse had become ester and estre by the late Middle Ages. Owing to phonetic developments, there were the forms êter and être, which eventually merged to être, and the distinction was lost. In Italian, the two verbs share the same past participle, stato. See Romance copula, for further information.
For a more detailed illustration of how the verbs have changed with respect to classical Latin, see Romance verbs.
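The periphrastic origin of the future tense described above can be illustrated with a small sketch. It takes Spanish as the example language, which is a choice made for this illustration: the six future endings are listed next to the present tense of haber (the descendant of habēre) so that the resemblance is visible. The function names and the regular verb cantar are assumptions of the example, and irregular future stems are not handled.

```python
# Illustrative sketch only: Spanish regular futures viewed as
# infinitive + reduced present tense of "haber" (from Latin habēre).
# Persons are ordered: 1sg, 2sg, 3sg, 1pl, 2pl, 3pl.
HABER_PRESENT = ["he", "has", "ha", "hemos", "habéis", "han"]
FUTURE_ENDINGS = ["é", "ás", "á", "emos", "éis", "án"]


def periphrasis(infinitive):
    """Return the unfused 'infinitive + haber' sequences for all six persons."""
    return [f"{infinitive} {aux}" for aux in HABER_PRESENT]


def future_forms(infinitive):
    """Return the modern fused future forms of a regular verb."""
    return [infinitive + ending for ending in FUTURE_ENDINGS]


if __name__ == "__main__":
    for old, new in zip(periphrasis("cantar"), future_forms("cantar")):
        print(f"{old:15} -> {new}")
```

Printing the two columns side by side shows how each ending lines up with a form of the auxiliary, which is the reanalysis the text refers to.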

Sound changes

The vocabularies of Romance languages have undergone considerable change since their birth, by various phonological processes that were characteristic of each language. Those changes applied more or less systematically to all words, but were often conditioned by the sound context, morphological structure, or regularizing tendencies.
Most languages have lost sounds from the original Latin words. In French, in particular, elision progressed further than in any of the other languages (although its conservative etymological spelling does not always make this apparent). There, final vowels were generally dropped, and sometimes also the preceding consonant: thus Latin lupus and luna became Italian lupo and luna but French loup [lu] and lune [lyn]. (See also Use of the circumflex in French.) Catalan, Occitan, many Northern Italian dialects, and Romanian (Daco-Romanian) lost the final vowels in most masculine nouns and adjectives, but retained them in the feminine. Other languages, including Italian, Portuguese, Spanish and Galician, have retained those vowels.
Some languages have lost the final vowel -e from verbal infinitives, e.g. dīcere → Portuguese dizer (to say). Other common cases of apocope are the verbal endings, e.g. Latin amāt → Italian ama (he loves), amābam → amavo (I loved), amābat → amava (he loved), amābatis → amavate (you loved), etc.
Sounds were often lost in the middle of words, too; e.g. Latin Luna → Galician and Portuguese Lua (Moon), crēdere → Spanish creer (to believe).
On the other hand, some languages have added epenthetic vowels to words in certain contexts. Characteristic of the Iberian Romance languages is the insertion of a prosthetic e at the start of Latin words that began with s + consonant, such as sperō → espero (I hope). French originally did the same, but later dropped the s: spatula → arch. espaule → épaule (shoulder). In the case of Italian, a special article, lo for the definite and uno for the indefinite, is used for masculine words that begin with s + consonant (sbaglio, "mistake" → lo sbaglio, "the mistake"), as well as for all masculine words beginning with z (i.e. the affricates /ts/ or /dz/): zaino, "backpack" → lo zaino, "the backpack".
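The two rules in the preceding paragraph are regular enough to state as code. The following is a minimal sketch, not a full morphological model: the function names are hypothetical, the first function models only the inserted e (not the other sound changes a word underwent), and the article function covers only consonant-initial masculine nouns, ignoring vowel-initial nouns and the other special onsets of Italian.

```python
# Minimal sketch of two rules: Iberian prosthetic e- before s + consonant,
# and the Italian choice between the masculine articles "lo" and "il".
VOWELS = set("aeiouàèéìòù")


def starts_with_s_plus_consonant(word):
    """True if the word begins with s followed by a non-vowel letter."""
    w = word.lower()
    return len(w) >= 2 and w[0] == "s" and w[1] not in VOWELS


def add_prosthetic_e(word):
    """Prefix e- to words beginning with s + consonant; leave others unchanged."""
    return "e" + word if starts_with_s_plus_consonant(word) else word


def italian_masculine_article(noun):
    """Pick 'lo' before s + consonant or z, else 'il'.

    Simplification: only consonant-initial nouns are handled correctly.
    """
    if noun.lower().startswith("z") or starts_with_s_plus_consonant(noun):
        return "lo"
    return "il"


if __name__ == "__main__":
    print(add_prosthetic_e("spero"))
    for noun in ("sbaglio", "zaino", "libro"):
        print(italian_masculine_article(noun), noun)
```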
A characteristic feature of the writing systems of almost all Romance languages is that the Latin letters c and g — which originally always represented the "hard" consonants /k/ and /g/ respectively — now represent "soft" consonants when they come before e, i, or y. This is due to a general palatalization of /k/ and /ɡ/ that occurred in the transition to Vulgar Latin. Since the written form of all the affected words was tied to the classical language, the shift was accommodated by a change in the pronunciation rules. The soft sounds of c and g vary from language to language. The consonant t, which was also palatalized, changes pronunciation in French (and English) orthography, but in the other Romance languages the spelling was altered to match the new sound. An exception is Sardinian, whose plosives remained hard before e and i in many words.
The distinctions of vowel length present in Classical Latin were lost in most Romance languages (an exception is Friulian), and partly replaced with qualitative contrasts such as monophthong versus diphthong (Italian, Spanish; French to a lesser extent), or close vowel versus open vowel (as in Portuguese, Galician, Occitan and Catalan).
For most languages in this family, consonant length is no longer phonemically distinctive or present. However, some languages of Italy (Italian, Sardinian and Sicilian) do have long consonants like /bb/, /kk/, /dd/, etc., where the doubling indicates a short hold before the consonant is released, in many cases with distinctive lexical value: e.g. note /ˈnɔ.te/ (notes) vs. notte /ˈnɔt.te/ (night), cade /ˈka.de/ (s/he, it falls) vs. cadde /ˈkad.de/ (s/he, it fell). They may even occur at the beginning of words in Romanesco, Neapolitan and Sicilian, and are occasionally indicated in writing, e.g. Sicilian cchiù (more), and ccà (here). In general, the consonants /b/, /ts/, and /dz/ are long at the start of a word, while the archiphoneme |R| is realised as a trill /r/ in the same position.
The double consonants of Piedmontese exist only after stressed /ə/, written ë, and are not etymological: vëdde (Latin videre, to see), sëcca (Latin sicca, dry, feminine of sech). In standard Catalan and Occitan, there exists a geminate sound /lː/ written ŀl (Catalan) or ll (Occitan), but it is usually pronounced as a simple sound in colloquial (and even some formal) speech in both languages.
For more detailed descriptions of sound changes, see the articles Vulgar Latin, History of French, History of Portuguese, Latin to Romanian sound changes, and Linguistic history of Spanish.

Lexical stress

While word stress was rigorously predictable in classical Latin, this is no longer the case in most Romance languages, and stress differences can be enough to distinguish between words. For example, Italian Papa [ˈpa.pa] (Pope) and papà [pa.ˈpa] (daddy), or the Spanish imperfect subjunctive cantara ([if he] sang) and future cantará ([he] will sing). However, the main function of Romance stress appears to be a clue for speech segmentation, namely to help the listener identify the word boundaries in normal speech, where inter-word spaces are usually absent.
The position of the stressed syllable in a word generally varies from word to word in each Romance language. Stress usually remains fixed on its assigned syllable within any language, however, even as the word is inflected. It is usually restricted to one of the last three syllables in the word, although Italian verb forms can violate this (e.g. teléfonano 'they telephone'). The limit may be exceeded also by verbs with attached clitics, provided the clitics are counted as part of the word; e.g. Spanish entregándomelo [en.tre.ˈɣan.do.me.lo] (delivering it to me), Italian mettiamocene [me.ˈtːjaː.mo.ʧe.ne] (let's put some of it in there), or Portuguese dávamo-vo-lo [ˈda.vɐ.mu.vu.lu] (we were giving it to you).

Other shared features

The Romance languages also share a number of features that were not the result of common inheritance, but rather of various cultural diffusion processes in the Middle Ages — such as literary diffusion, commercial and military interactions, political domination, influence of the Catholic Church, and (especially in later times) conscious attempts to "purify" them in accordance with Classical Latin. Some of those features have in fact spread to other non-Romance (and even non-Indo-European) languages, chiefly in Europe. Some of these "late origin" shared features are:
  • Most Romance languages have polite forms of address that change the person and/or number of 2nd person subjects (T-V distinction), such as the tu/vous contrast in French, the tu/Lei contrast in Italian, the tu/dumneavoastră (from dominus + vostre, literally meaning "your Lordship") in Romanian or the tú (or vos) /usted contrast in Spanish.
  • They all have a large collection of learned hellenisms and latinisms, with prefixes, stems, and suffixes retained or reintroduced from Greek and Latin, and used to coin new words. Most of these are also used in English, e.g. tele-, poly-, meta-, pseudo-, dis-, ex-, post-, -scope, -logy, -tion, though their spelling may differ slightly; for example, poly- becomes poli- in Romanian, Italian and Spanish.
  • During the Renaissance, Italian, Portuguese, Spanish and a few other Romance languages developed a progressive aspect which did not exist in Latin. In French, progressive constructions remain very limited, the imperfect aspect generally being preferred, as in Latin.
  • Many Romance languages now have a verbal construction analogous to the present perfect tense of English. In some, it has taken the place of the old preterite (at least in the vernacular); in others, the two coexist with somewhat different meanings (cf. English I did vs. I have done). A few examples:
    • preterite only: Galician, Sicilian, some dialects of Spanish;
    • preterite and present perfect: Occitan, Portuguese, standard Spanish;
    • present perfect predominant, preterite now literary: French, several dialects of Italian and Spanish.

Writing systems

The Romance languages have kept the writing system of Latin, adapting it to their evolution. One exception was Romanian before the 19th century, where, after the Roman retreat, literacy was reintroduced through the Romanian Cyrillic alphabet by Slavic influences. Also the non-Christian populations of Spain used the systems of their culture languages (Arabic and Hebrew) to write Romance languages such as Ladino and Mozarabic in aljamiado.


The Romance languages are written with the classical Latin alphabet of 22 letters — A, B, C, D, E, F, G, H, I, L, M, N, O, P, Q, R, S, T, V, X, Y, Z — subsequently modified and augmented in various ways. In particular, the letters K and W are seldom used in most Romance languages, only for unassimilated foreign names and words.
While most of the 22 basic Latin letters have maintained their phonetic value, for some of them it has diverged considerably; and the new letters added since the Middle Ages have been put to different uses in different scripts. Some letters, notably H and Q, have been variously combined in digraphs or trigraphs (see below) to represent phonetic phenomena not recorded in Latin, or to get around previously established spelling conventions.
The spelling rules of most Romance languages are fairly simple, but subject to considerable regional variation. To a first approximation, the phonetic values of the letters can be summarized as follows:
C: Generally a "hard" [k], but "soft" (fricative or affricate) before e, i, or y.
G: Generally a "hard" [ɡ], but "soft" (fricative or affricate) before e, i, or y. In some languages, like Spanish, the hard g is pronounced as a fricative [ɣ] after vowels. In Romansh, the soft g is a voiced palatal plosive [ɟ].
H: Silent in most languages; used to form various digraphs. But represents [h] in Romanian and Gascon Occitan.
J: Represents a fricative in most languages, or the palatal approximant [j] in Romansh and in several of the languages of Italy. Italian does not use this letter in native words. Usually pronounced like the soft g (except in Romansh and the languages of Italy).
Q: As in Latin, its phonetic value is that of a hard c, and in native words it is always followed by a (sometimes silent) u. Romanian does not use this letter in native words.
S: Generally voiceless [s], but voiced [z] between vowels in most languages. In Spanish, Romanian, Galician and several varieties of Italian, however, it is always pronounced voiceless. At the end of syllables, it may represent special allophonic pronunciations.
W: No Romance language uses this letter in native words, with the exception of Walloon.
X: Its pronunciation is rather variable, both between and within languages. In the Middle Ages, the languages of Iberia used this letter to denote the voiceless postalveolar fricative [ʃ], which is still the case in Modern Catalan. With the Renaissance the classical pronunciation [ks] — or similar consonant clusters, such as [ɡz], [ɡs], or [kθ] — were frequently reintroduced in latinisms and hellenisms. In Venetian it represents [z], and in Ligurian the voiced postalveolar fricative [ʒ]. Italian does not use this letter in native words.
Y: This letter is not used in most languages, with the prominent exceptions of French and Spanish, where it represents [j] before vowels (or various similar fricatives such as the palatal fricative [ʝ], in Spanish), and the vowel or semivowel [i] elsewhere.
Z: In most languages it represents the sound [z], but in Italian it denotes the affricates [ʣ] and [ʦ] (which, although not normally in contrast, are usually strictly assigned lexically in any single variety: Standard Italian gazza 'magpie' always with [ddz], mazza 'club, mace' only with [tts]), and in Galician and Spanish it denotes either the voiceless dental fricative [θ] or [s].
Otherwise, letters that are not combined as digraphs generally have the same sounds as in the International Phonetic Alphabet (IPA), whose design was, in fact, greatly influenced by the Romance spelling systems.
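The first-approximation rule given above for C and G (hard by default, soft before e, i, or y) can be written as a short function. This is a sketch under stated assumptions: the function name is hypothetical, accented vowels are not treated as front vowels, and digraphs such as CH, GH or GU, which override the rule in several languages, are deliberately ignored.

```python
# Sketch of the orthographic rule: c and g are "soft" before e, i, or y
# and "hard" elsewhere. Digraphs and accented vowels are not handled.
FRONT_VOWELS = {"e", "i", "y"}


def c_g_quality(word):
    """Return (index, letter, quality) for every c or g in the word."""
    w = word.lower()
    result = []
    for i, letter in enumerate(w):
        if letter in ("c", "g"):
            # Look at the following letter, if there is one.
            following = w[i + 1] if i + 1 < len(w) else ""
            quality = "soft" if following in FRONT_VOWELS else "hard"
            result.append((i, letter, quality))
    return result


if __name__ == "__main__":
    print(c_g_quality("cinco"))
    print(c_g_quality("gigante"))
```

The actual sound of a soft c or g differs from language to language, as noted above, so the function reports only the hard or soft class rather than a phonetic value.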

Digraphs and trigraphs

Since most Romance languages have more sounds than can be accommodated in the Roman Latin alphabet, they all resort to the use of digraphs and trigraphs, combinations of two or three letters with a single sound value. The concept (but not the actual combinations) derives from Classical Latin, which used, for example, TH, PH, and CH when transliterating the Greek letters "θ", "ϕ" (later "φ"), and "χ". (These were once aspirated sounds in Greek before changing to the corresponding fricatives, and the H represented what sounded to the Romans like an /ʰ/ following /t/, /p/, and /k/ respectively.) Some of the digraphs used in modern scripts are:
CI: used in Italian, Romance languages in Italy and Romanian to represent /ʧ/ before A, O, or U.
CH: used in Italian, Romance languages in Italy, Romanian, Romansh and Sardinian to represent /k/ before E or I; /ʧ/ in Occitan, Spanish and Galician; [c] in Romansh before A, O or U; and /ʃ/ in most other languages.
DD: used in Sicilian and Sardinian to represent the voiced retroflex plosive /ɖ/. In recent history more accurately transcribed as DDH.
DJ: used in Catalan and Walloon for /ʤ/.
GI: used in Italian, Romance languages in Italy and Romanian to represent /ʤ/ before A, O, or U.
GH: used in Italian, Romance languages in Italy, Romanian, Romansh and Sardinian to represent /ɡ/ before E or I, and in Galician for the voiceless pharyngeal fricative /ħ/ (not standard sound).
GL: used in Romansh before consonants and at the end of words for /ʎ/.
GLI: used in Italian and Romansh for /ʎ/.
GN: used in French, Italian, Romance languages in Italy and Romansh for /ɲ/, as in champignon or gnocchi.
GU: used before E or I to represent /ɡ/ or /ɣ/ in all Romance languages except Italian, Romance languages in Italy and Romanian.
IG: used at the end of a word in Catalan for /ʧ/, as in maig, safareig or enmig.
IX: used between vowels or at the end of a word in Catalan for /ʃ/, as in caixa or calaix.
LH: used in Portuguese and Occitan for /ʎ/.
LL: used in Spanish, Catalan, Galician, Norman and Dgèrnésiais, originally for /ʎ/, which has merged in some cases with /j/. Represents /l/ in French unless it follows I (i), when it represents /j/ (or /ʎ/ in some dialects). It is used in Occitan for a long /ll/.
L·L: used in Catalan for a geminate consonant /ll/.
NH: used in Portuguese and Occitan for /ɲ/; used in official Galician for /ŋ/.
N-: used in Piedmontese and Ligurian for /ŋ/ between two vowels.
NY: used in Catalan for /ɲ/.
QU: represents [kw] in Italian and Romance languages in Italy; [k] in French and Spanish; [k] (before e or i) or [kw] (normally before a or o) in Occitan, Catalan and Portuguese.
RR: used between vowels in several languages (Occitan, Catalan, Spanish...) to denote a trilled /r/ or a guttural R, instead of the flap /ɾ/.
SC: used before E or I in Italian and Romance languages in Italy for /ʃ/, and in French and Spanish as /s/ in words of certain etymology.
SCH: used in Romansh for [ʃ] or [ʒ].
SCI: used in Italian and Romance languages in Italy to represent /ʃ/ before A, O, or U.
SH: used in Aranese Occitan for /ʃ/.
SS: used in French, Portuguese, Piedmontese, Occitan and Catalan for /s/ between vowels.
TG: used in Romansh for [c]. In Catalan it is used for /ʤ/ between vowels, as in metge or fetge.
TH: used in Jèrriais for /θ/ (as in English "thick"); used in Aranese for either /t/ or /ʧ/.
TJ: used between vowels and before A, O or U, in Catalan for /ʤ/, as in sotjar or mitjó.
TSCH: used in Romansh for [ʧ].
TX: used at the beginning or at the end of a word or between vowels in Catalan for /ʧ/, as in txec, esquitx or atxa.
While the digraphs CH, PH, RH and TH were at one time used in many words of Greek origin, most languages have now replaced them with C/QU, F, R and T. Only French has kept these etymological spellings, which now represent /k/ or /ʃ/, /f/, /ʀ/ and /t/, respectively.
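The context-dependent readings listed above for Italian can be sketched as a longest-match lookup. The following Python sketch is illustrative only: the rule table covers just the Italian entries from the list (CH, CI, GH, GI, GLI, GN, SC, SCI), the names ITALIAN_RULES and segment are hypothetical, and real Italian spelling has exceptions that the sketch ignores.

```python
# Each rule: (letter group, letters that must follow it or None, sound it stands for).
# Longer groups come first so that SCI is tried before SC and CI.
ITALIAN_RULES = [
    ("sci", "aou", "ʃ"),
    ("gli", None, "ʎ"),
    ("ch", "ei", "k"),
    ("gh", "ei", "ɡ"),
    ("ci", "aou", "ʧ"),
    ("gi", "aou", "ʤ"),
    ("gn", None, "ɲ"),
    ("sc", "ei", "ʃ"),
]


def segment(word):
    """Split a word into (letters, sound) pairs; sound is None for plain letters."""
    word = word.lower()
    pairs = []
    i = 0
    while i < len(word):
        for group, following, sound in ITALIAN_RULES:
            end = i + len(group)
            if not word.startswith(group, i):
                continue
            if following is None or (end < len(word) and word[end] in following):
                pairs.append((group, sound))
                i = end
                break
        else:
            # No multi-letter rule applies here: keep the single letter as-is.
            pairs.append((word[i], None))
            i += 1
    return pairs


print(segment("gnocchi"))
print(segment("ciao"))
```

For gnocchi, one of the examples given in the list, the sketch picks out GN and CH and leaves the remaining letters untouched.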

Double consonants

Gemination, in the languages where it occurs, is usually indicated by doubling the consonant, except when it does not contrast phonemically with the corresponding short consonant, in which case gemination is not indicated. In Jèrriais, long consonants are marked with an apostrophe: S'S is a long /zz/, SS'S is a long /ss/, and T'T is a long /tt/. The double consonants in French orthography, however, are merely etymological.

Diacritics and special characters

Romance languages use various diacritics, especially on vowels, to mark special pronunciations, or to distinguish between homophones. The following are the most common.
  • Diaeresis: when a vowel and another letter that would normally be combined into a digraph with a single sound are exceptionally pronounced apart, this is often indicated with a diaeresis mark on the vowel. In the Spanish word pingüino (penguin), the letter u is pronounced, although normally it is silent in the digraph gu when this is followed by an e or an i. Other Romance languages that use the diaeresis in this fashion are French, Catalan, and (Brazilian) Portuguese.
  • Homophones: words that are pronounced exactly or nearly the same way, but have different meanings, can be differentiated with an acute (as in Spanish, where si means "if" while sí means "yes", "himself", "herself", "itself", or "themselves") or with a grave accent (French, in which ou means "or" and où means "where", as well as Italian and Catalan). The circumflex can sometimes have this function in French as well. Often, such words are monosyllables, the accented one being phonetically stressed, while the unaccented one is a clitic; examples are the Spanish clitics de, se, and te (a preposition and two personal pronouns), versus the stressed words dé, sé, and té (two verbs and a noun).
  • Stress: the stressed vowel in a polysyllabic word may be indicated with the acute, é (in Spanish, Portuguese, Catalan), or the grave accent, è (Italian, Catalan). The orthographies of French and Romanian do not mark stress. In Italian orthography, indicating stress with a diacritic is only required when it falls on the last syllable of a word.
  • Vowel quality: the system of marking close-mid vowels with an acute, é, and open-mid vowels with a grave accent, è, is widely used (in Catalan, French, Italian, etc.). Portuguese, however, uses the circumflex (ê) for the former and the acute (é) for the latter.
Less widespread diacritics in the Romance languages are the breve (in Romanian, ă) and the ring (in Walloon and the Bolognese dialect of Emiliano-Romagnolo, å). The French orthography includes the etymological ligatures œ and (more rarely) æ. The circumflex frequently has an etymological value in this language as well.
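The Spanish diaeresis rule described in the list above can be checked mechanically. The Python sketch below is a simplification under stated assumptions: it only looks at the sequences gue, gui, güe and güi, it ignores accented vowels, and the function name gu_readings is hypothetical.

```python
import re
import unicodedata


def gu_readings(word):
    """Return (sequence, u_is_pronounced) for each g + u/ü + e/i in the word."""
    # Normalize so that "u" plus a combining diaeresis becomes the single letter ü.
    word = unicodedata.normalize("NFC", word).lower()
    readings = []
    for match in re.finditer("g([u\u00fc])[ei]", word):
        # A plain u is silent in this position; \u00fc (ü) marks a pronounced u.
        readings.append((match.group(0), match.group(1) == "\u00fc"))
    return readings


print(gu_readings("pingüino"))  # [('güi', True)]
print(gu_readings("guerra"))    # [('gue', False)]
```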

Upper and lower case

Most languages are written with a mixture of two distinct but phonetically identical variants or "cases" of the alphabet: majuscule ("uppercase" or "capital letters"), derived from Roman stone-carved letter shapes, and minuscule ("lowercase"), derived from Carolingian writing and Medieval quill pen handwriting which were later adapted by printers in the 15th and 16th centuries.
In particular, all Romance languages presently capitalize (use uppercase for the first letter of) the following words: the first word of each complete sentence, most words in names of people, places, and organizations, and most words in titles of books. The Romance languages do not follow the German practice of capitalizing all nouns including common ones. Unlike English, the names of months (except in European Portuguese), days of the week, and derivatives of proper nouns are usually not capitalized: thus, in Italian one capitalizes Francia ("France") and Francesco ("Francis"), but not francese ("French") or francescano ("Franciscan"). However, each language has some exceptions to this general rule.
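The Italian example can be expressed as a small rule: capitalize the first word of the sentence and any proper noun, but leave derivatives of proper nouns in lowercase. The Python sketch below assumes a hand-made set of proper nouns (PROPER_NOUNS) and a hypothetical function name; it ignores punctuation, titles, and the language-specific exceptions mentioned above.

```python
PROPER_NOUNS = {"francia", "francesco"}  # illustrative set, not exhaustive


def capitalize_sentence(words):
    """Capitalize the first word and any known proper noun; lowercase the rest."""
    result = []
    for index, word in enumerate(words):
        lower = word.lower()
        if index == 0 or lower in PROPER_NOUNS:
            result.append(lower.capitalize())
        else:
            result.append(lower)
    return result


print(capitalize_sentence(["il", "francescano", "francesco", "è", "francese"]))
# ['Il', 'francescano', 'Francesco', 'è', 'francese']
```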

Vocabulary comparison

The table below provides a vocabulary comparison that illustrates a number of examples of sound shifts that have occurred between Latin and the main Romance languages, along with a selection of minority languages.


Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2
Material from Wikipedia, Wiktionary, Dict