Saturday, June 2, 2012

Kahtsaai: Intensives and Content Questions

Version 1.3 of the Conlanger's Thesaurus is out. It has a few more polysemy maps, a few more grammaticalizations, and an addition to the final section with some notes on the typology of content question words ("who, what, when," etc.). The semantic map I got from "thread" was suitably and ironically tangled.

Some of the work on the Thesaurus has motivated me to make a few refinements to Kahtsaai. The biggest change is that I made some changes to question words. In English and the rest of the Indo-European family, we're used to content question words being obviously related in some way. In English, they all start with wh-, and in the Romance languages with qu-/c-. But it turns out this pattern is very rare in the world's languages, which may have completely unrelated roots for their core set of question words. So, I irregularized Kahtsaai a bit, with *ye'wei where becoming táá, and *ye'pas why becoming łouh.

Related to the question words, I have started to fill out the range of indefinites a bit. In particular, there's now an affix on the question words to mark a free choice indefinite ("pick any one"),

Ye'énałso'ołípto.
ye'é-nałsou-n-líp-to
what-FREE.CHOICE2SG-3INAN-select-IMP
Pick anything.

Additionally, it struck me that I didn't yet have a way to indicate intensives, I myself did it. In quite a few languages, those are related to the word for body. I didn't feel like using the full noun for body, tsire. Instead, I took the inspiration from the morpheme used in noun incorporation for body, which is -s(i)-. I expanded that a bit, to -ssi', which must be possessed:

Mokeilaassi'yo'áhlektényolleile.
mokeilats-ssi'yo-'áh-lekté-nyolleile
SUBJsoldier3SG.AN-self3SG.AN-3PL-save-EVIDchildren
The soldier himself saved the children.

Monday, May 28, 2012

Generating Semantic Maps

One of the central features of the Conlanger's Thesaurus is the cross-linguistic semantic maps. For the first version of the Thesaurus I used those I could find in public linguistics journal articles. But it occurred to me I could come up with some of these on my own.

First I came up with some straightforward software to manipulate lists of definitions to produce the semantic maps automatically. I wasn't actually expecting this approach to work out so well right away, but my initial assumptions and model turned out to work pretty well.

The biggest problem has been finding good dictionaries to work with. All too many online dictionaries — and not a few printed ones — are simply lists of words with single-word definitions. This is not a great way to get at polysemy. However, over the last few days I have managed to find enough good dictionaries online to make me confident in the cross-linguistic (and cross-cultural) polysemy maps I've been creating.

The code is explained at Generating Cross-Linguistic Semantic Maps. At the bottom of that page is a list of core words around which I have generated maps. Even if you cannot understand the Python programming language, you can see the list of languages and meanings I have used in the links that end in .py. The maps are images of the common polysemies.

There have been two big surprises to me in these maps. First, "face" can refer to the blade of a knife in two utterly unrelated languages (Turkish and Inupiaq). Second, I was surprised how often "sweet" can refer to what English speakers consider other flavors, especially "salty."

Wednesday, May 23, 2012

A Conlanger's Thesaurus

In the last year or so I have been thinking about writing a piece of software which would spit up a skeleton dictionary which I could fill in with a new language being created. The point was to help me get out of certain lexical ruts, while still creating a language that would be more or less internally consistent. I gave up on that project, but one side effect was a lot of reading about recent work in lexical typology. I found the semantic maps especially interesting as tools for conlanging. I've collected a bunch of that work in A Conlanger's Thesaurus.

The core of that document is a word list, lightly edited, but mixed in whenever possible are cross-linguistic semantic maps, to prod thinking about new possibilities for words that don't simply reproduce the semantic boundaries of languages I already know. There are still a lot of gaps, but this seems a good start.

The last two pages have some dense but very interesting semantic maps relating to matters most people would consider grammatical rather than lexical. But there's a lot of interesting stuff there, too. Whenever possible I have included links to free, online papers and articles which discuss the maps in much more detail, so people can hunt down details if they are moved to do so.

If you know of cross-linguistic maps I have missed, please let me know. I'm sure I'll be updating this document from time to time as I run across new work.

Thursday, April 26, 2012

Kahtsaai: a year old

I just noticed that the document sketch.pdf in my Kahtsaai directory is dated to April 23, 2011, meaning I just missed Kahtsaai's birthday. Taking a quick look at that sketch, it's interesting how much of the basic character of the language was firmly settled in just three not-very-dense pages with a whopping 20-word vocabulary.

For example, the conjugation table of subject and object prefixes is very similar, with the biggest change being the addition of several conditioned variants for the third person inanimate object, and the indefinites (thank you, Nahuatl). The sound system is completely unchanged, which shows serious restraint for me. The inchoative went from being a suffix, -píí to an aspectual prefix, -yé'-. The verb chain got a new slot just for those aspectual and adverbial prefixes.

I somehow derived an entire system of deixis from the single word hó'owa, thus.

Amusingly, the clause-final discourse particle łaai, undefined in the sketch, was briefly turned into the word yes, but now again sits without definition. It was used in the phrase, "s/he will make tea łaai." I suppose I really should find a meaning for it, given it's seniority.

It's at 973 lexical entries, including sublemmata. Perhaps I can spend the weekend getting that up to 1000.

Friday, April 6, 2012

Agreement of What?

I recently ran across this page about transitivity. Section six, "Problem Cases," mentions two languages in which it appears that the verb transitivity is cross-referenced on other constituents in the clause, in particular, adverbs and noun phrases acting adverbially. The first of the Shipibo-Conibo examples has two other elements in the clause so marked, Jain-xon-ra there-TRNS-EV and xobo-n-xon house-LOC-TRNS.

I don't think I'm going to be using this feature any time soon, but some people might find it alluring.

Saturday, January 28, 2012

Art and Conlanging

If you listened to the out-takes from the most recent Conlangery Podcast you heard an out-of-context quote from an experience I had in college. I went to a show of performance art pieces put on by art students. One of the episodes involved a man walking out on stage with a boom box, turning on some industrial music, stripping naked and proceeding to duct-tape sausage to himself.

From time to time, both on the show and off, Bianca and I have amused ourselves by imagining how various schools of art would map to styles of language invention. But now I think we've been going at this backwards. There is nothing intrinsically confusing about an earnest, naked gentleman with kielbasa affixed to his person. The problem is that no one in the audience knew what language he was speaking. If he had passed out a grammar and lexicon first, the audience would have some idea of what he was trying to say.

So, my current idea is that conceptual art is actually a form of conlanging. People just don't realize it yet.

Maybe I can get a paper in Social Text about this...

Sunday, January 8, 2012

And ACADEW, too: Tsolyáni

While some of us invented our first languages without any idea others might indulge in this hobby, most of us come to it through exposure to some other conlang. Esperanto was the most likely for a long time, and many, many of us create languages in the Mirkwood-deep shadow of Tolkien's languages. Somewhat more recently, Klingon may be the first taste of invented languages, and now Na'vi and Dothraki are bringing a few to the hobby.

I don't remember if I had seen Esperanto when I started creating my first languages — puny relexes of English with hybrid German-Latin grammars, including the dative case before I even know what one did with it. The deepest influence on my languages, however, come from an author who never invented a single language, Frank Herbert.

When Herbert needed non-English touches, he grabbed historical human languages and made modest sound changes, largely to accommodate his editors, I'm sure. Arabic infuses the books (CHOAM = OPEC), and Romani makes a regular appearance. He even grabs Ancient Egyptian once. But Herbert's thinking about language is deeply influenced by General Semantics. In particular, he was always worried about the ways our use of language conceals from us our assumptions about the world. I don't think he ever has a character say, "the map is not the territory," but he might as well have.

Herbert was also preoccupied with how far human capabilities could be extended without the aid of advanced computing technology. The Butlerian Jihad was necessary to wipe computers and AI from his science-fiction world to allow him to explore this. (As a side note, how on earth are the psychotropics used through the works, of which the Spice is but one, not technologies?) Herbert imagines a future in which the ruling classes use many, many languages. Even the soldiers have to master the battle language of the House they work for. (Conlang as impenetrable code has been used by more than one language inventor, I'm sure.)

The language that haunts me to this day is mentioned in Dune Messiah. It is mirabhasa:

They were using a mirabhasa language, honed phalange consonants and joined vowels. It was an instrument for conveying fine emotional subtleties. Edric, the Guild Steersman, replied to the Reverend Mother now with a vocal hurtsy contained in a sneer — a lovely touch of disdainful politeness.

Now, I have never figured out what phalange consonant is supposed to be. I'm guessing he misremembered "pharyngeal" from perusing Arabic, but one never knows.

In any case, I have tried through the years my own variations on a mirabhasa language, with different focuses. The results are often fiendishly complex and not very usable. Then I step back and try something daintier, only to find it falling short of my own ideas about such a language. Herbert took the best course in never attempting to create the language, instead bringing it into the scene as a way to allow him to comment on the political, social and power dynamics of a conversation between four very powerful groups. I, however, keep trying.

I recently hunted down copies of M.A.R. Barker's two-volume documentation on the Tsolyáni language. I have the second printing of it, published in 1981, somewhere between two and four years before I undertook the construction of my first little languages. It is a shame it is not more well-known. It has far more complete than any of Tolkien's languages, more interesting and naturalistic than Esperanto, and easier to learn and better described than Klingon. The grammar is full of examples. A precis may be found here.

Tsolyáni, naturally, has three separate ways to encode emotional and social judgements about discourse topics — two on the nouns, and one on the verbs.

First, nouns may take "general attitude prefixes," which encode personal judgements, though the same slot takes a few prefixes locating a discourse topic in time (the accents are primary and secondary stresses),

  • shàrzakási "the captain whom I somewhat humorously despise" (kási "captain")
  • chiqèkbásrim "the comically inept man" (básrim "man")
  • hoqòkólumel "the future emperor" (kólumel "emperor")

Second, nouns may take "general attitude suffixes." While the prefixes above encode one's personal opinions, these mark general assessments. So, korùsskási "the captain whom I despise and hate" vs. kásigakoi "the (widely) hated captain." From the grammar, "... contains a large number of suffix elements (or perhaps 'secondary compounding stems?') which denote "objectively held" attitudes toward the noun. These items describe the status, rank, size, and other clearly perceptible qualities of the noun — including emotional attitudes towards the noun which are shared by others besides the speaker" (3.160).

Finally, verbs may take "attitude prefixes" containing "some twenty to thirty members." These indicate the way an action is performed in the opinion of the speaker,

  • ramissúm "to slay in a contemptible fashion" (missúm "slay")
  • lüchyiráu "to pilfer" (yiráu "steal", lüch- "to act in a petty, miserly, cowardly fashion")
  • tludímlal "to strike fanatically" (dímlal "strike, hit")
  • bashtaskótl "to advance loyally" (taskótl "to advance as an army")

So, here is Mr. Barker using techniques in the 1970s it would take me a good 20 years to get to myself. Tsolyáni is a good example of why conlangers should study (or at least read the grammars of) as many different kinds of languages as possible. Barker's dissertation was a grammar of Klamath, and he is clearly quite familiar with Nahuatl.

While it might have been nice to know about Tsolyáni when I was younger, I'm not sure how likely it is I would have been to able to keep the grammar when I was a kid. It has drawings of naked slave-ladies on the front cover of both the grammar and the dictionary.

Artistic and Personal Mapmaking

General Semantics  is a philosophical movement with self-help overtones that had its heyday in the 1950s. It had impacts in a few areas, inc...