This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Group actions, power mean orbit size, and musical scales

Jesse Elliott Department of Mathematics
California State University, Channel Islands
Camarillo, California 93012
[email protected]
Abstract.

We provide an application of the theory of group actions to the study of musical scales. For any group GG, finite GG-set SS, and real number tt, we define the tt-power diameter diamt(G,S){\operatorname{diam}}_{t}(G,S) to be the size of any maximal orbit of SS divided by the tt-power mean orbit size of the elements of SS. The symmetric group S11S_{11} acts on the set of all tonic scales, where a tonic scale is a subset of 12{\mathbb{Z}}_{12} containing 0. We show that, for all t[1,1]t\in[-1,1], among all the subgroups GG of S11S_{11}, the tt-power diameter of the GG-set of all heptatonic scales is largest for the subgroup Γ\Gamma, and its conjugate subgroups, generated by {(1 2),(3 4),(5 6),(8 9),(10 11)}\{(1\ 2),(3\ 4),(5\ 6),(8\ 9),(10\ 11)\}. The unique maximal Γ\Gamma-orbit consists of the 32 thāts of Hindustani classical music popularized by Bhatkhande. This analysis provides a reason why these 32 scales, among all 462 heptatonic scales, are of mathematical interest. We also apply our analysis, to a lesser degree, to hexatonic and pentatonic scales.


Keywords: scales, group action, power mean, heptatonic scales, hexatonic scales, pentatonic scales.

MSC: 05E18, 26E60.

1. Introduction and summary

This paper provides an application of group actions to the study of musical scales and uses it to motivate a new invariant, which we call the tt-power diameter diamt(G,S){\operatorname{diam}}_{t}(G,S), defined, for any group GG, finite GG-set SS, and extended real number t{,}t\in{\mathbb{R}}\cup\{\infty,-\infty\}, to be the size of any maximal orbit of SS divided by the tt-power mean orbit size of the elements of SS.

We may represent the chromatic scale as the set

12={0,1,2,3,4,5,6,7,8,9,10,11},{\mathbb{Z}}_{12}=\{0,1,2,3,4,5,6,7,8,9,10,11\},

where 0 represents the tonic, or key, of the chromatic scale, which can be any fixed pitch class. Thus, for example, if one decides to let 0 represent the pitch class C, then 11 represents C\sharp, 22 represents D, and so on. A scale (in 12{\mathbb{Z}}_{12}) is a subset of 12{\mathbb{Z}}_{12}, while a tonic scale (in 12{\mathbb{Z}}_{12}) is a scale in 12{\mathbb{Z}}_{12} containing 0. A tonic scale is kk-tonic if it consists of kk notes, where k{1,2,,12}k\in\{1,2,\ldots,12\}; thus, any tonic scale ss is |s||s|-tonic, where |s||s| is the cardinality of ss. The kk-tonic scales, respectively, for k=1,2,,12k=1,2,\ldots,12 are called monotonic, ditonic, tritonic, tetratonic, pentatonic, hexatonic, heptatonic, octatonic, nonatonic, decatonic, hendecatonic, and chromatic. There are a total of 211=20482^{11}=2048 possible tonic scales, with a total of (11k1){11\choose k-1} kk-tonic scales for each k=1,2,3,,12k=1,2,3,\ldots,12. These numbers comprise the eleventh row of Pascal’s triangle:

1 11 55 165 330 462 462 330 165 55 11 1.1\ \ \ 11\ \ \ 55\ \ \ 165\ \ \ 330\ \ \ 462\ \ \ 462\ \ \ 330\ \ \ 165\ \ \ 55\ \ \ 11\ \ \ 1.

Thus, for example, there are 462 heptatonic scales and 330 pentatonic scales. We call a scale that may not contain the tonic an atonic scale (in 12{\mathbb{Z}}_{12}). There are a total of 212=40962^{12}=4096 possible atonic scales (including the unique empty scale), with a total of (12k){12\choose k} kk-atonic scales for each k=0,1,2,3,,12k=0,1,2,3,\ldots,12. These numbers comprise the twelfth row of Pascal’s triangle:

1 12 66 220 495 792 924 792 495 220 66 12 1.1\ \ \ 12\ \ \ 66\ \ \ 220\ \ \ 495\ \ \ 792\ \ \ 924\ \ \ 792\ \ \ 495\ \ \ 220\ \ \ 66\ \ \ 12\ \ \ 1.

Throughout this paper, 𝒯{\mathscr{T}} denotes the set of all tonic scales in 12{\mathbb{Z}}_{12} and 𝒯k{\mathscr{T}}_{k} the set of all kk-tonic scales in 12{\mathbb{Z}}_{12}. The symmetric group S11S_{11} acts naturally on the sets 𝒯{\mathscr{T}} and 𝒯k{\mathscr{T}}_{k}: a permutation σ\sigma in S11S_{11} acts on a tonic scale s𝒯s\in{\mathscr{T}} by σs=σ(s)={σ(x):xs}\sigma\cdot s=\sigma(s)=\{\sigma(x):x\in s\}, where one sets σ(0)=0\sigma(0)=0. Explicitly, an element σ\sigma of S11S_{11} maps a scale {0,s1,s2,,sk1}\{0,s_{1},s_{2},\ldots,s_{k-1}\} to the scale {0,σ(s1),σ(s2),,σ(sk1)}\{0,\sigma(s_{1}),\sigma(s_{2}),\ldots,\sigma(s_{k-1})\}. This defines an action of S11S_{11} on 𝒯{\mathscr{T}}, and since |σ(s)|=|s||\sigma(s)|=|s| for all ss, the action induces an action on 𝒯k{\mathscr{T}}_{k} for all k{1,2,,12}k\in\{1,2,\ldots,12\}. More generally, the group S12S_{12} acts on the set SS\SS of all (atonic) scales in 12{\mathbb{Z}}_{12}. Moreover, if we consider S11={σS12:σ(0)=0}S_{11}=\{\sigma\in S_{12}:\sigma(0)=0\} as a subgroup of S12S_{12}, then the action of S12S_{12} on SS\SS descends to the action of S11S_{11} on 𝒯{\mathscr{T}}. Although in this paper we focus mainly on the action of S11S_{11} on 𝒯{\mathscr{T}}, many of our results ascend appropriately to the action of S12S_{12} on SS\SS.

The group S11S_{11} has 11!=39,916,80011!=39,916,800 elements (and the group S12S_{12} has 12!=479,001,60012!=479,001,600 elements). A “musical” scale acted on by a randomly chosen element of S11S_{11} is very unlikely to be very musical. For example, under the permutation (1 4)(3 5)(8 7 9 10)(1\ 4)(3\ 5)(8\ 7\ 9\ 10), the heptatonic major scale {0,2,4,5,7,9,11}\{0,2,4,5,7,9,11\} maps to the somewhat “unmusical” scale {0,1,2,3,9,10,11}\{0,1,2,3,9,10,11\}. However, by contrast, some permutations preserve musicality fairly well, e.g., the permutation (3 4)(8 9)(3\ 4)(8\ 9), which swaps the major scale and the harmonic minor scale {0,2,3,5,7,8,11}\{0,2,3,5,7,8,11\}. One of the main questions we investigate in this paper is the following: are there medium-sized subgroups of S11S_{11} whose actions on 𝒯7{{\mathscr{T}}}_{7} preserve “musicality”? An ideal subgroup of S11S_{11} would be one that “respects musicality” in the sense that scales of approximately the same “musicality” appear in the same orbit. As we will see, some subgroups of S11S_{11} induce actions on the heptatonic scales that preserve musicality better than others do. One of our main claims is that the group Γ\Gamma of S11S_{11} generated by {(1 2),(3 4),(5 6),(8 9),(10 11)}\{(1\ 2),(3\ 4),(5\ 6),(8\ 9),(10\ 11)\} is the “best” such subgroup, and the 32 scales in its unique maximal orbit, which coincide with the 32 thāts of Hindustani (North Indian) classical music, represent under a particular measure the “most musical” scales among the 462 possible heptatonic scales.

To measure the musical efficacy of a subgroup GG of S11S_{11}, we define the GG-musicality of a scale s𝒯ks\in{\mathscr{T}}_{k} to be the size |Gs||Gs| of the GG-orbit GsGs of ss in 𝒯k{\mathscr{T}}_{k} divided by the average size of a GG-orbit of 𝒯k{\mathscr{T}}_{k}. A consequence of this definition is that the GG-musicality is a small as possible, namely 11, for all scales if and only if every orbit has the same number of elements, which holds if G=S11G=S_{11} or if GG is the trivial group. In general, the GG-musicality of a given scale will attain a maximum for some subgroups of S11S_{11} in between those two extremes.

The intution behind the concept of GG-musicality is that, if a scale has a small GG-orbit relative to the average GG-orbit size, then it has too much symmetry relative to GG and thus the notes comprising the scale are more “GG-equivalent” to each other and therefore have fewer notes that have their own individual character, whereas scales with larger orbits relative to the average orbit size are comprised of notes that can be better differentiated, or distiguished from one another, by the group GG. The thesis of this paper is that the subgroups GG of S11S_{11} that yield the largest possible GG-musicality (relative to the other subgroups of S11S_{11}) of any kk-tonic scale naturally lead to mathematically and musically interesting theories of kk-tonic scales, namely, those kk-tonic scales with the largest GG-musicality for any subgroup GG of S11S_{11}.

There is a slight subtlety here, however, since, given a group GG and a finite GG-set SS, the average size of a GG-orbit of SS can be measured in at least two distinct ways. One might naively define the average size of a GG-orbit of SS to be

|S||S/G|=i=1r|Oi|r,\frac{|S|}{|S/G|}=\frac{\sum_{i=1}^{r}|O_{i}|}{r},

where S/G={O1,O2,,Or}S/G=\{O_{1},O_{2},\ldots,O_{r}\} is the set of all GG-orbits of SS and where r=|S/G|r=|S/G| is the number of orbits. This represents the average number of elements in each orbit, in a naive sense. However, one may also define the average orbit size of the elements of SS to be

orb1(G,S)=sS|Gs||S|=i=1r|Oi|2i=1r|Oi|.{\operatorname{orb}}_{1}(G,S)=\frac{\sum_{s\in S}|Gs|}{|S|}=\frac{\sum_{i=1}^{r}|O_{i}|^{2}}{\sum_{i=1}^{r}|O_{i}|}.

This represents the expected value of |Gs||Gs| for sSs\in S, where each element of SS is equally likely to be chosen. By contrast, the number |S||S/G|\frac{|S|}{|S/G|} previously considered represents the expected value of |Oi||O_{i}|, where each orbit OiO_{i} is equally likely to be chosen. Since our focus is on the elements of SS rather than on the orbits, orb1(G,S){\operatorname{orb}}_{1}(G,S) is a better notion of average orbit size than is the naive definition |S||S/G|\frac{|S|}{|S/G|}.

Nevertheless, both of these measures of “average orbit size” have mathematical merit, and this is supported by the observation that, using power means, one may continuously deform one of these two means to the other, as follows. For any t{0}t\in{\mathbb{R}}-\{0\}, we define the tt-power mean orbit size of the elements of SS to be

orbt(G,S)=(sS|Gs|t|S|)1/t.{\operatorname{orb}}_{t}(G,S)=\left(\frac{\sum_{s\in S}|Gs|^{t}}{|S|}\right)^{1/t}.

This represents the tt-power mean of |Gs||Gs| over all sSs\in S. Clearly, orbt(G,S){\operatorname{orb}}_{t}(G,S) for t=1t=1 is the average orbit size of the elements of SS. Moreover, for t=1t=-1, we have

orb1(G,S)=(sS|Gs|1|S|)1=|S|OS/G|O||O|1=|S||S/G|,{\operatorname{orb}}_{-1}(G,S)=\left(\frac{\sum_{s\in S}|Gs|^{-1}}{|S|}\right)^{-1}=\frac{|S|}{\sum_{O\in S/G}|O||O|^{-1}}=\frac{|S|}{|S/G|},

and therefore orb1(G,S){\operatorname{orb}}_{-1}(G,S) is the average number of elements of SS in each orbit. Taking appropriate limits at t=0,±t=0,\pm\infty, one can define orbt(G,S){\operatorname{orb}}_{t}(G,S) for all t[,]t\in[-\infty,\infty], and then

orb(G,S)=max{|Gs|:sS}{\operatorname{orb}}_{\infty}(G,S)=\max\{|Gs|:s\in S\}

is the maximal orbit size of SS, and

orb(G,S)=min{|Gs|:sS}{\operatorname{orb}}_{-\infty}(G,S)=\min\{|Gs|:s\in S\}

is the minimal orbit size. Clearly, then, every GG-orbit of SS has the same size if and only if the function orbt(G,S){\operatorname{orb}}_{t}(G,S) is constant with respect to tt. One can use calculus to show that, if the function orbt(G,S){\operatorname{orb}}_{t}(G,S) is not constant, then it is bounded and has positive derivative everywhere. Moreover, from |S||S| and the function orbt(G,S){\operatorname{orb}}_{t}(G,S) for t[0,1]t\in[0,1], one can recover all of the orbit sizes. Our general philosophy is that the critical region of interest of the function orbt(G,S){\operatorname{orb}}_{t}(G,S) is the interval t[1,1]t\in[-1,1], with the value at t=1t=1 being the most important.

For any finite GG-set SS, we define the tt-power diameter diamt(G,S){\operatorname{diam}}_{t}(G,S) of SS to be

diamt(G,S)=orb(G,S)orbt(G,S)=max{|Gs|:sS}orbt(G,S).{\operatorname{diam}}_{t}(G,S)=\frac{{\operatorname{orb}}_{\infty}(G,S)}{{\operatorname{orb}}_{t}(G,S)}=\frac{\max\{|Gs|:s\in S\}}{{\operatorname{orb}}_{t}(G,S)}.

In other words, diamt(G,S){\operatorname{diam}}_{t}(G,S) is the ratio of the maximal GG-orbit size of SS to the tt-power mean orbit size of the elements of SS. The function diamt(G,S){\operatorname{diam}}_{t}(G,S), if not identically 11, has negative derivative with respect to tt and has limiting values of 11 and max{|Gs|:sS}min{|Gs|:sS}\frac{\max\{|Gs|:s\in S\}}{\min\{|Gs|:s\in S\}} at t=t=\infty and t=t=-\infty, respectively. The tt-power diameter diamt(G,S){\operatorname{diam}}_{t}(G,S) represents in an intuitive sense the amount of spread in the “kinetic energy” or “entropy” of the elements of SS under the action of GG, where elements with larger orbits, or equivalently with smaller stabilizers, are considered to have more kinetic energy. The mathematical problem we pose here is the following.

Problem 1.1.

Given a group GG, a finite GG-set SS, and t[,]t\in[-\infty,\infty], determine the subgroups HH of GG for which diamt(H,S){\operatorname{diam}}_{t}(H,S) is largest.

Such subgroups HH of GG maximize the spread of the “kinetic energy” of the elements of SS under the induced goup action.

The following is our main result regarding heptatonic scales in 12{\mathbb{Z}}_{12}.

Theorem 1.2 (with James Allen, Paul Estrada, and Michael McCann).

For all t[1,1]t\in[-1,1], the subgroups GG of S11S_{11} for which diamt(G,𝒯7){\operatorname{diam}}_{t}(G,{\mathscr{T}}_{7}) is largest are the group Γ\Gamma generated by {(1 2),(3 4),(5 6),(8 9),(10 11)}\{(1\ 2),(3\ 4),(5\ 6),(8\ 9),(10\ 11)\}, along with its conjugate subgroups.

The theorem can be proved numerically using GAP and SAGE, as follows. First, note that, for any group GG and any finite GG-set SS, the number diamt(H,S){\operatorname{diam}}_{t}(H,S) for any subgroup HH of GG depends only on the conjugacy class of HH. The group S11S_{11} has 3094 subgroups up to conjugacy. (Those that are cyclic have order 11, 22, 33, 44, 55, 66, 77, 88, 99, 1010, 1111, 1212, 1414, 1515, 1818, 2020, 2121, 2424, 2828, or 3030.) Using GAP and SAGE, one can compute generators for representatives of all 3094 conjugacy classes, and for each of these representatives GG one can compute the GG-orbits of 𝒯7{\mathscr{T}}_{7}. Then the functions diamt(G,𝒯7){\operatorname{diam}}_{t}(G,{\mathscr{T}}_{7}) can be plotted and verified to achieve a maximum for G=ΓG=\Gamma for all tt in the interval [1,1][-1,1].

The fact that diamt(G,𝒯7){\operatorname{diam}}_{t}(G,{\mathscr{T}}_{7}) attains a maximum on the entire interval [1,1][-1,1] for a single conjugacy class of subgroups of S11S_{11} is in itself a surprising result. One has

diam1(Γ,𝒯7)3.5250,{\operatorname{diam}}_{1}(\Gamma,{\mathscr{T}}_{7})\approx 3.5250,
diam0(Γ,𝒯7)4.8324,{\operatorname{diam}}_{0}(\Gamma,{\mathscr{T}}_{7})\approx 4.8324,
diam1(Γ,𝒯7)6.6494.{\operatorname{diam}}_{-1}(\Gamma,{\mathscr{T}}_{7})\approx 6.6494.

Thus, for example, a consequence of Theorem 1.2 is that, for any subgroup GG of S11S_{11}, the maximal GG-orbit size of 𝒯7{\mathscr{T}}_{7} is at most 3.52513.5251 times the average GG-orbit size of the elements of 𝒯7{\mathscr{T}}_{7}, and the maximum ratio possible is obtained precisely by the group G=ΓG=\Gamma and its conjugates.

Given a subgroup GG of S11S_{11} and a scale s𝒯s\in{\mathscr{T}}, we define the (G,t)(G,t)-musicality of ss to be the quantity

m(G,t,s)=|Gs|orbt(G,𝒯|s|),m(G,t,s)=\frac{|Gs|}{{\operatorname{orb}}_{t}(G,{\mathscr{T}}_{|s|})},

which is the size of the GG-orbit of ss relative to the tt-power mean orbit size of the elements of 𝒯|s|{\mathscr{T}}_{|s|}. Thus, the quantity

diamt(G,𝒯k)=max{m(G,t,s):s𝒯k}{\operatorname{diam}}_{t}(G,{\mathscr{T}}_{k})=\max\{m(G,t,s):s\in{\mathscr{T}}_{k}\}

represents the largest possible (G,t)(G,t)-musicality of any kk-tonic scale. Theorem 1.2 says that the heptatonic scales with the largest possible (G,t)(G,t)-musicality for any subgroup GG of S11S_{11} and any t[1,1]t\in[-1,1] occur precisely for the group G=ΓG=\Gamma and its conjugates. These scales comprise the unique maximal Γ\Gamma-orbit of 𝒯7{\mathscr{T}}_{7} and consist of the 32 heptatonic scales

{0,12,34,56,7,89,1011},\left\{0,{1\atop 2},{3\atop 4},{5\atop 6},7,{8\atop 9},{10\atop 11}\right\},

or equivalently, starting, say, at C, the 32 scales

{C,DD,EE,FF,G,AA,BB}\left\{\mbox{C},{\mbox{D}\flat\atop\mbox{D}},{\mbox{E}\flat\atop\mbox{E}},{\mbox{F}\atop\mbox{F}\sharp},\mbox{G},{\mbox{A}\flat\atop\mbox{A}},{\mbox{B}\flat\atop\mbox{B}}\right\}

listed in Table 1. This orbit contains the major scale, the harmonic and melodic minor scales, and many other heptatonic scales that figure prominently in Western and Indian classical music. In fact, all 32 of these scales are among the 72 mēḷakarta ragas of Carnatic (South Indian) classical music standardized by Govindacharya in the 18th century and coincide with the 32 thāts of Hindustani classical music popularlized by the system created by Vishnu Narayan Bhatkhande (1860–1936), one of the most influential musicologists in the field of Hindustani classical music in the twentieth century.

Table 1. The 32 scales in the maximal Γ\Gamma-orbit of 𝒯7{\mathscr{T}}_{7} (the 32 thāts of Hindustani classical music)
major, Ionian mode, or Bilāwal thāt C D E F G A B
Mixolydian or Adonai malakh mode, or Khamaj thāt C D E F G A B\flat
harmonic major C D E F G A\flat B
Mixolydian b6 C D E F G A\flat B\flat
Lydian mode, or Kalyan thāt C D E F\sharp G A B
acoustic, or Lydian dominant C D E F\sharp G A B\flat
C D E F\sharp G A\flat B
minor Lydian C D E F\sharp G A\flat B\flat
ascending melodic minor C D E\flat F G A B
Dorian mode, or Kāfi thāt C D E\flat F G A B\flat
harmonic minor C D E\flat F G A\flat B
natural minor, Aeolian mode, or Āsāvari thāt C D E\flat F G A\flat B\flat
diminished Lydian C D E\flat F\sharp G A B
Ukrainian Dorian C D E\flat F\sharp G A B\flat
Hungarian minor C D E\flat F\sharp G A\flat B
gypsy C D E\flat F\sharp G A\flat B\flat
C D\flat E F G A B
C D\flat E F G A B\flat
double harmonic, or flamenco mode C D\flat E F G A\flat B
Phrygian dominant C D\flat E F G A\flat B\flat
Mārvā thāt C D\flat E F\sharp G A B
C D\flat E F\sharp G A B\flat
Pūrvi thāt C D\flat E F\sharp G A\flat B
C D\flat E F\sharp G A\flat B\flat
Neapolitan major C D\flat E\flat F G A B
Phrygian raised sixth C D\flat E\flat F G A B\flat
Neapolitan minor C D\flat E\flat F G A\flat B
Phrygian mode, or Bhairav thāt C D\flat E\flat F G A\flat B\flat
C D\flat E\flat F\sharp G A B
C D\flat E\flat F\sharp G A B\flat
Todi thāt C D\flat E\flat F\sharp G A\flat B
Bhairavi thāt, or Pelog (approximate) C D\flat E\flat F\sharp G A\flat B\flat

The remainder of this paper is organized as follows. In Section 2 we discuss power means and expected values, and in Section 3 we apply Section 2 to the study of the power mean orbit size and diameter of a finite GG-set. In Section 4 we apply Section 3 to the study of scales in 12{\mathbb{Z}}_{12}. In Section 5 we focus on heptatonic scales in particular, while in Section 6 we briefly study hextonic scales, and in Section 7 we study pentatonic scales.

I would like to thank the two reviewers for their thoughtful and invaluable input on the first draft of this paper. As one of the reviewers pointed out, it is likely that the methods of this paper can be combined synergistically with other ways of understanding musical scales, such as those developed in [1], [2], [3], [4], [5], [6], [7], [8], and [9]. It is not my intention that the results in this paper be definitive. My aim is merely to provide yet another perspective on the already well-developed mathematical theories of musical scales, one that, in my view, has also inspired some new and interesting problems regarding the theory of group actions. Based on the reviewers comments, I also discuss some ways in which the theory might be amended or developed further.

The research for this paper was conducted with undergraduate students James Allen and Paul Estrada and MS student Michael McCann at California State University, Channel Islands, in the academic year 2016–17, under the supervision of the author. The idea for the project began with conversations between the author and undergraduate student Vickie Chen during a semester-long project on group theory in music for a first course in abstract algebra. It is in those conversations that Vickie and I first came up with the idea of examining the maximal Γ\Gamma-orbit of 𝒯7{\mathscr{T}}_{7}, an idea that, to our pleasant surprise, eventually led to Theorem 1.2.

2. Power means and expected values

Arithmetic means are generalized by what are known as power means. If S={s1,,sr}S=\{s_{1},\ldots,s_{r}\} is a finite set of cardinality r=|S|r=|S| and X:S>0X:S\longrightarrow{\mathbb{R}}_{>0} a positive real-valued random variable on SS (with the uniform distribution on SS), then, for any nonzero tt\in{\mathbb{R}}, the tt-power mean of XX is defined to be the positive real number

𝕄t(X)=𝕄t(X(s):sS)=𝕄t(x1,,xr)=(i=1rxitr)1/t,{\mathbb{M}}_{t}(X)={\mathbb{M}}_{t}(X(s):s\in S)={\mathbb{M}}_{t}(x_{1},\ldots,x_{r})=\left(\frac{\sum_{i=1}^{r}x_{i}^{t}}{r}\right)^{1/t},

where xi=X(si)x_{i}=X(s_{i}) for all ii. Equivalently, the arithmetic mean of XX is just 𝕄1(X){\mathbb{M}}_{1}(X), and one sets

𝕄t(X)=𝕄1(Xt)1/t.{\mathbb{M}}_{t}(X)={\mathbb{M}}_{1}(X^{t})^{1/t}.

For a=0,±a=0,\pm\infty, one defines

𝕄a(X)=𝕄a(X(s):sS)=𝕄a(x1,,xr)=limta𝕄t(x1,,xr).{\mathbb{M}}_{a}(X)={\mathbb{M}}_{a}(X(s):s\in S)={\mathbb{M}}_{a}(x_{1},\ldots,x_{r})=\lim_{t\rightarrow a}{\mathbb{M}}_{t}(x_{1},\ldots,x_{r}).

It is well known that

𝕄0(x1,,xr)=(i=1rxi)1/r{\mathbb{M}}_{0}(x_{1},\ldots,x_{r})=\left(\prod_{i=1}^{r}x_{i}\right)^{1/r}

is the geometric mean of the xix_{i}, while

𝕄(x1,,xr)=max(x1,,xr){\mathbb{M}}_{\infty}(x_{1},\ldots,x_{r})=\max(x_{1},\ldots,x_{r})

and

𝕄(x1,,xr)=min(x1,,xr){\mathbb{M}}_{-\infty}(x_{1},\ldots,x_{r})=\min(x_{1},\ldots,x_{r})

are the maximum and minimum, respectively, of the xix_{i}.

Let [,]={,}[-\infty,\infty]={\mathbb{R}}\cup\{\infty,-\infty\} denote the set of all extended real numbers. For all t[,]t\in[-\infty,\infty] one has

𝕄t(cX)=c𝕄t(X){\mathbb{M}}_{t}(cX)=c{\mathbb{M}}_{t}(X)

for all c>0c>0 and

𝕄t(X1)=𝕄t(X)1.{\mathbb{M}}_{t}(X^{-1})={\mathbb{M}}_{-t}(X)^{-1}.

It follows that, if XY=cXY=c, that is, if Y=c/XY=c/X, then

𝕄t(X)𝕄t(Y)=c{\mathbb{M}}_{t}(X){\mathbb{M}}_{-t}(Y)=c

for all tt.

We may generalize the definition of 𝕄t(X){\mathbb{M}}_{t}(X) by assuming that SS is a finite probability space with probability distribution :S[0,1]{\mathbb{P}}:S\longrightarrow[0,1], which we assume is nonzero at all elements of SS, and X:S>0X:S\longrightarrow{\mathbb{R}}_{>0} is a positive real-valued random variable on SS. (Previously we implicitly assumed that (s)=1|S|{\mathbb{P}}(s)=\frac{1}{|S|} for all sSs\in S.) We may define the tt-power expected value of XX to be

𝔼t(X)=𝔼t(X(s):sS)=(sS(s)X(s)t)1/t.{\mathbb{E}}_{t}(X)={\mathbb{E}}_{t}(X(s):s\in S)=\left(\sum_{s\in S}{\mathbb{P}}(s)X(s)^{t}\right)^{1/t}.

Equivalently, the expected value of XX is just 𝔼1(X){\mathbb{E}}_{1}(X), and one sets

𝔼t(X)=𝔼1(Xt)1/t.{\mathbb{E}}_{t}(X)={\mathbb{E}}_{1}(X^{t})^{1/t}.

For a=0,±a=0,\pm\infty, one defines

𝔼a(X)=limta𝔼t(X).{\mathbb{E}}_{a}(X)=\lim_{t\rightarrow a}{\mathbb{E}}_{t}(X).

One has

𝔼0(X)=sSX(s)(s),{\mathbb{E}}_{0}(X)=\prod_{s\in S}X(s)^{{\mathbb{P}}(s)},

while

𝔼(X)=maxX(S){\mathbb{E}}_{\infty}(X)=\max X(S)

and

𝔼(X)=minX(S).{\mathbb{E}}_{-\infty}(X)=\min X(S).

If XX is constant, then clearly 𝔼t(X){\mathbb{E}}_{t}(X) is a constant function of tt and one has 𝔼t(X)=X(s){\mathbb{E}}_{t}(X)=X(s) for all tt and all sSs\in S. Conversely, if 𝔼t(X){\mathbb{E}}_{t}(X) is a constant function of tt, then maxX(S)=minX(S)\max X(S)=\min X(S), whence XX must be constant.

One can show that the function 𝔼t(X){\mathbb{E}}_{t}(X) of tt is differentiable with nonnegative derivative, and is therefore nondecreasing, with respect to tt. Thus, one has

minX(S)𝔼t(X)maxX(S)\min X(S)\leq{\mathbb{E}}_{t}(X)\leq\max X(S)

for all tt. Moreover, if XX is nonconstant, then 𝔼t(X){\mathbb{E}}_{t}(X) has positive derivative, and therefore is strictly increasing, with respect to tt. In other words, if nonconstant, the function 𝔼t(X){\mathbb{E}}_{t}(X) of tt is a sigmoid function, that is, it is a bounded differentiable function from {\mathbb{R}} to {\mathbb{R}} whose derivative is everywhere positive. Thus it has horizontal asymptotes, specifically at y=maxX(S)y=\max X(S) and y=minX(S)y=\min X(S) at \infty and -\infty, respectively. Thus, its graph is an “S-shaped” curve. For an explicit example using the uniform probability distribution, see Figure 1.

Refer to caption
Figure 1. Graph of 𝕄t(2,2,7,8,10)=(2t+2t+7t+8t+10t)5)1/t{\mathbb{M}}_{t}(2,2,7,8,10)=\left(\frac{2^{t}+2^{t}+7^{t}+8^{t}+10^{t})}{5}\right)^{1/t} on [10,10][-10,10]

3. Power mean orbit size of a finite GG-set

Throughout this section, GG denotes a group and SS a finite GG-set. One may readily generalize all of what follows to the situation where SS is also assumed to be a probability space; in that case, one simply replaces all tt-power means with tt-power expected values.

For any t[,]t\in[-\infty,\infty], we define the tt-power mean orbit size of the elements of SS to be

orbt(G,S)=𝕄t(|Gs|:sS),{\operatorname{orb}}_{t}(G,S)={\mathbb{M}}_{t}(|Gs|:s\in S),

which, for t0,±t\neq 0,\pm\infty is given by

orbt(G,S)=(sS|Gs|t|S|)1/t=(OS/G|O|t+1|S|)1/t.{\operatorname{orb}}_{t}(G,S)=\left(\frac{\sum_{s\in S}|Gs|^{t}}{|S|}\right)^{1/t}=\left(\frac{\sum_{O\in S/G}|O|^{t+1}}{|S|}\right)^{1/t}.

As observed in the introduction, orb1(G,S){\operatorname{orb}}_{1}(G,S) is the average orbit size of the elements of SS, and orb1(G,S)=|S||S/G|{\operatorname{orb}}_{-1}(G,S)=\frac{|S|}{|S/G|} is the average number of elements of SS in each orbit, while orb(G,s)=max{|Gs|:sS}{\operatorname{orb}}_{\infty}(G,s)=\max\{|Gs|:s\in S\} is the maximal orbit size of SS and orb(G,s)=min{|Gs|:sS}{\operatorname{orb}}_{-\infty}(G,s)=\min\{|Gs|:s\in S\} is the minimal orbit size.

For any sSs\in S, we define the tt-power relative size of ss to be

|s|G,S,t=|Gs|orbt(G,S).|s|_{G,S,t}=\frac{|Gs|}{{\operatorname{orb}}_{t}(G,S)}.

The tt-power relative size of ss is the size of the orbit of ss relative to (or normalized with respect to) the tt-power mean orbit size of the elements of SS. The tt-power mean of the orbit sizes of the elements of SS is equal to

𝕄t(|Gs|:sS)=orbt(G,S){\mathbb{M}}_{t}(|Gs|:s\in S)={\operatorname{orb}}_{t}(G,S)

of SS, while the tt-power mean of the tt-power relative sizes of the elements of SS is equal to 11:

𝕄t(|s|G,S,t:sS)=1.{\mathbb{M}}_{t}(|s|_{G,S,t}:s\in S)=1.

Because of this normalization property, if HH and KK are subgroups of GG, then it makes sense to compare the values of |s|H,S,t|s|_{H,S,t} and |s|K,S,t|s|_{K,S,t} with each other.

The tt-power diameter diamt(G,S){\operatorname{diam}}_{t}(G,S) of SS, as defined in the introduction, is equivalently the maximal tt-power relative size of an element of SS, that is, one has

diamt(G,S)=max{|s|G,S,t:sS}=max{|Gs|:sS}orbt(G,S)=orb(G,S)orbt(G,S).{\operatorname{diam}}_{t}(G,S)=\max\{|s|_{G,S,t}:s\in S\}=\frac{\max\{|Gs|:s\in S\}}{{\operatorname{orb}}_{t}(G,S)}=\frac{{\operatorname{orb}}_{\infty}(G,S)}{{\operatorname{orb}}_{t}(G,S)}.

The function diamt(G,S){\operatorname{diam}}_{t}(G,S), if not identically 11, has negative derivative with respect to tt and has limiting values of 11 and max{|Gs|:sS}min{|Gs|:sS}\frac{\max\{|Gs|:s\in S\}}{\min\{|Gs|:s\in S\}} at t=t=\infty and t=t=-\infty, respectively.

For example, Figure 2 provides the graph of orbt(G,S){\operatorname{orb}}_{t}(G,S) and diamt(G,S){\operatorname{diam}}_{t}(G,S) for any GG-set SS with orbit sizes 22, 22, 77, 88, and 1010.

Refer to caption
Figure 2. Graph of orbt(G,S)=(2t+1+2t+1+7t+1+8t+1+10t+129)1/t{\operatorname{orb}}_{t}(G,S)=\left(\frac{2^{t+1}+2^{t+1}+7^{t+1}+8^{t+1}+10^{t+1}}{29}\right)^{1/t} and diamt(G,S)=10(2t+1+2t+1+7t+1+8t+1+10t+129)1/t{\operatorname{diam}}_{t}(G,S)=10\left(\frac{2^{t+1}+2^{t+1}+7^{t+1}+8^{t+1}+10^{t+1}}{29}\right)^{-1/t} on [10,10][-10,10] for any GG-set SS with orbit sizes 22, 22, 77, 88, and 1010.

4. Musicality of scales and groups

Throughout this section, GG denotes a subgroup of S11S_{11}, which acts on the set 𝒯{\mathscr{T}} of all 2112^{11} possible tonic scales, as well as on the subsets 𝒯k{\mathscr{T}}_{k} of all (11k1){11\choose k-1} possible kk-tonic scales, for all k=1,2,,12k=1,2,\ldots,12, as described in the introduction, and tt denotes a number or variable with values in the extended reals [,][-\infty,\infty].

For any s𝒯s\in{\mathscr{T}}, that is, for any tonic scale ss, we define the (G,t)(G,t)-musicality of ss, or of the orbit GsGs, to be

m(G,t,s)=|s|G,𝒯|s|,t=|Gs|orbt(G,𝒯|s|),m(G,t,s)=|s|_{G,{\mathscr{T}}_{|s|},t}=\frac{|Gs|}{{\operatorname{orb}}_{t}(G,{\mathscr{T}}_{|s|})},

which is the tt-power relative size of ss in 𝒯|s|{\mathscr{T}}_{|s|} (not in 𝒯{\mathscr{T}}). Musicality defines a natural function

m:Subgp(S11)×[,]×𝒯[1,),m:\operatorname{Subgp}(S_{11})\times[-\infty,\infty]\times{\mathscr{T}}\longrightarrow[1,\infty),

where Subgp(S11)\operatorname{Subgp}(S_{11}) denotes the lattice of all subgroups of S11S_{11}. For a fixed GG, tt, and kk, the tt-power mean 𝕄t(m(G,t,s):s𝒯k){\mathbb{M}}_{t}(m(G,t,s):s\in{\mathscr{T}}_{k}) of m(G,t,s)m(G,t,s) over all kk-tonic scales ss is equal to 11. The (G,t)(G,t)-musicality m(G,t,s)m(G,t,s) of a kk-tonic scale ss is directly proportional to the size of its GG-orbit. The constant of proportionality depends on tt and is defined natually in such a way that one can meaningfully compare the values for various subgroups GG of S11S_{11} for a fixed ss, or compare the values for various scales ss for a fixed group GG.

Observe that

diamt(G,𝒯k)=max{|Gs|:sS}orbt(G,𝒯k)=max{m(G,t,s):s𝒯k}.{\operatorname{diam}}_{t}(G,{\mathscr{T}}_{k})=\frac{\max\{|Gs|:s\in S\}}{{\operatorname{orb}}_{t}(G,{\mathscr{T}}_{k})}=\max\{m(G,t,s):s\in{\mathscr{T}}_{k}\}.

Thus diamt(G,𝒯k){\operatorname{diam}}_{t}(G,{\mathscr{T}}_{k}) is the largest possible (G,t)(G,t)-musicality of a scale in 𝒯k{\mathscr{T}}_{k}, or equivalently it is the (G,t)(G,t)-musicality of any maximal GG-orbit of 𝒯k{\mathscr{T}}_{k}. If we let 𝒯k//G{\mathscr{T}}_{k}//G denote the set of all maximal GG-orbits of 𝒯k{\mathscr{T}}_{k}, then the union 𝒯k,G=(𝒯k//G){\mathscr{T}}_{k,G}=\bigcup({\mathscr{T}}_{k}//G) is the set of all scales in 𝒯k{\mathscr{T}}_{k} that have the largest possible (G,t)(G,t)-musicality (namely, diamt(G,𝒯k){\operatorname{diam}}_{t}(G,{\mathscr{T}}_{k})) for any tt. We call the scales in the set 𝒯k,G{\mathscr{T}}_{k,G} the kk-tonic scales of GG. Our philosophy is that the scales in the set 𝒯k,G{\mathscr{T}}_{k,G} should be regarded as the optimally musical kk-tonic scales relative to GG.

Let GG be a subgroup of SnS_{n}. We define a signature of GG (in SnS_{n}) to be a list (n1,n2,,nk)(n_{1},n_{2},\ldots,n_{k}) of the GG-orbit sizes of {{1},{2},,{n}}\{\{1\},\{2\},\ldots,\{n\}\}, listed in any particular order. In particular, one has n=n1+n2++nkn=n_{1}+n_{2}+\cdots+n_{k} for any signature (n1,n2,,nk)(n_{1},n_{2},\ldots,n_{k}) of GG in SnS_{n}. We say that GG acts without crossings (in SnS_{n}), if the GG-orbits of the GG-set {{1},{2},,{n}}\{\{1\},\{2\},\ldots,\{n\}\} are all of the form {{a},{a+1},,{a+k}}\{\{a\},\{a+1\},\ldots,\{a+k\}\}, where the addition here is ordinary addition of positive integers, not addition modulo nn. (In the case at hand, n=11n=11, and 0 is omitted from the discussion because we have chosen to leave 0 fixed by S11S_{11}.) It is clear that every subgroup of SnS_{n} is conjugate to a subgroup that acts without crossings. In fact, the conjugates of GG that act without crossings in SnS_{n} are in one-to-one correspondence with the signatures of GG in SnS_{n}. If GG acts without crossings in SnS_{n}, we define the signature of GG (in SnS_{n}) to be the list (n1,n2,,nk)(n_{1},n_{2},\ldots,n_{k}) of the orbit sizes of {{1},{2},,{n}}\{\{1\},\{2\},\ldots,\{n\}\}, listed in order so that the orbits are {{1},{2},,{n1}}\{\{1\},\{2\},\ldots,\{n_{1}\}\}, {{n1+1},{n1+2},,{n1+n2}}\{\{n_{1}+1\},\{n_{1}+2\},\ldots,\{n_{1}+n_{2}\}\}, etc.

Let n1,n2,,ndn_{1},n_{2},\ldots,n_{d} be a sequence of positive integers whose sum is 1111. Then the group Sn1×Sn2××SndS_{n_{1}}\times S_{n_{2}}\times\cdots\times S_{n_{d}} naturally embeds into S11S_{11} in the following way. The first factor Sn1S_{n_{1}} acts on the first n1n_{1} numbers, 1,2,,n11,2,\ldots,n_{1}. The next factor Sn2S_{n_{2}} acts on the next n2n_{2} numbers, n1+1,n1+2,,n1+n2n_{1}+1,n_{1}+2,\ldots,n_{1}+n_{2}. And so onward to the last factor SndS_{n_{d}}, which acts on the last ndn_{d} numbers, n1++nd1+1n_{1}+\cdots+n_{d-1}+1 to 1111. We denote the image of the embedding

Φ:Sn1×Sn2××SndS11\Phi:S_{n_{1}}\times S_{n_{2}}\times\cdots\times S_{n_{d}}\longrightarrow S_{11}

described above by Sn1,n2,,ndS_{n_{1},n_{2},\ldots,n_{d}}. We say that a twelve tone group of signature at most (n1,n2,nd)(n_{1},n_{2},\ldots n_{d}) is a subgroup of Sn1,n2,,ndS_{n_{1},n_{2},\ldots,n_{d}} of the form Φ(G1×G2××Gd)\Phi(G_{1}\times G_{2}\times\cdots\times G_{d}), where GiG_{i} is a subgroup of SniS_{n_{i}} for all ii. Note that the group Sn1,n2,,ndSn2××SndS_{n_{1},n_{2},\ldots,n_{d}}\cong S_{n_{2}}\times\cdots\times S_{n_{d}} is the largest twelve tone group of signature at most (n1,n2,nd)(n_{1},n_{2},\ldots n_{d}) in the sense that it contains every twelve tone group of signature at most (n1,n2,nd)(n_{1},n_{2},\ldots n_{d}). For this reason we call it the maximal twelve tone group of signature (n1,n2,nd)(n_{1},n_{2},\ldots n_{d}). For example, the group S11S_{11} the maximal twelve tone group of signature (11)(11), and therefore every subgroup of S11S_{11} is a twelve tone group of signature (11)(11). Note that all twelve tone groups act without crossings, and the signature of the maximal twelve tone group of signature (n1,n2,nd)(n_{1},n_{2},\ldots n_{d}) is (n1,n2,nd)(n_{1},n_{2},\ldots n_{d}).

Of course, there are other natural embeddings of Sn1×Sn2××SndS_{n_{1}}\times S_{n_{2}}\times\cdots\times S_{n_{d}} in S11S_{11}. For example, S4×S7S_{4}\times S_{7} can be embedded in S11S_{11} by allowing the first factor to act, say, on {2,4,7,8}\{2,4,7,8\} and the second factor on {1,3,5,6,9,10,11}\{1,3,5,6,9,10,11\}. Such an embedding does not yield a twelve tone group with signature (4,7)(4,7). In loose terminology, twelve tone groups do not allow the factors to act in a ways that are “intertwined”: no “crossing” is allowed. Philosophically, this restriction can be motivated as follows. The chromatic scale has a linear ordering, and our goal is to understand how various scales may be transformed from one to the other. The most obvious and most common way in which this is done is by changing various notes of the scale by applying operations ,,\natural,\sharp,\flat, which can be modeled by permuting neighboring notes. The notion of a twelve tone group is meant to capture this notion of locality.

Even without these locality restrictions, our mathematical analysis of twelve tone groups will apply equally well to any of the embeddings of Sn1×Sn2××SndS_{n_{1}}\times S_{n_{2}}\times\cdots\times S_{n_{d}} in S11S_{11} that allow crossings, because we can simply relabel the numbers 0 through 1111 so that there are no crossings. We say that a maximal twelve tone group of signature (n1,n2,,nd)(n_{1},n_{2},\ldots,n_{d}) with or without crossings is a subgroup of S11S_{11} that is the result of applying some (inner) automorphism of S11S_{11} to the maximal twelve tone group Sn1,n2,,ndS_{n_{1},n_{2},\ldots,n_{d}}. We note the following proposition, whose proof is elementary.

Proposition 4.1.

There are 210=10242^{10}=1024 possible signatures, hence 10241024 maximal twelve tone groups, corresponding to 1010 independent choices of whether or not to separate ii from i+1i+1, for i=1,,10i=1,\ldots,10. There are 678570678570 maximal twelve tone groups with or without crossings, corresponding to the 678570678570 possible partitions of an eleven element set. Up to isomorphism, there are p(11)=56p(11)=56 maximal twelve tone groups (or maximal twelve tone groups with or without crossings), corresponding to the 5656 possible partitions of the number 1111.

5. Heptatonic scales

In this section we are primarily interested in the action of subgroups of S11S_{11} on the set 𝒯7{\mathscr{T}}_{7} of all heptatonic (tonic) scales, which has 462 elements.

The following proposition gives a formula for diamt(G,𝒯7){\operatorname{diam}}_{t}(G,{\mathscr{T}}_{7}) for any maximal twelve tone group GG. Multisets are a generalization of sets where, as with tuples, repetition is allowed, but, as with sets, order doesn’t matter.

Proposition 5.1.

Let GG be a maximal twelve tone subgroup of S11S_{11} of signature (n1,n2,,nd)(n_{1},n_{2},\ldots,n_{d}) with or without crossings. The multiset of GG-orbit sizes of 𝒯7{\mathscr{T}}_{7} is the multiset

{(n1k1)(n2k2)(ndkd):k1,k2,,kd>0,k1+k2++kd=6,ikini}\left\{{n_{1}\choose k_{1}}{n_{2}\choose k_{2}}\cdots{n_{d}\choose k_{d}}:k_{1},k_{2},\ldots,k_{d}\in{\mathbb{Z}}_{>0},k_{1}+k_{2}+\cdots+k_{d}=6,\forall i\ {k_{i}\leq n_{i}}\right\}

of positive integers. Therefore, one has

orbt(G,𝒯7)=(1462k1+k2++kd=6kini((n1k1)(n2k2)(ndkd))t+1)1/t{\operatorname{orb}}_{t}(G,{\mathscr{T}}_{7})=\left(\frac{1}{462}\sum_{{k_{1}+k_{2}+\cdots+k_{d}=6}\atop{k_{i}\leq n_{i}}}\left({n_{1}\choose k_{1}}{n_{2}\choose k_{2}}\cdots{n_{d}\choose k_{d}}\right)^{t+1}\right)^{1/t}

for all t0,±t\neq 0,\pm\infty, the maximal GG-orbits of 𝒯7{\mathscr{T}}_{7} have size

orb(G,𝒯7)=maxk1+k2++kd=6kini(n1k1)(n2k2)(ndkd),{\operatorname{orb}}_{\infty}(G,{\mathscr{T}}_{7})=\max_{{k_{1}+k_{2}+\cdots+k_{d}=6}\atop{k_{i}\leq n_{i}}}{n_{1}\choose k_{1}}{n_{2}\choose k_{2}}\cdots{n_{d}\choose k_{d}},

and one has

diamt(G,𝒯7)=orb(G,𝒯7)orbt(G,𝒯7).{\operatorname{diam}}_{t}(G,{\mathscr{T}}_{7})=\frac{{\operatorname{orb}}_{\infty}(G,{\mathscr{T}}_{7})}{{\operatorname{orb}}_{t}(G,{\mathscr{T}}_{7})}.

One also has

462=(116)=k1+k2++kd=6kini(n1k1)(n2k2)(ndkd)462={11\choose 6}=\sum_{{k_{1}+k_{2}+\cdots+k_{d}=6}\atop{k_{i}\leq n_{i}}}{n_{1}\choose k_{1}}{n_{2}\choose k_{2}}\cdots{n_{d}\choose k_{d}}

and the number of GG-orbits of 𝒯7{\mathscr{T}}_{7} is equal to

k1+k2++kd=6kini1.\sum_{{k_{1}+k_{2}+\cdots+k_{d}=6}\atop{k_{i}\leq n_{i}}}1.
Proof.

The proof is elementary. ∎

Clearly, this proposition generalizes to kk-tonic scales by replacing 𝒯7{\mathscr{T}}_{7} everywhere in the proposition with 𝒯k{\mathscr{T}}_{k}, replacing the number 66 everywhere with the number k1k-1, and replacing the number 462 with (11k1){11\choose k-1}. (It even works for an NN-note chromatic scale by replacing 1111 everywhere with N1N-1.) It also generalizes to kk-atonic scales.

Using the proposition, one can compute diamt(G,𝒯7){\operatorname{diam}}_{t}(G,{\mathscr{T}}_{7}) for all 56 maximal twelve tone groups GG, say, for the critical values t=1,0,1t=1,0,-1. These values are listed in Table 2 in descending order of diam1(G,𝒯7){\operatorname{diam}}_{1}(G,{\mathscr{T}}_{7}).

Table 2. Maximal twelve tone groups
Signature Maximal orbits #\# orbits diam1(G,𝒯7){\operatorname{diam}}_{1}(G,{\mathscr{T}}_{7}) diam0(G,𝒯7){\operatorname{diam}}_{0}(G,{\mathscr{T}}_{7}) diam1(G,𝒯7){\operatorname{diam}}_{-1}(G,{\mathscr{T}}_{7})
(2,2,2,2,2,1)(2,2,2,2,2,1) 1 of size 32 96 3.5250 4.8324 6.6494
(3,2,2,2,2)(3,2,2,2,2) 1 of size 48 61 3.0689 4.3060 6.3377
(2,2,2,2,1,1,1)(2,2,2,2,1,1,1) 3 of size 16 131 2.7603 3.5264 4.5368
(4,2,2,2,1)(4,2,2,2,1) 1 of size 48 48 2.6679 3.4917 4.9870
(3,2,2,2,1,1)(3,2,2,2,1,1) 3 of size 24 83 2.4115 3.1423 4.3117
(5,2,2,2)(5,2,2,2) 1 of size 80 26 2.3864 3.1193 4.5022
(4,3,2,2)(4,3,2,2) 1 of size 72 31 2.3203 3.1114 4.8312
(2,2,2,1,1,1,1,1)(2,2,2,1,1,1,1,1) 10 of size 8 179 2.1513 2.5733 3.0996
(3,3,2,2,1)(3,3,2,2,1) 3 of size 36 53 2.1027 2.8000 4.1299
(4,2,2,1,1,1)(4,2,2,1,1,1) 3 of size 24 65 2.0921 2.5480 3.3766
(4,4,2,1)(4,4,2,1) 1 of size 72 25 2.0182 2.5229 3.8961
(6,2,2,1)(6,2,2,1) 1 of size 80 18 1.9833 2.3882 3.1169
(3,2,2,1,1,1,1)(3,2,2,1,1,1,1) 10 of size 12 113 1.8844 2.2930 2.9351
(5,2,2,1,1)(5,2,2,1,1) 3 of size 40 35 1.8788 2.2763 3.0303
(3,3,3,2)(3,3,3,2) 3 of size 54 34 1.8293 2.4951 3.9740
(4,3,2,1,1)(4,3,2,1,1) 3 of size 36 42 1.8261 2.2705 3.2727
(7,2,2)(7,2,2) 1 of size 140 9 1.8033 2.1611 2.7273
(5,4,2)(5,4,2) 1 of size 120 14 1.8032 2.2539 3.6364
(4,4,3)(4,4,3) 1 of size 108 16 1.7533 2.2482 3.7403
(6,3,2)(6,3,2) 1 of size 120 12 1.7228 2.1281 3.1169
(2,2,1,1,1,1,1,1,1)(2,2,1,1,1,1,1,1,1) 35 of size 4 245 1.6709 1.8779 2.1212
(3,3,2,1,1,1)(3,3,2,1,1,1) 10 of size 18 72 1.6480 2.0433 2.8052
(5,3,2,1)(5,3,2,1) 3 of size 60 23 1.6364 2.0284 2.9870
(4,2,1,1,1,1,1)(4,2,1,1,1,1,1) 10 of size 12 88 1.6325 1.8594 2.2857
(4,3,3,1)(4,3,3,1) 3 of size 54 27 1.5907 2.0232 3.1558
(4,4,1,1,1)(4,4,1,1,1) 3 of size 36 34 1.5849 1.8411 2.6494
(6,2,1,1,1)(6,2,1,1,1) 3 of size 40 24 1.5577 1.7428 2.0779
(6,4,1)(6,4,1) 1 of size 120 10 1.4994 1.7256 2.5974
(5,2,1,1,1,1)(5,2,1,1,1,1) 10 of size 20 47 1.4704 1.6611 2.0346
(8,2,1)(8,2,1) 1 of size 140 6 1.4667 1.6141 1.8182
(3,2,1,1,1,1,1,1)(3,2,1,1,1,1,1,1) 35 of size 6 154 1.4667 1.6733 2.0000
(3,3,3,1,1)(3,3,3,1,1) 10 of size 27 46 1.4388 1.8207 2.6883
(4,3,1,1,1,1)(4,3,1,1,1,1) 10 of size 18 57 1.4289 1.6569 2.2208
(7,2,1,1)(7,2,1,1) 3 of size 70 12 1.4224 1.5770 1.8182
(5,4,1,1)(5,4,1,1) 3 of size 60 19 1.4220 1.6448 2.4675
(5,3,3)(5,3,3) 3 of size 90 15 1.4218 1.8074 2.9221
(7,4)(7,4) 1 of size 210 5 1.3618 1.5615 2.2727
(6,3,1,1)(6,3,1,1) 3 of size 60 16 1.3583 1.5529 2.0779
(9,2)(9,2) 1 of size 252 3 1.3469 1.4752 1.6364
(6,5)(6,5) 1 of size 200 6 1.3379 1.5416 2.5974
(2,1,1,1,1,1,1,1,1,1)(2,1,1,1,1,1,1,1,1,1) 126 of size 2 336 1.2941 1.3704 1.4545
(3,3,1,1,1,1,1)(3,3,1,1,1,1,1) 35 of size 9 98 1.2857 1.4911 1.9091
(5,3,1,1,1)(5,3,1,1,1) 10 of size 30 31 1.2848 1.4802 2.0130
(5,5,1)(5,5,1) 3 of size 100 11 1.2727 1.4694 2.3810
(8,3)(8,3) 1 of size 210 4 1.2725 1.4383 1.8182
(4,1,1,1,1,1,1,1)(4,1,1,1,1,1,1,1) 35 of size 6 119 1.2692 1.3569 1.5455
(7,3,1)(7,3,1) 3 of size 105 8 1.2375 1.4053 1.8182
(6,1,1,1,1,1)(6,1,1,1,1,1) 10 of size 20 32 1.2171 1.2718 1.3853
(8,1,1,1)(8,1,1,1) 3 of size 70 8 1.1538 1.1779 1.2121
(5,1,1,1,1,1,1)(5,1,1,1,1,1,1) 35 of size 10 63 1.1458 1.2122 1.3636
(3,1,1,1,1,1,1,1,1)(3,1,1,1,1,1,1,1,1) 126 of size 3 210 1.1379 1.2211 1.3636
(7,1,1,1,1)(7,1,1,1,1) 10 of size 35 16 1.1149 1.1508 1.2121
(10,1)(10,1) 1 of size 252 2 1.0820 1.0864 1.0909
(9,1,1)(9,1,1) 3 of size 126 4 1.0645 1.0765 1.0909
(11)(11) 1 of size 462 462 1 1 1
(1,1,1,1,1,1,1,1,1,1,1)(1,1,1,1,1,1,1,1,1,1,1) 462 of size 1 462 1 1 1

As the most important example, consider the maximal twelve tone group Γ=S2,2,2,1,2,2\Gamma=S_{2,2,2,1,2,2} with signature (2,2,2,1,2,2)(2,2,2,1,2,2). Equivalently, Γ\Gamma is the subgroup of S11S_{11} generated by the set {(1 2),(3 4),(5 6),(8 9),(10 11)}\{(1\ 2),(3\ 4),(5\ 6),(8\ 9),(10\ 11)\}, and it is isomorphic to S2×S2×S2×S1×S2×S225S_{2}\times S_{2}\times S_{2}\times S_{1}\times S_{2}\times S_{2}\cong{\mathbb{Z}}_{2}^{5} and has order 3232. The orbits in 𝒯7{\mathscr{T}}_{7} therefore have 1, 2, 4, 8, 16, or 32 elements. Using Proposition 5.1 we find that 𝒯7{\mathscr{T}}_{7} under the action of Γ\Gamma has:

  1. (1)

    11 orbit of size 3232,

  2. (2)

    55 orbits of size 1616,

  3. (3)

    2020 orbits of size 88,

  4. (4)

    3030 orbits of size 44,

  5. (5)

    3030 orbits of size 22,

  6. (6)

    1010 orbits of size 11,

for a total of 9696 orbits. One therefore has

orbt(Γ,𝒯7)=(32t+1+516t+1+208t+1+304t+1+302t+1+101t+1462)1/t{\operatorname{orb}}_{t}(\Gamma,{\mathscr{T}}_{7})=\left(\frac{32^{t+1}+5\cdot 16^{t+1}+20\cdot 8^{t+1}+30\cdot 4^{t+1}+30\cdot 2^{t+1}+10\cdot 1^{t+1}}{462}\right)^{1/t}

and

diamt(Γ,𝒯7)=32(46232t+1+516t+1+208t+1+304t+1+302t+1+101t+1)1/t.{\operatorname{diam}}_{t}(\Gamma,{\mathscr{T}}_{7})=32\left(\frac{462}{32^{t+1}+5\cdot 16^{t+1}+20\cdot 8^{t+1}+30\cdot 4^{t+1}+30\cdot 2^{t+1}+10\cdot 1^{t+1}}\right)^{1/t}.

In particular, one has

diam1(Γ,𝒯7)=3246241943.5250{\operatorname{diam}}_{1}(\Gamma,{\mathscr{T}}_{7})=32\cdot\frac{462}{4194}\approx 3.5250

and

diam1(Γ,𝒯7)=32964626.6494.{\operatorname{diam}}_{-1}(\Gamma,{\mathscr{T}}_{7})=32\cdot\frac{96}{462}\approx 6.6494.

The heptatonic scales of Γ\Gamma comprise the unique maximal Γ\Gamma-orbit of 𝒯7{\mathscr{T}}_{7}, which are the 32 scales

{0,12,34,56,7,89,1011},\left\{0,{1\atop 2},{3\atop 4},{5\atop 6},7,{8\atop 9},{10\atop 11}\right\},

or

{0,2,4,6,7,9,11},\{0,2^{\dagger},4^{\dagger},6^{\dagger},7,9^{\dagger},11^{\dagger}\},

where each of the \dagger’s is either a \natural (+0+0) or a \flat (1-1). This unique maximal Γ\Gamma-orbit consists of the 32 thāts of Hindustani classical music popularized by Bhatkhande.

One may also consider the maximal twelve tone group Γ\Gamma_{-} of signature (2,2,1,2,2,2)(2,2,1,2,2,2), an action that globally fixes 55 instead of 77. Since changing the order of the numbers in the signature does not affect any of the relevant values, the values above calculated for Γ\Gamma_{-} are the same as for Γ\Gamma. The maximal Γ\Gamma_{-}-orbit of 𝒯7{\mathscr{T}}_{7} contains the 32 scales

{0,12,34,5,67,89,1011},\left\{0,{1\atop 2},{3\atop 4},5,{6\atop 7},{8\atop 9},{10\atop 11}\right\},

or

{0,2,4,5,7,9,11},\{0,2^{\dagger},4^{\dagger},5,7^{\dagger},9^{\dagger},11^{\dagger}\},

where each of the \dagger’s is either a \natural (+0+0) or a \flat (1-1). In particular, the most “sharp” of these 32 scales is precisely the major scale. The intersection of the maximal Γ\Gamma-orbit and the maximal Γ\Gamma_{-}-orbit consists of the 16 scales

{0,12,34,5,7,89,1011}.\left\{0,{1\atop 2},{3\atop 4},5,7,{8\atop 9},{10\atop 11}\right\}.

We may also consider the compositum Γ1=ΓΓ{\Gamma_{1}}=\Gamma\Gamma_{-} of Γ\Gamma and Γ\Gamma_{-}, which is the maximal twelve tone group of signature (2,2,3,2,2)(2,2,3,2,2). There are:

  1. (1)

    11 orbit of size 4848,

  2. (2)

    44 orbits of size 2424,

  3. (3)

    1212 orbits of size 1212,

  4. (4)

    44 orbits of size 88,

  5. (5)

    1212 orbits of size 66,

  6. (6)

    66 orbits of size 44,

  7. (7)

    66 orbits of size 33,

  8. (8)

    1212 orbits of size 22,

  9. (9)

    44 orbits of size 11,

for a total of 6161 orbits. The maximal Γ1\Gamma_{1}-orbit is just the union of the maximal Γ\Gamma-orbit and the maximal Γ\Gamma_{-}-orbit. Consistent with inclusion-exclusion, one has 48=32+321648=32+32-16. One has

diam1(Γ1,𝒯7)=4846272263.0689{\operatorname{diam}}_{1}(\Gamma_{1},{\mathscr{T}}_{7})=48\cdot\frac{462}{7226}\approx 3.0689

and

diam1(Γ1,𝒯7)=48614626.3377.{\operatorname{diam}}_{-1}(\Gamma_{1},{\mathscr{T}}_{7})=48\cdot\frac{61}{462}\approx 6.3377.

Note that

diam1(Γ,𝒯7)diam1(Γ1,𝒯7)=32/419448/72263.52503.06891.1486\frac{{\operatorname{diam}}_{1}(\Gamma,{\mathscr{T}}_{7})}{{\operatorname{diam}}_{1}(\Gamma_{1},{\mathscr{T}}_{7})}=\frac{32/4194}{48/7226}\approx\frac{3.5250}{3.0689}\approx 1.1486

and

diam1(Γ,𝒯7)diam1(Γ1,𝒯7)=329648616.64946.33771.0492.\frac{{\operatorname{diam}}_{-1}(\Gamma,{\mathscr{T}}_{7})}{{\operatorname{diam}}_{-1}(\Gamma_{1},{\mathscr{T}}_{7})}=\frac{32\cdot 96}{48\cdot 61}\approx\frac{6.6494}{6.3377}\approx 1.0492.

Let us now consider the maximal twelve tone group Δ\Delta of signature (4,2,1,4)(4,2,1,4). This group is also of theoretical and historical importance in Indian classical music, as its unique maximal orbit consists precisely of the 72=1621672=1\cdot 6\cdot 2\cdot 1\cdot 6 mēḷakarta ragas, which are built as follows:

{0}{two of 1,2,3,4}{one of 5,6}{7}{two of 8,9,10,11}.\{0\}\cup\{\mbox{two of }1,2,3,4\}\cup\{\mbox{one of }5,6\}\cup\{7\}\cup\{\mbox{two of }8,9,10,11\}.

Here we have:

  1. (1)

    11 orbit of size 7272,

  2. (2)

    22 orbits of size 4848,

  3. (3)

    11 orbit of size 3636,

  4. (4)

    22 orbits of size 3232,

  5. (5)

    44 orbits of size 2424,

  6. (6)

    33 orbits of size 1616,

  7. (7)

    22 orbits of size 88,

  8. (8)

    22 orbits of size 66,

  9. (9)

    44 orbits of size 44,

  10. (10)

    22 orbits of size 22,

  11. (11)

    22 orbits of size 11,

for a total of 2525 orbits. We then have

diam1(Δ,𝒯7)=72462164822.0182{\operatorname{diam}}_{1}(\Delta,{\mathscr{T}}_{7})=72\cdot\frac{462}{16482}\approx 2.0182

and

diam1(Δ1,𝒯7)=72254623.8961{\operatorname{diam}}_{-1}(\Delta_{1},{\mathscr{T}}_{7})=72\cdot\frac{25}{462}\approx 3.8961

Note that, of the 72 mēḷakarta ragas comprising the maximal Δ\Delta-orbit, 32 are scales in the maximal Γ1\Gamma_{1}-orbit consisting of 48 scales, but only 16 are scales in the maximal Γ\Gamma_{-}-orbit consisting of 32 scales.

One may also consider the maximal twelve tone group Δ\Delta_{-} of signature (4,1,2,4)(4,1,2,4), an action that globally fixes 55 instead of 77, as does Δ\Delta. Since changing the order of the numbers in the signature does not affect any of the relevant values, the values above calculated for Δ\Delta_{-} are the same as for Δ\Delta. The largest Δ\Delta_{-}-orbit of 𝒯7{\mathscr{T}}_{7} contains the 72 scales

{0}{two of 1,2,3,4}{5}{one of 6,7}{two of 8,9,10,11}.\{0\}\cup\{\mbox{two of }1,2,3,4\}\cup\{5\}\cup\{\mbox{one of }6,7\}\cup\{\mbox{two of }8,9,10,11\}.

The intersection of the maximal Δ\Delta-orbit and the maximal Δ\Delta_{-}-orbit consists of the 36 scales

{0}{two of 1,2,3,4}{5,7}{two of 8,9,10,11}.\{0\}\cup\{\mbox{two of }1,2,3,4\}\cup\{5,7\}\cup\{\mbox{two of }8,9,10,11\}.

We may also consider the maximal twelve tone group Δ1=ΓΔ=Γ1Δ\Delta_{1}=\Gamma_{-}\Delta=\Gamma_{1}\Delta of signature (4,3,4)(4,3,4). Its largest orbit consists of 108=636=72+36108=6\cdot 3\cdot 6=72+36 possible scales, which is the union of the maximal Δ\Delta-orbit and the maximal Δ\Delta_{-}-orbit. Consistent with inclusion-exclusion, one has 108=72+7236108=72+72-36. The 108 scales are built as follows:

{0}{two of 1,2,3,4}{two of 5,6,7}{two of 8,9,10,11}.\{0\}\cup\{\mbox{two of }1,2,3,4\}\cup\{\mbox{two of }5,6,7\}\cup\{\mbox{two of }8,9,10,11\}.

These 108 scales include all 72 mēḷakarta ragas in the maximal Δ\Delta-orbit and all 32 scales from the maximal Γ\Gamma_{-}-orbit, along with 20 others. Indeed, by inclusion-exclusion, the combined total of scales in the maximal Δ\Delta-orbit and in the maximal Γ\Gamma_{-}-orbit is only 88=72+321688=72+32-16. Similarly, the 108 scales include all 72 mēḷakarta ragas and all 48 scales from the maximal Γ1\Gamma_{1}-orbit, along with the same 20 “new” scales. Indeed, by inclusion-exclusion, the combined total of scales in the maximal Δ\Delta-orbit and in the maximal Γ1\Gamma_{1}-orbit is 88=72+483288=72+48-32. The 20 new scales are as follows:

{0}{1,2}{5,6}{two of 8,9,10,11},\{0\}\cup\{1,2\}\cup\{5,6\}\cup\{\mbox{two of }8,9,10,11\},
{0}{3,4}{5,6}{two of 8,9,10,11},\{0\}\cup\{3,4\}\cup\{5,6\}\cup\{\mbox{two of }8,9,10,11\},
{0}{two of 1,2,3,4}{5,6}{8,9},\{0\}\cup\{\mbox{two of }1,2,3,4\}\cup\{5,6\}\cup\{8,9\},
{0}{two of 1,2,3,4}{5,6}{10,11}.\{0\}\cup\{\mbox{two of }1,2,3,4\}\cup\{5,6\}\cup\{10,11\}.

At first sight this appears to be 24=6424=6\cdot 4 scales; however, 4 scales are repeated twice, namely, the 4 scales

{0}({1,2} or {3,4}){5,6}({8,9} or {10,11}).\{0\}\cup(\{1,2\}\mbox{ or }\{3,4\})\cup\{5,6\}\cup(\{8,9\}\mbox{ or }\{10,11\}).

For the group Δ1\Delta_{1} we have:

  1. (1)

    11 orbit of size 108108,

  2. (2)

    22 orbits of size 7272,

  3. (3)

    22 orbits of size 4848,

  4. (4)

    22 orbits of size 2424,

  5. (5)

    11 orbit of size 1616,

  6. (6)

    22 orbits of size 1212,

  7. (7)

    22 orbits of size 66,

  8. (8)

    22 orbits of size 44,

  9. (9)

    22 orbits of size 33,

for a total of 1616 orbits. Here we have

diam1(Δ1,𝒯7)=108462284581.7533{\operatorname{diam}}_{1}(\Delta_{1},{\mathscr{T}}_{7})=\frac{108\cdot 462}{28458}\approx 1.7533

and

diam1(Δ1,𝒯7)=108164623.7403.{\operatorname{diam}}_{-1}(\Delta_{1},{\mathscr{T}}_{7})=\frac{108\cdot 16}{462}\approx 3.7403.

Thus, we see that

diam1(Δ,𝒯7)diam1(Δ1,𝒯7)=72/16482108/284582.01821.75331.1511\frac{{\operatorname{diam}}_{1}(\Delta,{\mathscr{T}}_{7})}{{\operatorname{diam}}_{1}(\Delta_{1},{\mathscr{T}}_{7})}=\frac{72/16482}{108/28458}\approx\frac{2.0182}{1.7533}\approx 1.1511

and

diam1(Δ1,𝒯7)diam11Δ1,𝒯7)=7225108163.89613.74031.0417.\frac{{\operatorname{diam}}_{-1}(\Delta_{1},{\mathscr{T}}_{7})}{{\operatorname{diam}}_{-1}1\Delta_{1},{\mathscr{T}}_{7})}=\frac{72\cdot 25}{108\cdot 16}\approx\frac{3.8961}{3.7403}\approx 1.0417.

Thus, the passing from Δ\Delta to Δ1\Delta_{1} decreases “musicality” of the scales in the maximal orbit in a manner that is comparable to passing from Γ\Gamma to Γ1\Gamma_{1}.

The lattice diagram for the maximal twelve tone groups Γ,Γ,Γ1,Δ,Δ,Δ1\Gamma,\Gamma_{-},\Gamma_{1},\Delta,\Delta_{-},\Delta_{1} that we have discussed thus far, along with the groups Γ0=ΓΓ\Gamma_{0}=\Gamma\cap\Gamma_{-} and Δ0=ΔΔ\Delta_{0}=\Delta\cap\Delta_{-}, are as follows.

The values of diam1(G,𝒯7){\operatorname{diam}}_{1}(G,{\mathscr{T}}_{7}) for these eight groups are (approximately) as follows.

The signatures of these maximal twelve tone groups are as follows.

The heptatonic scales for these groups are the sets

having cardinalities

It is interesting that 𝒯7,Δ0=𝒯7,Δ1{{\mathscr{T}}_{7,\Delta_{0}}}={{\mathscr{T}}_{7,\Delta_{1}}} and 𝒯7,Γ0=𝒯7,Γ1{{\mathscr{T}}_{7,\Gamma_{0}}}={{\mathscr{T}}_{7,\Gamma_{1}}}, even though 𝒯7,Δ0{{\mathscr{T}}_{7,\Delta_{0}}} and 𝒯7,Γ0{{\mathscr{T}}_{7,\Gamma_{0}}} each consist of three equal-sized orbits (of size 36 and 16, respectively) while 𝒯7,Δ1{{\mathscr{T}}_{7,\Delta_{1}}} and 𝒯7,Γ1{{\mathscr{T}}_{7,\Gamma_{1}}} each consist of a unique orbit (of size 108 and 48, respectively).

While the analysis in this section provides some reasons for using the groups Γ\Gamma and Δ\Delta, Theorem 1.2 shows that Γ\Gamma is unique only up to conjugacy. One ought to ask whether or not Γ\Gamma is the “best” choice among all of its conjugates GG for the scales in the maximal GG-orbit.

We believe that the choice of Γ\Gamma and Δ\Delta, and thus the mēḷakarta raga system, can be justified. All 72 of the mēḷakarta ragas contain 0 and 77, which is a natural restriction to impose as the interval {0,7}\{0,7\} is a fifth (in fact, a perfect fifth in Indian classical tuning). The 72=1621672=1\cdot 6\cdot 2\cdot 1\cdot 6 mēḷakarta ragas are obtained as follows:

{0}{two of 1,2,3,4}{one of 5,6}{7}{two of 8,9,10,11}.\{0\}\cup\{\mbox{two of }1,2,3,4\}\cup\{\mbox{one of }5,6\}\cup\{7\}\cup\{\mbox{two of }8,9,10,11\}.

These 72 scales comprise the maximal Δ\Delta-orbit of 𝒯7{\mathscr{T}}_{7}, where Δ\Delta is the subgroup of S11S_{11} isomorphic to S4×S2×S1×S4S_{4}\times S_{2}\times S_{1}\times S_{4} that acts separately on {1,2,3,4}\{1,2,3,4\}, {5,6}\{5,6\}, {7}\{7\}, and {8,9,10,11}\{8,9,10,11\}. In other words, the 72 mēḷakarta ragas are precisely the scales with largest (Δ,t)(\Delta,t)-musciality for any t[,]t\in[-\infty,\infty]. The choice of Γ\Gamma and Δ\Delta among their conjugates are “natural” choices at the very least to the extent that the perfect fifth is “natural.”

For any subgroup GG of S11S_{11} conjugate to Γ\Gamma, one is required to choose an element of 12{\mathbb{Z}}_{12} besides 0 that is globally fixed by the action of GG. To the extent that the perfect fifth is “natural,” the most natural choice is the element 77, but one may rightfully choose 55 instead. Either of these is a natural choice since, while the interval {0,7}\{0,7\} is a fifth, the interval {0,5}\{0,5\} is a fourth, and both intervals coincide with perfect harmonic intervals (in some tunings). Moreover, 5 and 7 are the only elements of the cyclic group 12{\mathbb{Z}}_{12} other than 11 and 1=11-1=11 that generate the whole group, a fact that forms the basis of the circle of fifths and circle of fourths.

Once the choice of a second fixed element of 12{\mathbb{Z}}_{12} is made, one is required to partition the remaining ten elements of 12{\mathbb{Z}}_{12} into disjoint two-element subsets {ai,bi}\{a_{i},b_{i}\} for i=1,2,3,4,5i=1,2,3,4,5, where GG is then to act separately on {ai,bi}\{a_{i},b_{i}\} for all ii. A natural choice is for each aia_{i} and bib_{i} to be consecutive, so that bi=ai±1b_{i}=a_{i}\pm 1 for all ii. This is because any other choice would require the action to be “non-local,” with the occurence of “crossings,” as, for example, if GG were to act separately on {1,4}\{1,4\} and {2,3}\{2,3\}. It is clear that every subgroup of SnS_{n} is conjugate to a subgroup that acts without crossings. There are exactly six conjugates of Γ\Gamma that act without crossings, namely, those that fix 11, 33, 55, 77, 99, or 1111, respectively. (By definition, all of them fix 0.) Among these six conjugates of Γ\Gamma, the only one besides Γ\Gamma that it may also be natural to consider is the subgroup Γ\Gamma_{-} generated by {(1 2),(3 4),(6 7),(8 9),(10 11)}\{(1\ 2),(3\ 4),(6\ 7),(8\ 9),(10\ 11)\}, which fixes 55 instead of 77. From this group Γ\Gamma_{-} we obtain the following 32 scales in the unique maximal Γ\Gamma_{-}-orbit:

{C,DD,EE,F,GG,AA,BB}.\left\{\mbox{C},{\mbox{D}\flat\atop\mbox{D}},{\mbox{E}\flat\atop\mbox{E}},\mbox{F},{\mbox{G}\flat\atop\mbox{G}},{\mbox{A}\flat\atop\mbox{A}},{\mbox{B}\flat\atop\mbox{B}}\right\}.

Half of these 32 scales—those that contain G—were obtained previously using Γ\Gamma, so among these 32 scales we obtain the 16 additional scales listed in Table 3, namely, those that contain G\flat rather than G. Thus, the union of the maximal Γ\Gamma-orbit and the maximal Γ\Gamma_{-}-orbit consists of 48=32+16=32+321648=32+16=32+32-16 scales.

It is natural also to consider the compositum Γ1=ΓΓ\Gamma_{1}=\Gamma\Gamma_{-} of Γ\Gamma and Γ\Gamma_{-}, which is isomorphic to S2×S2×S3×S2×S2S_{2}\times S_{2}\times S_{3}\times S_{2}\times S_{2} and acts without crossings, separately on {1,2}\{1,2\}, {3,4}\{3,4\}, {5,6,7}\{5,6,7\}, {8,9}\{8,9\}, and {10,11}\{10,11\}. The unique maximal Γ1\Gamma_{1}-orbit consists of the 48 scales listed in Tables 1 and 2 comprising the union of the maximal Γ\Gamma-orbit and the maximal Γ\Gamma_{-}-orbit. The group Γ1\Gamma_{1} has the virtue that its action on {1,2,3,,11}\{1,2,3,\ldots,11\} is completely symmetrical, providing a natural theory of heptatonic scales that privileges both the fourth and the fifth equally. Moreover, the group Γ1\Gamma_{1} ranks 3rd–8th among the 3094 conjugacy classes of subgroups GG of S11S_{11} for its value of diam(G,𝒯7){\operatorname{diam}}(G,{\mathscr{T}}_{7}) for t=1t=1 and 2nd–7th for its values for t=0t=0 and t=1t=-1. One has

diam1(Γ1,𝒯7)3.0689,{\operatorname{diam}}_{1}(\Gamma_{1},{\mathscr{T}}_{7})\approx 3.0689,
diam0(Γ1,𝒯7)4.3060,{\operatorname{diam}}_{0}(\Gamma_{1},{\mathscr{T}}_{7})\approx 4.3060,
diam1(Γ1,𝒯7)6.3377,{\operatorname{diam}}_{-1}(\Gamma_{1},{\mathscr{T}}_{7})\approx 6.3377,

which closely rival the values for Γ\Gamma and Γ\Gamma_{-}. By contrast, the group Δ\Delta fares relatively poorly, ranking 529th–536th among the 3094 conjugacy classes for its value for t=1t=1, ranking 483rd–490th for its value for t=0t=0, and ranking 294th–301st for its value at t=1t=-1. These values are as follows:

diam1(Δ,𝒯7)2.0182,{\operatorname{diam}}_{1}(\Delta,{\mathscr{T}}_{7})\approx 2.0182,
diam0(Δ,𝒯7)2.5229,{\operatorname{diam}}_{0}(\Delta,{\mathscr{T}}_{7})\approx 2.5229,
diam1(Δ,𝒯7)3.8961.{\operatorname{diam}}_{-1}(\Delta,{\mathscr{T}}_{7})\approx 3.8961.

By this measure, then, the 48 scales in the maximal Γ1\Gamma_{1}-orbit are a worthy alternative to the 72 (mēḷakarta) scales in the maximal Δ\Delta-orbit.

Table 3. 16 additional scales in the maximal Γ1\Gamma_{1}-orbit of 𝒯7{\mathscr{T}}_{7}
C D E F G\flat A B
C D E F G\flat A B\flat
C D E F G\flat A\flat B
major Locrian C D E F G\flat A\flat B\flat
C D E\flat F G\flat A B
C D E\flat F G\flat A B\flat
C D E\flat F G\flat A\flat B
half diminished C D E\flat F G\flat A\flat B\flat
C D\flat E F G\flat A B
C D\flat E F G\flat A B\flat
Persian C D\flat E F G\flat A\flat B
C D\flat E F G\flat A\flat B\flat
C D\flat E\flat F G\flat A B
Locrian 6 C D\flat E\flat F G\flat A B\flat
C D\flat E\flat F G\flat A\flat B
Locrian mode C D\flat E\flat F G\flat A\flat B\flat

It must be noted that the maximal twelve tone groups that are conjugate to Γ\Gamma, besides Γ\Gamma_{-}, are those of signature (1,2,2,2,2,2)(1,2,2,2,2,2), (2,1,2,2,2,2)(2,1,2,2,2,2), (2,2,2,2,1,2)(2,2,2,2,1,2), and (2,2,2,2,2,1)(2,2,2,2,2,1), respectively. These four groups are those that fix 11, 33, 99, and 1111, respectively, instead of 77 or 55. These six groups appear in three “inverse” pairs: Γ\Gamma and Γ\Gamma_{-} are inversions, as are those of signature (1,2,2,2,2,2)(1,2,2,2,2,2) and (2,2,2,2,2,1)(2,2,2,2,2,1), as are those of signature (2,1,2,2,2,2)(2,1,2,2,2,2) and (2,2,2,2,1,2)(2,2,2,2,1,2). At this stage, one ought to seek further mathematical justification for the choice of fixing 77 or 55 over and above 11, 33, 99, or 1111, based on more than just the naturality of fixing the perfect fifth (0 and) 77. Regarding this problem, one of the two reviewers wrote the following.

The proposed formal framework provides no actual explanation for the privileged role of 77 and 55 for the choice of the second fixed element. In my view, this is a methodological weakness. Although the authors try to explain it as a “most natural choice” but all the reasoning is based on aspects that are external to the actual formal framework: tuning and generators of 12{\mathbb{Z}}_{12}. If tuning was crucial, why does 44 play no role in the model? If generators were important, why do 11 or 111 lead to unmusical systems? To solve the issue, I believe, the model should be enhanced by an additional formal constraint resulting in disqualification of other choices for the second fixed element. Below, I theorize about one possible approach (evenness)\ldots.

As I challenged above, the choice of the other fixed element as 7 or 5 is based on ad hoc arguments. It would be much more elegant if an additional formal concept was introduced from which the two choices of 7 and 5 would formally follow. I think that some generalization of Clough’s concept of evenness could be a viable option\ldots. I think that some measure of “average evenness” for systems of scales could be introduced (and computed) and it would disqualify the other choices of the fixed element in the heptatonic scales. Additionally, evenness applies obviously even on the level of particular scales. This would provide a natural ordering of scales with diatonic scales (and so the anhemitonic pentatonic scales) being maximally even.

No doubt this is a promising way to resolve the issue. As mentioned in the introduction, it is possible that the methods of this paper can be combined synergistically with other ways of mathematically justifying the various musical scales. We leave this to the interested reader to pursue further.

The reviewers also commented that one ought to try to generalize Theorem 1.2 to atonic scales, that is, to the action of S12S_{12} on the set SS7\SS_{7} of all 7-note atonic scales in SS\SS. Based on that suggestion, we used GAP and SAGE to verify the following theorem, in a manner similar to the proof of Theorem 1.2. Note that there are 10723 conjugacy classes of subgroups of S12S_{12}.

Theorem 5.2 (with James Allen).

For all t[1,1]t\in[-1,1], the subgroups GG of S12S_{12} for which diamt(G,SS7){\operatorname{diam}}_{t}(G,\SS_{7}) is largest are the group generated by {(0 1),(2 3),(4 5),(7 8),(9 10)(6 11),(9 11)(6 10)}\{(0\ 1),(2\ 3),(4\ 5),(7\ 8),(9\ 10)(6\ 11),(9\ 11)(6\ 10)\}, along with its conjugate subgroups. Moreover, for all t[0,1]t\in[0,1], the subgroups GG of S12S_{12} for which diamt(G,SS7){\operatorname{diam}}_{t}(G,\SS_{7}) is second largest are the group Γ\Gamma generated by {(1 2),(3 4),(5 6),(8 9),(10 11)}\{(1\ 2),(3\ 4),(5\ 6),(8\ 9),(10\ 11)\}, along with its conjugate subgroups.

The theorem provides further justification that the group Γ\Gamma and its conjugates are “natural” choices for a theory of musical scales. On the interval [1,0][-1,0], as tt approaches 1-1, one other conjugacy class begins to surpass the group Γ\Gamma in tt-power diameter, namely, the conjugates of the group generated by {(0 1),(2 3),(4 5),(7 8)(9 10),(9 10)(6 11),(9 11)(6 10)}\{(0\ 1),(2\ 3),(4\ 5),(7\ 8)(9\ 10),(9\ 10)(6\ 11),(9\ 11)(6\ 10)\}. The values of diamt(G,𝒯7){\operatorname{diam}}_{t}(G,{\mathscr{T}}_{7}) for t=1,0,1t=1,0,-1 for these three conjugacy classes of subgroups GG of S12S_{12} are provided in Table 4. For reasons explained earlier, the values on the interval [0,1][0,1] are more critical than those on the interval [1,0][-1,0].

Table 4. The three conjugacy classes of subgroups of S12S_{12} with largest tt-power diameter
Signature Maximal orbits diam1(G,𝒯7){\operatorname{diam}}_{1}(G,{\mathscr{T}}_{7}) diam0(G,𝒯7){\operatorname{diam}}_{0}(G,{\mathscr{T}}_{7}) diam1(G,𝒯7){\operatorname{diam}}_{-1}(G,{\mathscr{T}}_{7})
(4,2,2,2,2)(4,2,2,2,2) 1 of size 64 3.9501 5.5199 7.8384
(2,2,2,2,2,1,1)(2,2,2,2,2,1,1) 1 of size 32 3.7917 5.0929 6.9091
(4,2,2,2,2)(4,2,2,2,2) 1 of size 64 3.7183 5.0397 7.0303

6. Hexatonic scales

It is most common to obtain a hexatonic scale in one of the following three ways: (1) deleting a note from a given heptatonic scale (e.g., the major and minor hexatonic scales are obtained from the major and natural minor heptatonic scales by deleting the seventh note and the sixth note, respectively); (2) adding a note to a given pentatonic scale (e.g, the major and minor blues hexatonic scales are obtained from the major and minor pentatonic scales by adding an extra half step after the third note in each); and (3) combining three non-overlapping triads.

A fourth way of obtaining a hexatonic scale from a heptatonic scale is as follows. Define the (tonic) complement s¯\overline{s} of a kk-tonic scale ss to be the (13k)(13-k)-tonic scale

s¯=(12s){0}.\overline{s}=({\mathbb{Z}}_{12}-s)\cup\{0\}.

Of course one has s¯¯=s\overline{\overline{s}}=s for all scales s𝒯s\in{\mathscr{T}}. For any action \cdot of S11S_{11} on 𝒯{\mathscr{T}}, there is a (tonic) complementary action ¯\overline{\cdot} of S11S_{11} defined by

σ¯s=σs¯¯\sigma\overline{\cdot}s=\overline{\sigma\cdot\overline{s}}

for all σS11\sigma\in S_{11} and all s𝒯s\in{\mathscr{T}}. Moreover, the induced action \cdot on 𝒯k{\mathscr{T}}_{k} corresponds to the induced action ¯\overline{\cdot} on 𝒯13k{\mathscr{T}}_{13-k}. As a consequence, our results on actions of the subgroups of S11S_{11} on 𝒯7{\mathscr{T}}_{7} yield corresponding results on the actions of the subgroups of S11S_{11} on 𝒯6{\mathscr{T}}_{6}. Thus, by Theorem 1.2 and complementarity, we have the following.

Theorem 6.1.

For all t[1,1]t\in[-1,1], the subgroups GG of S11S_{11} for which diamt(G,𝒯6){\operatorname{diam}}_{t}(G,{\mathscr{T}}_{6}) is largest are the group Γ\Gamma generated by {(1 2),(3 4),(5 6),(8 9),(10 11)}\{(1\ 2),(3\ 4),(5\ 6),(8\ 9),(10\ 11)\}, along with its conjugate subgroups.

Also by complementarity, each of the theories of heptatonic scales developed in Section 5 has a complementary theory of hexatonic scales: the sets of complements of the scales in the maximal orbits of a given action on 𝒯7{\mathscr{T}}_{7} are precisely the maximal orbits of the complementary action on 𝒯6{\mathscr{T}}_{6}. However, none of the 32 scales in the maximal orbit of 𝒯6{\mathscr{T}}_{6} under the action of our twelve tone group Γ\Gamma of signature (2,2,2,1,2,2)(2,2,2,1,2,2) contains the interval {0,7}\{0,7\} of a fifth. Consequently, this particular group is perhaps not the most natural for yielding interesting hexatonic scales. For this purpose we single out the subgroup Λ\Lambda of S11S_{11} generated by the set {(2 3),(4 5),(6 7),(8 9),(10 11)}\{(2\ 3),(4\ 5),(6\ 7),(8\ 9),(10\ 11)\}, which is the maximal twelve tone group of signature (1,2,2,2,2,2)(1,2,2,2,2,2), and the subgroup Λ\Lambda^{\prime} of S11S_{11} generated by the set {(1 2),(3 4),(5 6),(7 8),(9 10)}\{(1\ 2),(3\ 4),(5\ 6),(7\ 8),(9\ 10)\}, which is the maximal twelve tone group of signature (2,2,2,2,2,1)(2,2,2,2,2,1). Among the 32 scales

{C,DE,EF,GG,AA,BB}\left\{\mbox{C},{\mbox{D}\atop\mbox{E}\flat},{\mbox{E}\atop\mbox{F}},{\mbox{G}\atop\mbox{G}\flat},{\mbox{A}\atop\mbox{A}\flat},{\mbox{B}\atop\mbox{B}\flat}\right\}

in the maximal Λ\Lambda-orbit of 𝒯6{\mathscr{T}}_{6} appear the whole tone scale {C, D, E, G, A, B}\{\mbox{C, D, E, G$\flat$, A$\flat$, B$\flat$}\} (the complement of Neopolitan major), the Prometheus scale {C, D, E, G, A, B}\{\mbox{C, D, E, G$\flat$, A, B$\flat$}\} (the complement of Neopolitan minor), and the augmented scale {C, E, E, G, A, B}\{\mbox{C, E$\flat$, E, G, A$\flat$, B}\}. Among the 32 scales

{C,DD,EE,FG,GA,AB}\left\{\mbox{C},{\mbox{D}\atop\mbox{D}\flat},{\mbox{E}\atop\mbox{E}\flat},{\mbox{F}\atop\mbox{G}\flat},{\mbox{G}\atop\mbox{A}\flat},{\mbox{A}\atop\mbox{B}\flat}\right\}

in the maximal Λ\Lambda^{\prime}-orbit of 𝒯6{\mathscr{T}}_{6} appear the whole tone scale, the major hexatonic scale {C, D, E, F, G, A}\{\mbox{C, D, E, F, G, A}\}, the minor hexatonic scale {C, D, E, F, G, B}\{\mbox{C, D, E$\flat$, F, G, B$\flat$}\}, and the tritone scale {C, D, E, G, G, B}\{\mbox{C, D$\flat$, E, G$\flat$, G, B$\flat$}\}. The whole tone scale is the only scale that lies in both sets of 32 scales. Unfortunately, the major and minor blues hexatonic scales do not appear in either set but rather have Λ\Lambda-orbits and Λ\Lambda^{\prime}-orbits of size 1616.

7. Pentatonic scales

The most prominent of the pentatonic scales are the five black-key pentatonic scales formed by the black keys of a piano: the major and minor and blues major and minor pentatonic scales and the Egyptian, or suspended, pentatonic scale. Let Σ\Sigma denote the subgroup of S11S_{11} generated by {(2 3),(4 5),(7 8),(9 10)}\{(2\ 3),(4\ 5),(7\ 8),(9\ 10)\}. In other words, Σ\Sigma is the maximal twelve tone group of signature (1,2,2,1,2,2,1)(1,2,2,1,2,2,1). The 16 scales in the unique maximal Σ\Sigma-orbit of 𝒯5{\mathscr{T}}_{5} are the 16 scales

{C,DD,EF,GG,AA}\left\{\mbox{C},{\mbox{D}\atop\mbox{D}\sharp},{\mbox{E}\atop\mbox{F}},{\mbox{G}\atop\mbox{G}\sharp},{\mbox{A}\atop\mbox{A}\sharp}\right\}

listed in Table 5. Among these 16 scales are the five black-key pentatonic scales.

Table 5. 16 pentatonic scales in the maximal Σ\Sigma-orbit of 𝒯5{\mathscr{T}}_{5}
major C D E G A
C D E G A\sharp
C D E G\sharp A
C D E G\sharp A\sharp
Blues major, or Ritsusen, or yo C D F G A
Egyptian, or suspended C D F G A\sharp
C D F G\sharp A
C D F G\sharp A\sharp
C D\sharp E G A
C D\sharp E G A\sharp
C D\sharp E G\sharp A
C D\sharp E G\sharp A\sharp
C D\sharp F G A
minor C D\sharp F G A\sharp
C D\sharp F G\sharp A
Blues minor, or Man Gong C D\sharp F G\sharp A\sharp

The following theorem was proved using GAP and SAGE in a manner similar to the proof of Theorems 1.2 and 5.2.

Theorem 7.1.

For all t[1,1]t\in[-1,1], the subgroups GG of S11S_{11} for which diamt(G,𝒯5){\operatorname{diam}}_{t}(G,{\mathscr{T}}_{5}) is largest are the group Σ1\Sigma_{1} generated by {(2 3),(4 5),(7 8),(9 10)(6 11),(9 11)(6 10)}\{(2\ 3),(4\ 5),(7\ 8),(9\ 10)(6\ 11),(9\ 11)(6\ 10)\}, along with its conjugate subgroups. Moreover, for all t[0,1]t\in[0,1], the subgroups GG of S11S_{11} for which diamt(G,𝒯5){\operatorname{diam}}_{t}(G,{\mathscr{T}}_{5}) is second largest are the group Σ\Sigma generated by {(2 3),(4 5),(7 8),(9 10)}\{(2\ 3),(4\ 5),(7\ 8),(9\ 10)\}, along with its conjugate subgroups.

On the interval [1,0][-1,0], as tt approaches 1-1, several other subgroups begin to surpass the group Σ\Sigma in tt-power diameter.

The group Σ1\Sigma_{1} of the theorem is isomorphic to 25{\mathbb{Z}}_{2}^{5}, and the group Σ\Sigma is a subgroup of Σ1\Sigma_{1} isomorphic to 24{\mathbb{Z}}_{2}^{4}. For the action of Σ1\Sigma_{1} on 𝒯5{\mathscr{T}}_{5} there are:

  1. (1)

    11 orbit of size 3232,

  2. (2)

    33 orbits of size 1616,

  3. (3)

    1919 orbits of size 88,

  4. (4)

    1616 orbits of size 44,

  5. (5)

    1515 orbits of size 22,

  6. (6)

    44 orbits of size 11,

for a total of 5858 orbits. For the action of Σ\Sigma on 𝒯5{\mathscr{T}}_{5} there are:

  1. (1)

    11 orbit of size 1616,

  2. (2)

    1212 orbits of size 88,

  3. (3)

    2020 orbits of size 44,

  4. (4)

    4040 orbits of size 22,

  5. (5)

    1818 orbits of size 11,

for a total of 101101 orbits. From this we deduce that

diam1(Σ1,𝒯5)3.1731{\operatorname{diam}}_{1}(\Sigma_{1},{\mathscr{T}}_{5})\approx 3.1731
diam0(Σ1,𝒯5)4.2068{\operatorname{diam}}_{0}(\Sigma_{1},{\mathscr{T}}_{5})\approx 4.2068
diam1(Σ1,𝒯5)5.6242{\operatorname{diam}}_{-1}(\Sigma_{1},{\mathscr{T}}_{5})\approx 5.6242

and

diam1(Σ,𝒯5)3.1391{\operatorname{diam}}_{1}(\Sigma,{\mathscr{T}}_{5})\approx 3.1391
diam0(Σ,𝒯5)3.9004{\operatorname{diam}}_{0}(\Sigma,{\mathscr{T}}_{5})\approx 3.9004
diam1(Σ,𝒯5)4.8970.{\operatorname{diam}}_{-1}(\Sigma,{\mathscr{T}}_{5})\approx 4.8970.

One of the reviewers suggested the following theory of atonic pentatonic scales alternative to our theory of tonic pentatonic scales.

I really like the idea of complementarity. It is a very elegant way of dealing with related systems. However, from the musical perspective it seems quite counterintuitive that the hexatonic scales turn out to be the complements to the heptatonic scales. How a Neapolitan minor scale is complementary to the whole tone scale? Of course, it follows from the feature that one tone is fixed in all scales and only the others are movable.

However, if this feature is reconsidered, one might achieve an elegant explanation of the pentatonic scales while keeping the heptatonic scales in check. A scale would be any subset of 12{\mathbb{Z}}_{12}, not necessarily containing 0. Instead of S11S_{11}, one would consider actions of subgroups of S12S_{12}. The definition of local actions would require a cosmetic change: it would need to consider the cyclic nature of 12{\mathbb{Z}}_{12}.

As regards the heptatonic scales, I conjecture that one would obtain the maximal tt-orbit diameters with the signature (2,2,2,2,2,1,1)(2,2,2,2,2,1,1). To fix 0 one could consider the permutations of the signature starting with 11. Then the evenness should lead to two resulting signatures (1,2,2,1,2,2,2)(1,2,2,1,2,2,2) and (1,2,2,2,1,2,2)(1,2,2,2,1,2,2). One could even call them the authentic and the plagal systems.

And for the pentatonic scales one should get a perfect remedy. The pentatonic scales would be the complementary scales to the heptatonic scales. Therefore, the maximal twelve tone groups are conjugates of Γ\Gamma. It would be elegant to consider the conjugate group generated by {(11 0),(2 3),(4 5),(7 8),(9 10)}\{(11\ 0),(2\ 3),(4\ 5),(7\ 8),(9\ 10)\}. This makes 0 movable, but leads to relevant musical scales. It is nice that this way the system includes not only the yo but also the in pentatonic scales. (The Japanese music theorist Uehara proposed two basic pentatonic modes: yo (anhemitonic) and in (hemitonic).)

Under the scheme of pentatonic atonic scales proposed above, the largest orbit has 3232 elements, exactly half of which are tonic scales.

References

  • [1] Carey, Norman, and David Clampitt. “Aspects of Well-Formed Scales.” Music Theory Spectrum 11, no. 2 (1989): 187–206.
  • [2] Clough, John, and Jack Douthett. “Maximally Even Sets.” Journal of Music Theory 35, no. 1/2 (1991): 93–173.
  • [3] Clough, John, Nora Engebretsen, and Jonathan Kochavi. “Scales, Sets, and Interval Cycles: A Taxonomy.” Music Theory Spectrum 21, no. 1 (1999): 74–104.
  • [4] Clough, John, Jack Douthett, N. Ramanathan, and Lewis Rowell. “Early Indian Heptatonic Scales and Recent Diatonic Theory.” Music Theory Spectrum 15, no. 1 (1993): 36–58.
  • [5] Douthett, Jack. “Filtered Point-Symmetry and Dynamical Voice-Leading.” In Music Theory and Mathematics: Chords, Collections, and Transformations, edited by Jack Douthett, Martha M. Hyde, and Charles J. Smith, 72–106. University of Rochester Press, 2008.
  • [6] Hook, Julian. “Signature Transformations.” In Music Theory and Mathematics: Chords, Collections, and Transformations, edited by Jack Douthett, Martha M. Hyde, and Charles J. Smith, 137–160. University of Rochester Press, 2008.
  • [7] Hook, Julian. “Spelled Heptachords.” In Mathematics and Computation in Music—MCM 2011, edited by Carlos Agon, Emmanuel Amiot, Moreno Andreatta, Gérard Assayag, Jean Bresson, and John Mandereau, 84–97. Springer, 2011.
  • [8] Tymoczko, Dmitri. “Scale Networks and Debussy.” Journal Of Music Theory 48, no. 2 (2004): 219–294. Tymoczko, Dmitri. A Geometry of Music: Harmony and Counterpoint in the Extended Common Practice. Oxford University Press, 2011.
  • [9] Žabka, Marek. “Dancing with the Scales: Sub-Chromatic Generated Tone Systems.” Journal of Music Theory 58, no. 2 (2014): 179–233.