Group actions, power mean orbit size, and musical scales
Abstract.
We provide an application of the theory of group actions to the study of musical scales. For any group , finite -set , and real number , we define the -power diameter to be the size of any maximal orbit of divided by the -power mean orbit size of the elements of . The symmetric group acts on the set of all tonic scales, where a tonic scale is a subset of containing . We show that, for all , among all the subgroups of , the -power diameter of the -set of all heptatonic scales is largest for the subgroup , and its conjugate subgroups, generated by . The unique maximal -orbit consists of the 32 thāts of Hindustani classical music popularized by Bhatkhande. This analysis provides a reason why these 32 scales, among all 462 heptatonic scales, are of mathematical interest. We also apply our analysis, to a lesser degree, to hexatonic and pentatonic scales.
Keywords: scales, group action, power mean, heptatonic scales, hexatonic scales, pentatonic scales.
MSC: 05E18, 26E60.
1. Introduction and summary
This paper provides an application of group actions to the study of musical scales and uses it to motivate a new invariant, which we call the -power diameter , defined, for any group , finite -set , and extended real number , to be the size of any maximal orbit of divided by the -power mean orbit size of the elements of .
We may represent the chromatic scale as the set
where represents the tonic, or key, of the chromatic scale, which can be any fixed pitch class. Thus, for example, if one decides to let represent the pitch class C, then represents C, represents D, and so on. A scale (in ) is a subset of , while a tonic scale (in ) is a scale in containing . A tonic scale is -tonic if it consists of notes, where ; thus, any tonic scale is -tonic, where is the cardinality of . The -tonic scales, respectively, for are called monotonic, ditonic, tritonic, tetratonic, pentatonic, hexatonic, heptatonic, octatonic, nonatonic, decatonic, hendecatonic, and chromatic. There are a total of possible tonic scales, with a total of -tonic scales for each . These numbers comprise the eleventh row of Pascal’s triangle:
Thus, for example, there are 462 heptatonic scales and 330 pentatonic scales. We call a scale that may not contain the tonic an atonic scale (in ). There are a total of possible atonic scales (including the unique empty scale), with a total of -atonic scales for each . These numbers comprise the twelfth row of Pascal’s triangle:
Throughout this paper, denotes the set of all tonic scales in and the set of all -tonic scales in . The symmetric group acts naturally on the sets and : a permutation in acts on a tonic scale by , where one sets . Explicitly, an element of maps a scale to the scale . This defines an action of on , and since for all , the action induces an action on for all . More generally, the group acts on the set of all (atonic) scales in . Moreover, if we consider as a subgroup of , then the action of on descends to the action of on . Although in this paper we focus mainly on the action of on , many of our results ascend appropriately to the action of on .
The group has elements (and the group has elements). A “musical” scale acted on by a randomly chosen element of is very unlikely to be very musical. For example, under the permutation , the heptatonic major scale maps to the somewhat “unmusical” scale . However, by contrast, some permutations preserve musicality fairly well, e.g., the permutation , which swaps the major scale and the harmonic minor scale . One of the main questions we investigate in this paper is the following: are there medium-sized subgroups of whose actions on preserve “musicality”? An ideal subgroup of would be one that “respects musicality” in the sense that scales of approximately the same “musicality” appear in the same orbit. As we will see, some subgroups of induce actions on the heptatonic scales that preserve musicality better than others do. One of our main claims is that the group of generated by is the “best” such subgroup, and the 32 scales in its unique maximal orbit, which coincide with the 32 thāts of Hindustani (North Indian) classical music, represent under a particular measure the “most musical” scales among the 462 possible heptatonic scales.
To measure the musical efficacy of a subgroup of , we define the -musicality of a scale to be the size of the -orbit of in divided by the average size of a -orbit of . A consequence of this definition is that the -musicality is a small as possible, namely , for all scales if and only if every orbit has the same number of elements, which holds if or if is the trivial group. In general, the -musicality of a given scale will attain a maximum for some subgroups of in between those two extremes.
The intution behind the concept of -musicality is that, if a scale has a small -orbit relative to the average -orbit size, then it has too much symmetry relative to and thus the notes comprising the scale are more “-equivalent” to each other and therefore have fewer notes that have their own individual character, whereas scales with larger orbits relative to the average orbit size are comprised of notes that can be better differentiated, or distiguished from one another, by the group . The thesis of this paper is that the subgroups of that yield the largest possible -musicality (relative to the other subgroups of ) of any -tonic scale naturally lead to mathematically and musically interesting theories of -tonic scales, namely, those -tonic scales with the largest -musicality for any subgroup of .
There is a slight subtlety here, however, since, given a group and a finite -set , the average size of a -orbit of can be measured in at least two distinct ways. One might naively define the average size of a -orbit of to be
where is the set of all -orbits of and where is the number of orbits. This represents the average number of elements in each orbit, in a naive sense. However, one may also define the average orbit size of the elements of to be
This represents the expected value of for , where each element of is equally likely to be chosen. By contrast, the number previously considered represents the expected value of , where each orbit is equally likely to be chosen. Since our focus is on the elements of rather than on the orbits, is a better notion of average orbit size than is the naive definition .
Nevertheless, both of these measures of “average orbit size” have mathematical merit, and this is supported by the observation that, using power means, one may continuously deform one of these two means to the other, as follows. For any , we define the -power mean orbit size of the elements of to be
This represents the -power mean of over all . Clearly, for is the average orbit size of the elements of . Moreover, for , we have
and therefore is the average number of elements of in each orbit. Taking appropriate limits at , one can define for all , and then
is the maximal orbit size of , and
is the minimal orbit size. Clearly, then, every -orbit of has the same size if and only if the function is constant with respect to . One can use calculus to show that, if the function is not constant, then it is bounded and has positive derivative everywhere. Moreover, from and the function for , one can recover all of the orbit sizes. Our general philosophy is that the critical region of interest of the function is the interval , with the value at being the most important.
For any finite -set , we define the -power diameter of to be
In other words, is the ratio of the maximal -orbit size of to the -power mean orbit size of the elements of . The function , if not identically , has negative derivative with respect to and has limiting values of and at and , respectively. The -power diameter represents in an intuitive sense the amount of spread in the “kinetic energy” or “entropy” of the elements of under the action of , where elements with larger orbits, or equivalently with smaller stabilizers, are considered to have more kinetic energy. The mathematical problem we pose here is the following.
Problem 1.1.
Given a group , a finite -set , and , determine the subgroups of for which is largest.
Such subgroups of maximize the spread of the “kinetic energy” of the elements of under the induced goup action.
The following is our main result regarding heptatonic scales in .
Theorem 1.2 (with James Allen, Paul Estrada, and Michael McCann).
For all , the subgroups of for which is largest are the group generated by , along with its conjugate subgroups.
The theorem can be proved numerically using GAP and SAGE, as follows. First, note that, for any group and any finite -set , the number for any subgroup of depends only on the conjugacy class of . The group has 3094 subgroups up to conjugacy. (Those that are cyclic have order , , , , , , , , , , , , , , , , , , , or .) Using GAP and SAGE, one can compute generators for representatives of all 3094 conjugacy classes, and for each of these representatives one can compute the -orbits of . Then the functions can be plotted and verified to achieve a maximum for for all in the interval .
The fact that attains a maximum on the entire interval for a single conjugacy class of subgroups of is in itself a surprising result. One has
Thus, for example, a consequence of Theorem 1.2 is that, for any subgroup of , the maximal -orbit size of is at most times the average -orbit size of the elements of , and the maximum ratio possible is obtained precisely by the group and its conjugates.
Given a subgroup of and a scale , we define the -musicality of to be the quantity
which is the size of the -orbit of relative to the -power mean orbit size of the elements of . Thus, the quantity
represents the largest possible -musicality of any -tonic scale. Theorem 1.2 says that the heptatonic scales with the largest possible -musicality for any subgroup of and any occur precisely for the group and its conjugates. These scales comprise the unique maximal -orbit of and consist of the 32 heptatonic scales
or equivalently, starting, say, at C, the 32 scales
listed in Table 1. This orbit contains the major scale, the harmonic and melodic minor scales, and many other heptatonic scales that figure prominently in Western and Indian classical music. In fact, all 32 of these scales are among the 72 mēḷakarta ragas of Carnatic (South Indian) classical music standardized by Govindacharya in the 18th century and coincide with the 32 thāts of Hindustani classical music popularlized by the system created by Vishnu Narayan Bhatkhande (1860–1936), one of the most influential musicologists in the field of Hindustani classical music in the twentieth century.
major, Ionian mode, or Bilāwal thāt | C | D | E | F | G | A | B |
Mixolydian or Adonai malakh mode, or Khamaj thāt | C | D | E | F | G | A | B |
harmonic major | C | D | E | F | G | A | B |
Mixolydian b6 | C | D | E | F | G | A | B |
Lydian mode, or Kalyan thāt | C | D | E | F | G | A | B |
acoustic, or Lydian dominant | C | D | E | F | G | A | B |
C | D | E | F | G | A | B | |
minor Lydian | C | D | E | F | G | A | B |
ascending melodic minor | C | D | E | F | G | A | B |
Dorian mode, or Kāfi thāt | C | D | E | F | G | A | B |
harmonic minor | C | D | E | F | G | A | B |
natural minor, Aeolian mode, or Āsāvari thāt | C | D | E | F | G | A | B |
diminished Lydian | C | D | E | F | G | A | B |
Ukrainian Dorian | C | D | E | F | G | A | B |
Hungarian minor | C | D | E | F | G | A | B |
gypsy | C | D | E | F | G | A | B |
C | D | E | F | G | A | B | |
C | D | E | F | G | A | B | |
double harmonic, or flamenco mode | C | D | E | F | G | A | B |
Phrygian dominant | C | D | E | F | G | A | B |
Mārvā thāt | C | D | E | F | G | A | B |
C | D | E | F | G | A | B | |
Pūrvi thāt | C | D | E | F | G | A | B |
C | D | E | F | G | A | B | |
Neapolitan major | C | D | E | F | G | A | B |
Phrygian raised sixth | C | D | E | F | G | A | B |
Neapolitan minor | C | D | E | F | G | A | B |
Phrygian mode, or Bhairav thāt | C | D | E | F | G | A | B |
C | D | E | F | G | A | B | |
C | D | E | F | G | A | B | |
Todi thāt | C | D | E | F | G | A | B |
Bhairavi thāt, or Pelog (approximate) | C | D | E | F | G | A | B |
The remainder of this paper is organized as follows. In Section 2 we discuss power means and expected values, and in Section 3 we apply Section 2 to the study of the power mean orbit size and diameter of a finite -set. In Section 4 we apply Section 3 to the study of scales in . In Section 5 we focus on heptatonic scales in particular, while in Section 6 we briefly study hextonic scales, and in Section 7 we study pentatonic scales.
I would like to thank the two reviewers for their thoughtful and invaluable input on the first draft of this paper. As one of the reviewers pointed out, it is likely that the methods of this paper can be combined synergistically with other ways of understanding musical scales, such as those developed in [1], [2], [3], [4], [5], [6], [7], [8], and [9]. It is not my intention that the results in this paper be definitive. My aim is merely to provide yet another perspective on the already well-developed mathematical theories of musical scales, one that, in my view, has also inspired some new and interesting problems regarding the theory of group actions. Based on the reviewers comments, I also discuss some ways in which the theory might be amended or developed further.
The research for this paper was conducted with undergraduate students James Allen and Paul Estrada and MS student Michael McCann at California State University, Channel Islands, in the academic year 2016–17, under the supervision of the author. The idea for the project began with conversations between the author and undergraduate student Vickie Chen during a semester-long project on group theory in music for a first course in abstract algebra. It is in those conversations that Vickie and I first came up with the idea of examining the maximal -orbit of , an idea that, to our pleasant surprise, eventually led to Theorem 1.2.
2. Power means and expected values
Arithmetic means are generalized by what are known as power means. If is a finite set of cardinality and a positive real-valued random variable on (with the uniform distribution on ), then, for any nonzero , the -power mean of is defined to be the positive real number
where for all . Equivalently, the arithmetic mean of is just , and one sets
For , one defines
It is well known that
is the geometric mean of the , while
and
are the maximum and minimum, respectively, of the .
Let denote the set of all extended real numbers. For all one has
for all and
It follows that, if , that is, if , then
for all .
We may generalize the definition of by assuming that is a finite probability space with probability distribution , which we assume is nonzero at all elements of , and is a positive real-valued random variable on . (Previously we implicitly assumed that for all .) We may define the -power expected value of to be
Equivalently, the expected value of is just , and one sets
For , one defines
One has
while
and
If is constant, then clearly is a constant function of and one has for all and all . Conversely, if is a constant function of , then , whence must be constant.
One can show that the function of is differentiable with nonnegative derivative, and is therefore nondecreasing, with respect to . Thus, one has
for all . Moreover, if is nonconstant, then has positive derivative, and therefore is strictly increasing, with respect to . In other words, if nonconstant, the function of is a sigmoid function, that is, it is a bounded differentiable function from to whose derivative is everywhere positive. Thus it has horizontal asymptotes, specifically at and at and , respectively. Thus, its graph is an “S-shaped” curve. For an explicit example using the uniform probability distribution, see Figure 1.

3. Power mean orbit size of a finite -set
Throughout this section, denotes a group and a finite -set. One may readily generalize all of what follows to the situation where is also assumed to be a probability space; in that case, one simply replaces all -power means with -power expected values.
For any , we define the -power mean orbit size of the elements of to be
which, for is given by
As observed in the introduction, is the average orbit size of the elements of , and is the average number of elements of in each orbit, while is the maximal orbit size of and is the minimal orbit size.
For any , we define the -power relative size of to be
The -power relative size of is the size of the orbit of relative to (or normalized with respect to) the -power mean orbit size of the elements of . The -power mean of the orbit sizes of the elements of is equal to
of , while the -power mean of the -power relative sizes of the elements of is equal to :
Because of this normalization property, if and are subgroups of , then it makes sense to compare the values of and with each other.
The -power diameter of , as defined in the introduction, is equivalently the maximal -power relative size of an element of , that is, one has
The function , if not identically , has negative derivative with respect to and has limiting values of and at and , respectively.
For example, Figure 2 provides the graph of and for any -set with orbit sizes , , , , and .

4. Musicality of scales and groups
Throughout this section, denotes a subgroup of , which acts on the set of all possible tonic scales, as well as on the subsets of all possible -tonic scales, for all , as described in the introduction, and denotes a number or variable with values in the extended reals .
For any , that is, for any tonic scale , we define the -musicality of , or of the orbit , to be
which is the -power relative size of in (not in ). Musicality defines a natural function
where denotes the lattice of all subgroups of . For a fixed , , and , the -power mean of over all -tonic scales is equal to . The -musicality of a -tonic scale is directly proportional to the size of its -orbit. The constant of proportionality depends on and is defined natually in such a way that one can meaningfully compare the values for various subgroups of for a fixed , or compare the values for various scales for a fixed group .
Observe that
Thus is the largest possible -musicality of a scale in , or equivalently it is the -musicality of any maximal -orbit of . If we let denote the set of all maximal -orbits of , then the union is the set of all scales in that have the largest possible -musicality (namely, ) for any . We call the scales in the set the -tonic scales of . Our philosophy is that the scales in the set should be regarded as the optimally musical -tonic scales relative to .
Let be a subgroup of . We define a signature of (in ) to be a list of the -orbit sizes of , listed in any particular order. In particular, one has for any signature of in . We say that acts without crossings (in ), if the -orbits of the -set are all of the form , where the addition here is ordinary addition of positive integers, not addition modulo . (In the case at hand, , and is omitted from the discussion because we have chosen to leave fixed by .) It is clear that every subgroup of is conjugate to a subgroup that acts without crossings. In fact, the conjugates of that act without crossings in are in one-to-one correspondence with the signatures of in . If acts without crossings in , we define the signature of (in ) to be the list of the orbit sizes of , listed in order so that the orbits are , , etc.
Let be a sequence of positive integers whose sum is . Then the group naturally embeds into in the following way. The first factor acts on the first numbers, . The next factor acts on the next numbers, . And so onward to the last factor , which acts on the last numbers, to . We denote the image of the embedding
described above by . We say that a twelve tone group of signature at most is a subgroup of of the form , where is a subgroup of for all . Note that the group is the largest twelve tone group of signature at most in the sense that it contains every twelve tone group of signature at most . For this reason we call it the maximal twelve tone group of signature . For example, the group the maximal twelve tone group of signature , and therefore every subgroup of is a twelve tone group of signature . Note that all twelve tone groups act without crossings, and the signature of the maximal twelve tone group of signature is .
Of course, there are other natural embeddings of in . For example, can be embedded in by allowing the first factor to act, say, on and the second factor on . Such an embedding does not yield a twelve tone group with signature . In loose terminology, twelve tone groups do not allow the factors to act in a ways that are “intertwined”: no “crossing” is allowed. Philosophically, this restriction can be motivated as follows. The chromatic scale has a linear ordering, and our goal is to understand how various scales may be transformed from one to the other. The most obvious and most common way in which this is done is by changing various notes of the scale by applying operations , which can be modeled by permuting neighboring notes. The notion of a twelve tone group is meant to capture this notion of locality.
Even without these locality restrictions, our mathematical analysis of twelve tone groups will apply equally well to any of the embeddings of in that allow crossings, because we can simply relabel the numbers through so that there are no crossings. We say that a maximal twelve tone group of signature with or without crossings is a subgroup of that is the result of applying some (inner) automorphism of to the maximal twelve tone group . We note the following proposition, whose proof is elementary.
Proposition 4.1.
There are possible signatures, hence maximal twelve tone groups, corresponding to independent choices of whether or not to separate from , for . There are maximal twelve tone groups with or without crossings, corresponding to the possible partitions of an eleven element set. Up to isomorphism, there are maximal twelve tone groups (or maximal twelve tone groups with or without crossings), corresponding to the possible partitions of the number .
5. Heptatonic scales
In this section we are primarily interested in the action of subgroups of on the set of all heptatonic (tonic) scales, which has 462 elements.
The following proposition gives a formula for for any maximal twelve tone group . Multisets are a generalization of sets where, as with tuples, repetition is allowed, but, as with sets, order doesn’t matter.
Proposition 5.1.
Let be a maximal twelve tone subgroup of of signature with or without crossings. The multiset of -orbit sizes of is the multiset
of positive integers. Therefore, one has
for all , the maximal -orbits of have size
and one has
One also has
and the number of -orbits of is equal to
Proof.
The proof is elementary. ∎
Clearly, this proposition generalizes to -tonic scales by replacing everywhere in the proposition with , replacing the number everywhere with the number , and replacing the number 462 with . (It even works for an -note chromatic scale by replacing everywhere with .) It also generalizes to -atonic scales.
Using the proposition, one can compute for all 56 maximal twelve tone groups , say, for the critical values . These values are listed in Table 2 in descending order of .
Signature | Maximal orbits | orbits | |||
---|---|---|---|---|---|
1 of size 32 | 96 | 3.5250 | 4.8324 | 6.6494 | |
1 of size 48 | 61 | 3.0689 | 4.3060 | 6.3377 | |
3 of size 16 | 131 | 2.7603 | 3.5264 | 4.5368 | |
1 of size 48 | 48 | 2.6679 | 3.4917 | 4.9870 | |
3 of size 24 | 83 | 2.4115 | 3.1423 | 4.3117 | |
1 of size 80 | 26 | 2.3864 | 3.1193 | 4.5022 | |
1 of size 72 | 31 | 2.3203 | 3.1114 | 4.8312 | |
10 of size 8 | 179 | 2.1513 | 2.5733 | 3.0996 | |
3 of size 36 | 53 | 2.1027 | 2.8000 | 4.1299 | |
3 of size 24 | 65 | 2.0921 | 2.5480 | 3.3766 | |
1 of size 72 | 25 | 2.0182 | 2.5229 | 3.8961 | |
1 of size 80 | 18 | 1.9833 | 2.3882 | 3.1169 | |
10 of size 12 | 113 | 1.8844 | 2.2930 | 2.9351 | |
3 of size 40 | 35 | 1.8788 | 2.2763 | 3.0303 | |
3 of size 54 | 34 | 1.8293 | 2.4951 | 3.9740 | |
3 of size 36 | 42 | 1.8261 | 2.2705 | 3.2727 | |
1 of size 140 | 9 | 1.8033 | 2.1611 | 2.7273 | |
1 of size 120 | 14 | 1.8032 | 2.2539 | 3.6364 | |
1 of size 108 | 16 | 1.7533 | 2.2482 | 3.7403 | |
1 of size 120 | 12 | 1.7228 | 2.1281 | 3.1169 | |
35 of size 4 | 245 | 1.6709 | 1.8779 | 2.1212 | |
10 of size 18 | 72 | 1.6480 | 2.0433 | 2.8052 | |
3 of size 60 | 23 | 1.6364 | 2.0284 | 2.9870 | |
10 of size 12 | 88 | 1.6325 | 1.8594 | 2.2857 | |
3 of size 54 | 27 | 1.5907 | 2.0232 | 3.1558 | |
3 of size 36 | 34 | 1.5849 | 1.8411 | 2.6494 | |
3 of size 40 | 24 | 1.5577 | 1.7428 | 2.0779 | |
1 of size 120 | 10 | 1.4994 | 1.7256 | 2.5974 | |
10 of size 20 | 47 | 1.4704 | 1.6611 | 2.0346 | |
1 of size 140 | 6 | 1.4667 | 1.6141 | 1.8182 | |
35 of size 6 | 154 | 1.4667 | 1.6733 | 2.0000 | |
10 of size 27 | 46 | 1.4388 | 1.8207 | 2.6883 | |
10 of size 18 | 57 | 1.4289 | 1.6569 | 2.2208 | |
3 of size 70 | 12 | 1.4224 | 1.5770 | 1.8182 | |
3 of size 60 | 19 | 1.4220 | 1.6448 | 2.4675 | |
3 of size 90 | 15 | 1.4218 | 1.8074 | 2.9221 | |
1 of size 210 | 5 | 1.3618 | 1.5615 | 2.2727 | |
3 of size 60 | 16 | 1.3583 | 1.5529 | 2.0779 | |
1 of size 252 | 3 | 1.3469 | 1.4752 | 1.6364 | |
1 of size 200 | 6 | 1.3379 | 1.5416 | 2.5974 | |
126 of size 2 | 336 | 1.2941 | 1.3704 | 1.4545 | |
35 of size 9 | 98 | 1.2857 | 1.4911 | 1.9091 | |
10 of size 30 | 31 | 1.2848 | 1.4802 | 2.0130 | |
3 of size 100 | 11 | 1.2727 | 1.4694 | 2.3810 | |
1 of size 210 | 4 | 1.2725 | 1.4383 | 1.8182 | |
35 of size 6 | 119 | 1.2692 | 1.3569 | 1.5455 | |
3 of size 105 | 8 | 1.2375 | 1.4053 | 1.8182 | |
10 of size 20 | 32 | 1.2171 | 1.2718 | 1.3853 | |
3 of size 70 | 8 | 1.1538 | 1.1779 | 1.2121 | |
35 of size 10 | 63 | 1.1458 | 1.2122 | 1.3636 | |
126 of size 3 | 210 | 1.1379 | 1.2211 | 1.3636 | |
10 of size 35 | 16 | 1.1149 | 1.1508 | 1.2121 | |
1 of size 252 | 2 | 1.0820 | 1.0864 | 1.0909 | |
3 of size 126 | 4 | 1.0645 | 1.0765 | 1.0909 | |
1 of size 462 | 462 | 1 | 1 | 1 | |
462 of size 1 | 462 | 1 | 1 | 1 |
As the most important example, consider the maximal twelve tone group with signature . Equivalently, is the subgroup of generated by the set , and it is isomorphic to and has order . The orbits in therefore have 1, 2, 4, 8, 16, or 32 elements. Using Proposition 5.1 we find that under the action of has:
-
(1)
orbit of size ,
-
(2)
orbits of size ,
-
(3)
orbits of size ,
-
(4)
orbits of size ,
-
(5)
orbits of size ,
-
(6)
orbits of size ,
for a total of orbits. One therefore has
and
In particular, one has
and
The heptatonic scales of comprise the unique maximal -orbit of , which are the 32 scales
or
where each of the ’s is either a () or a (). This unique maximal -orbit consists of the 32 thāts of Hindustani classical music popularized by Bhatkhande.
One may also consider the maximal twelve tone group of signature , an action that globally fixes instead of . Since changing the order of the numbers in the signature does not affect any of the relevant values, the values above calculated for are the same as for . The maximal -orbit of contains the 32 scales
or
where each of the ’s is either a () or a (). In particular, the most “sharp” of these 32 scales is precisely the major scale. The intersection of the maximal -orbit and the maximal -orbit consists of the 16 scales
We may also consider the compositum of and , which is the maximal twelve tone group of signature . There are:
-
(1)
orbit of size ,
-
(2)
orbits of size ,
-
(3)
orbits of size ,
-
(4)
orbits of size ,
-
(5)
orbits of size ,
-
(6)
orbits of size ,
-
(7)
orbits of size ,
-
(8)
orbits of size ,
-
(9)
orbits of size ,
for a total of orbits. The maximal -orbit is just the union of the maximal -orbit and the maximal -orbit. Consistent with inclusion-exclusion, one has . One has
and
Note that
and
Let us now consider the maximal twelve tone group of signature . This group is also of theoretical and historical importance in Indian classical music, as its unique maximal orbit consists precisely of the mēḷakarta ragas, which are built as follows:
Here we have:
-
(1)
orbit of size ,
-
(2)
orbits of size ,
-
(3)
orbit of size ,
-
(4)
orbits of size ,
-
(5)
orbits of size ,
-
(6)
orbits of size ,
-
(7)
orbits of size ,
-
(8)
orbits of size ,
-
(9)
orbits of size ,
-
(10)
orbits of size ,
-
(11)
orbits of size ,
for a total of orbits. We then have
and
Note that, of the 72 mēḷakarta ragas comprising the maximal -orbit, 32 are scales in the maximal -orbit consisting of 48 scales, but only 16 are scales in the maximal -orbit consisting of 32 scales.
One may also consider the maximal twelve tone group of signature , an action that globally fixes instead of , as does . Since changing the order of the numbers in the signature does not affect any of the relevant values, the values above calculated for are the same as for . The largest -orbit of contains the 72 scales
The intersection of the maximal -orbit and the maximal -orbit consists of the 36 scales
We may also consider the maximal twelve tone group of signature . Its largest orbit consists of possible scales, which is the union of the maximal -orbit and the maximal -orbit. Consistent with inclusion-exclusion, one has . The 108 scales are built as follows:
These 108 scales include all 72 mēḷakarta ragas in the maximal -orbit and all 32 scales from the maximal -orbit, along with 20 others. Indeed, by inclusion-exclusion, the combined total of scales in the maximal -orbit and in the maximal -orbit is only . Similarly, the 108 scales include all 72 mēḷakarta ragas and all 48 scales from the maximal -orbit, along with the same 20 “new” scales. Indeed, by inclusion-exclusion, the combined total of scales in the maximal -orbit and in the maximal -orbit is . The 20 new scales are as follows:
At first sight this appears to be scales; however, 4 scales are repeated twice, namely, the 4 scales
For the group we have:
-
(1)
orbit of size ,
-
(2)
orbits of size ,
-
(3)
orbits of size ,
-
(4)
orbits of size ,
-
(5)
orbit of size ,
-
(6)
orbits of size ,
-
(7)
orbits of size ,
-
(8)
orbits of size ,
-
(9)
orbits of size ,
for a total of orbits. Here we have
and
Thus, we see that
and
Thus, the passing from to decreases “musicality” of the scales in the maximal orbit in a manner that is comparable to passing from to .
The lattice diagram for the maximal twelve tone groups that we have discussed thus far, along with the groups and , are as follows.
The values of for these eight groups are (approximately) as follows.
The signatures of these maximal twelve tone groups are as follows.
The heptatonic scales for these groups are the sets
having cardinalities
It is interesting that and , even though and each consist of three equal-sized orbits (of size 36 and 16, respectively) while and each consist of a unique orbit (of size 108 and 48, respectively).
While the analysis in this section provides some reasons for using the groups and , Theorem 1.2 shows that is unique only up to conjugacy. One ought to ask whether or not is the “best” choice among all of its conjugates for the scales in the maximal -orbit.
We believe that the choice of and , and thus the mēḷakarta raga system, can be justified. All 72 of the mēḷakarta ragas contain and , which is a natural restriction to impose as the interval is a fifth (in fact, a perfect fifth in Indian classical tuning). The mēḷakarta ragas are obtained as follows:
These 72 scales comprise the maximal -orbit of , where is the subgroup of isomorphic to that acts separately on , , , and . In other words, the 72 mēḷakarta ragas are precisely the scales with largest -musciality for any . The choice of and among their conjugates are “natural” choices at the very least to the extent that the perfect fifth is “natural.”
For any subgroup of conjugate to , one is required to choose an element of besides that is globally fixed by the action of . To the extent that the perfect fifth is “natural,” the most natural choice is the element , but one may rightfully choose instead. Either of these is a natural choice since, while the interval is a fifth, the interval is a fourth, and both intervals coincide with perfect harmonic intervals (in some tunings). Moreover, 5 and 7 are the only elements of the cyclic group other than and that generate the whole group, a fact that forms the basis of the circle of fifths and circle of fourths.
Once the choice of a second fixed element of is made, one is required to partition the remaining ten elements of into disjoint two-element subsets for , where is then to act separately on for all . A natural choice is for each and to be consecutive, so that for all . This is because any other choice would require the action to be “non-local,” with the occurence of “crossings,” as, for example, if were to act separately on and . It is clear that every subgroup of is conjugate to a subgroup that acts without crossings. There are exactly six conjugates of that act without crossings, namely, those that fix , , , , , or , respectively. (By definition, all of them fix .) Among these six conjugates of , the only one besides that it may also be natural to consider is the subgroup generated by , which fixes instead of . From this group we obtain the following 32 scales in the unique maximal -orbit:
Half of these 32 scales—those that contain G—were obtained previously using , so among these 32 scales we obtain the 16 additional scales listed in Table 3, namely, those that contain G rather than G. Thus, the union of the maximal -orbit and the maximal -orbit consists of scales.
It is natural also to consider the compositum of and , which is isomorphic to and acts without crossings, separately on , , , , and . The unique maximal -orbit consists of the 48 scales listed in Tables 1 and 2 comprising the union of the maximal -orbit and the maximal -orbit. The group has the virtue that its action on is completely symmetrical, providing a natural theory of heptatonic scales that privileges both the fourth and the fifth equally. Moreover, the group ranks 3rd–8th among the 3094 conjugacy classes of subgroups of for its value of for and 2nd–7th for its values for and . One has
which closely rival the values for and . By contrast, the group fares relatively poorly, ranking 529th–536th among the 3094 conjugacy classes for its value for , ranking 483rd–490th for its value for , and ranking 294th–301st for its value at . These values are as follows:
By this measure, then, the 48 scales in the maximal -orbit are a worthy alternative to the 72 (mēḷakarta) scales in the maximal -orbit.
C | D | E | F | G | A | B | |
---|---|---|---|---|---|---|---|
C | D | E | F | G | A | B | |
C | D | E | F | G | A | B | |
major Locrian | C | D | E | F | G | A | B |
C | D | E | F | G | A | B | |
C | D | E | F | G | A | B | |
C | D | E | F | G | A | B | |
half diminished | C | D | E | F | G | A | B |
C | D | E | F | G | A | B | |
C | D | E | F | G | A | B | |
Persian | C | D | E | F | G | A | B |
C | D | E | F | G | A | B | |
C | D | E | F | G | A | B | |
Locrian 6 | C | D | E | F | G | A | B |
C | D | E | F | G | A | B | |
Locrian mode | C | D | E | F | G | A | B |
It must be noted that the maximal twelve tone groups that are conjugate to , besides , are those of signature , , , and , respectively. These four groups are those that fix , , , and , respectively, instead of or . These six groups appear in three “inverse” pairs: and are inversions, as are those of signature and , as are those of signature and . At this stage, one ought to seek further mathematical justification for the choice of fixing or over and above , , , or , based on more than just the naturality of fixing the perfect fifth ( and) . Regarding this problem, one of the two reviewers wrote the following.
The proposed formal framework provides no actual explanation for the privileged role of and for the choice of the second fixed element. In my view, this is a methodological weakness. Although the authors try to explain it as a “most natural choice” but all the reasoning is based on aspects that are external to the actual formal framework: tuning and generators of . If tuning was crucial, why does play no role in the model? If generators were important, why do or 1 lead to unmusical systems? To solve the issue, I believe, the model should be enhanced by an additional formal constraint resulting in disqualification of other choices for the second fixed element. Below, I theorize about one possible approach (evenness).
As I challenged above, the choice of the other fixed element as 7 or 5 is based on ad hoc arguments. It would be much more elegant if an additional formal concept was introduced from which the two choices of 7 and 5 would formally follow. I think that some generalization of Clough’s concept of evenness could be a viable option. I think that some measure of “average evenness” for systems of scales could be introduced (and computed) and it would disqualify the other choices of the fixed element in the heptatonic scales. Additionally, evenness applies obviously even on the level of particular scales. This would provide a natural ordering of scales with diatonic scales (and so the anhemitonic pentatonic scales) being maximally even.
No doubt this is a promising way to resolve the issue. As mentioned in the introduction, it is possible that the methods of this paper can be combined synergistically with other ways of mathematically justifying the various musical scales. We leave this to the interested reader to pursue further.
The reviewers also commented that one ought to try to generalize Theorem 1.2 to atonic scales, that is, to the action of on the set of all 7-note atonic scales in . Based on that suggestion, we used GAP and SAGE to verify the following theorem, in a manner similar to the proof of Theorem 1.2. Note that there are 10723 conjugacy classes of subgroups of .
Theorem 5.2 (with James Allen).
For all , the subgroups of for which is largest are the group generated by , along with its conjugate subgroups. Moreover, for all , the subgroups of for which is second largest are the group generated by , along with its conjugate subgroups.
The theorem provides further justification that the group and its conjugates are “natural” choices for a theory of musical scales. On the interval , as approaches , one other conjugacy class begins to surpass the group in -power diameter, namely, the conjugates of the group generated by . The values of for for these three conjugacy classes of subgroups of are provided in Table 4. For reasons explained earlier, the values on the interval are more critical than those on the interval .
Signature | Maximal orbits | |||
---|---|---|---|---|
1 of size 64 | 3.9501 | 5.5199 | 7.8384 | |
1 of size 32 | 3.7917 | 5.0929 | 6.9091 | |
1 of size 64 | 3.7183 | 5.0397 | 7.0303 |
6. Hexatonic scales
It is most common to obtain a hexatonic scale in one of the following three ways: (1) deleting a note from a given heptatonic scale (e.g., the major and minor hexatonic scales are obtained from the major and natural minor heptatonic scales by deleting the seventh note and the sixth note, respectively); (2) adding a note to a given pentatonic scale (e.g, the major and minor blues hexatonic scales are obtained from the major and minor pentatonic scales by adding an extra half step after the third note in each); and (3) combining three non-overlapping triads.
A fourth way of obtaining a hexatonic scale from a heptatonic scale is as follows. Define the (tonic) complement of a -tonic scale to be the -tonic scale
Of course one has for all scales . For any action of on , there is a (tonic) complementary action of defined by
for all and all . Moreover, the induced action on corresponds to the induced action on . As a consequence, our results on actions of the subgroups of on yield corresponding results on the actions of the subgroups of on . Thus, by Theorem 1.2 and complementarity, we have the following.
Theorem 6.1.
For all , the subgroups of for which is largest are the group generated by , along with its conjugate subgroups.
Also by complementarity, each of the theories of heptatonic scales developed in Section 5 has a complementary theory of hexatonic scales: the sets of complements of the scales in the maximal orbits of a given action on are precisely the maximal orbits of the complementary action on . However, none of the 32 scales in the maximal orbit of under the action of our twelve tone group of signature contains the interval of a fifth. Consequently, this particular group is perhaps not the most natural for yielding interesting hexatonic scales. For this purpose we single out the subgroup of generated by the set , which is the maximal twelve tone group of signature , and the subgroup of generated by the set , which is the maximal twelve tone group of signature . Among the 32 scales
in the maximal -orbit of appear the whole tone scale (the complement of Neopolitan major), the Prometheus scale (the complement of Neopolitan minor), and the augmented scale . Among the 32 scales
in the maximal -orbit of appear the whole tone scale, the major hexatonic scale , the minor hexatonic scale , and the tritone scale . The whole tone scale is the only scale that lies in both sets of 32 scales. Unfortunately, the major and minor blues hexatonic scales do not appear in either set but rather have -orbits and -orbits of size .
7. Pentatonic scales
The most prominent of the pentatonic scales are the five black-key pentatonic scales formed by the black keys of a piano: the major and minor and blues major and minor pentatonic scales and the Egyptian, or suspended, pentatonic scale. Let denote the subgroup of generated by . In other words, is the maximal twelve tone group of signature . The 16 scales in the unique maximal -orbit of are the 16 scales
listed in Table 5. Among these 16 scales are the five black-key pentatonic scales.
major | C | D | E | G | A |
---|---|---|---|---|---|
C | D | E | G | A | |
C | D | E | G | A | |
C | D | E | G | A | |
Blues major, or Ritsusen, or yo | C | D | F | G | A |
Egyptian, or suspended | C | D | F | G | A |
C | D | F | G | A | |
C | D | F | G | A | |
C | D | E | G | A | |
C | D | E | G | A | |
C | D | E | G | A | |
C | D | E | G | A | |
C | D | F | G | A | |
minor | C | D | F | G | A |
C | D | F | G | A | |
Blues minor, or Man Gong | C | D | F | G | A |
The following theorem was proved using GAP and SAGE in a manner similar to the proof of Theorems 1.2 and 5.2.
Theorem 7.1.
For all , the subgroups of for which is largest are the group generated by , along with its conjugate subgroups. Moreover, for all , the subgroups of for which is second largest are the group generated by , along with its conjugate subgroups.
On the interval , as approaches , several other subgroups begin to surpass the group in -power diameter.
The group of the theorem is isomorphic to , and the group is a subgroup of isomorphic to . For the action of on there are:
-
(1)
orbit of size ,
-
(2)
orbits of size ,
-
(3)
orbits of size ,
-
(4)
orbits of size ,
-
(5)
orbits of size ,
-
(6)
orbits of size ,
for a total of orbits. For the action of on there are:
-
(1)
orbit of size ,
-
(2)
orbits of size ,
-
(3)
orbits of size ,
-
(4)
orbits of size ,
-
(5)
orbits of size ,
for a total of orbits. From this we deduce that
and
One of the reviewers suggested the following theory of atonic pentatonic scales alternative to our theory of tonic pentatonic scales.
I really like the idea of complementarity. It is a very elegant way of dealing with related systems. However, from the musical perspective it seems quite counterintuitive that the hexatonic scales turn out to be the complements to the heptatonic scales. How a Neapolitan minor scale is complementary to the whole tone scale? Of course, it follows from the feature that one tone is fixed in all scales and only the others are movable.
However, if this feature is reconsidered, one might achieve an elegant explanation of the pentatonic scales while keeping the heptatonic scales in check. A scale would be any subset of , not necessarily containing 0. Instead of , one would consider actions of subgroups of . The definition of local actions would require a cosmetic change: it would need to consider the cyclic nature of .
As regards the heptatonic scales, I conjecture that one would obtain the maximal -orbit diameters with the signature . To fix one could consider the permutations of the signature starting with . Then the evenness should lead to two resulting signatures and . One could even call them the authentic and the plagal systems.
And for the pentatonic scales one should get a perfect remedy. The pentatonic scales would be the complementary scales to the heptatonic scales. Therefore, the maximal twelve tone groups are conjugates of . It would be elegant to consider the conjugate group generated by . This makes movable, but leads to relevant musical scales. It is nice that this way the system includes not only the yo but also the in pentatonic scales. (The Japanese music theorist Uehara proposed two basic pentatonic modes: yo (anhemitonic) and in (hemitonic).)
Under the scheme of pentatonic atonic scales proposed above, the largest orbit has elements, exactly half of which are tonic scales.
References
- [1] Carey, Norman, and David Clampitt. “Aspects of Well-Formed Scales.” Music Theory Spectrum 11, no. 2 (1989): 187–206.
- [2] Clough, John, and Jack Douthett. “Maximally Even Sets.” Journal of Music Theory 35, no. 1/2 (1991): 93–173.
- [3] Clough, John, Nora Engebretsen, and Jonathan Kochavi. “Scales, Sets, and Interval Cycles: A Taxonomy.” Music Theory Spectrum 21, no. 1 (1999): 74–104.
- [4] Clough, John, Jack Douthett, N. Ramanathan, and Lewis Rowell. “Early Indian Heptatonic Scales and Recent Diatonic Theory.” Music Theory Spectrum 15, no. 1 (1993): 36–58.
- [5] Douthett, Jack. “Filtered Point-Symmetry and Dynamical Voice-Leading.” In Music Theory and Mathematics: Chords, Collections, and Transformations, edited by Jack Douthett, Martha M. Hyde, and Charles J. Smith, 72–106. University of Rochester Press, 2008.
- [6] Hook, Julian. “Signature Transformations.” In Music Theory and Mathematics: Chords, Collections, and Transformations, edited by Jack Douthett, Martha M. Hyde, and Charles J. Smith, 137–160. University of Rochester Press, 2008.
- [7] Hook, Julian. “Spelled Heptachords.” In Mathematics and Computation in Music—MCM 2011, edited by Carlos Agon, Emmanuel Amiot, Moreno Andreatta, Gérard Assayag, Jean Bresson, and John Mandereau, 84–97. Springer, 2011.
- [8] Tymoczko, Dmitri. “Scale Networks and Debussy.” Journal Of Music Theory 48, no. 2 (2004): 219–294. Tymoczko, Dmitri. A Geometry of Music: Harmony and Counterpoint in the Extended Common Practice. Oxford University Press, 2011.
- [9] Žabka, Marek. “Dancing with the Scales: Sub-Chromatic Generated Tone Systems.” Journal of Music Theory 58, no. 2 (2014): 179–233.