Solymosi's Sum-Product estimate

78

By nhkatz

What's this?

 I am writing this hub to describe a recent paper in mathematics by Jozsef Solymosi. I provide a link to the paper below. The paper proves a result concerning sums and products of real numbers. The result is not known to be optimal and generally not believed to be, so it serves as the introduction to a research problem. Nevertheless, Solymosi's paper has the best currently known result, so it is reasonable to believe that understanding it fully is all you need to get going. Expositions of this paper  have been written already, notably on Izabella Laba's blog to which I link below, but I'd like to write one here as, hopefully, the first in a series exploring possible improvements. Participation through comments is welcome.

Incidentally, the idea of doing research through a blog is currently coming into fashion on the web. Tim Gowers would like to write a paper with a large number of coauthors in the way one writes a story with a large number of coauthor's. (Each one writes one sentence!) If you'd like to join in, or read his description of how this is supposed to work, I provide a link to his blog below.

Solymosi, himself

What's the problem

 Before going any further, I want to express what mathematical background the reader might need to understand this hub. First the reader should be familiar with the notion of a set. If the set is finite, the reader should be familiar with the notion of the number of elements in the set (its cardinality, if you will.) If A is a finite set, we will denote by |A|, the number of elements in A. Further, it would be helpful for the reader to be familiar with addition and multiplication of real numbers. Actually, this is a tricky business. I plan for my next hub to explain this topic fully. (It will be my first hub in the "Calculus for Cranks" series.) If you don't know what real numbers are, every time I mention real numbers, pretend that I am talking about rationals. I expect all my readers to know how to add, subtract, mutliply, and divide fractions. That's it.

Given a finite set of real numbers A, I will now define its sumset A+A, and its product set AA.

A+A={a+b: a and b are in A}  ;  AA={ab: a and b are in A}

In other words, the sumset is the set of distinct sums of two elements of A and the product set is the set of distinct products of two elements of A. Let us look at two examples.

Example 1:

A={1, ... , n}

A is the set of the first n natural numbers, an arithmetic progression if you will. Then

A+A={2,3,...,2n}

and has only 2n-1 elements, while AA is a rather large subset of {1, ... , n2}.

Example 2:

A={2,4,8, ..., 2n}

A is the set of the first n powers of 2, a geometric progression if you will. Then

AA={22,23, ... , 22n}

and is pretty small, while A+A is rather large.

EXERCISE: Calculate |A+A|

The Sum Product problem:  Given a set of real numbers with |A|=N, how small can the maximum of the numbers |A+A| and |AA| be?

A trendy book about sumsets and product sets

Additive Combinatorics (Cambridge Studies in Advanced Mathematics)
Amazon Price: $115.35
List Price: $110.00

What kind of answer do I want? (Asymptotics)

For any given value of N, the answer to the above question is a number. Thus the full answer is a function of N, call it f(N). We don't really expect to find an exact formula for f(N). (Call us pessimists.) It would be enough to understand roughly how fast f(N) grows as N grows. How can we make this precise? We do it with O and Ω notation.

If f(N) and g(N) are two functions of N, we say that f(N) is O(g(N)) if there is a positive number C independent of N so that we always have f(N) ≤C g(N). Similarly, we say that f(N) is Ω(g(n)) if there is a positive number C so that

 f(N) ≥C g(N).

In what follows, we restrict our attention to f(N) being the smallest possible value of the maximum of the two numbers |A+A| and |AA| for any set A of N real numbers. (This is a kind of minimax. We pick A to be the set of a given size making this maximum smallest. It is somewhat mysterious what this set might look like.)

There is a very old guess by Erdös and Szemeredi about how fast f(N) might grow. They conjectured that for any α < 2 , one has that f(N) is Ω(N α). We abbreviate this by saying that f(N) is Ω(N2-). You can think of this as meaning that f(n) is almost Ω(N2). This is consistent with the examples above. This conjecture is considered pretty important. The reader who solves it will be famous, at least in Hungary.

Now we are ready to state Solymosi's result which is weaker than the conjecture. (And he's alreadyfamous in Hungary.) He showed that f(N) is Ω(N4/3-). How did he arrive at this? That is what will be explained in the rest of this hub.

Paul Erdos: He conjectured more than he proved

How Solymosi did it.

We may assume our set A consist only of positive numbers.

Exercise: Why can we assume that?

We need a lemma saying essentially that most children belong to large families.

Lemma Suppose that Nα objects are put in Nβ boxes. Then Ω(Nα) objects are in boxes containing Ω(Nα-β)objects

Proof Consider those boxes containing fewer than ½Nα-β objects. In all they contain fewer than ½Nα objects. Q.E.D.

Now we suppose that |A|=N, |A+A|=Na,|AA|=Nb. We're looking for a lower bound on maximum of a and b. We define a multiplicative quadruplet to be an ordered quadruplet (s,t,u,v) of elements of A so that st=uv. Using the lemma with pairs (s,t) as the objects and products st as the holes, we see that there are Ω(N2) pairs which have products that can be represented in Ω(N2-b) ways. Therefore there are Ω(N4-b) multiplicative quadruplets.

Next we notice that whenever we have a multiplicative quadruplet (s,t,u,v) then it follows that s/u=v/t. Now we apply the lemma with multiplicative quadruplets being the objects and pairs (s,u) being the boxes, to see that there are Ω(N4-b) quadruplets with the property that each one has (s,u) which appears in Ω(N2-b) others. We denote the set of quotients s/u comingfrom these quadruplets as Q. For each quotient q in Q, we denote r(q) to be the number of representations s/u of q as a quotient of elements in A. Then for every q in Q, we have that r(q) is

 Ω(N2-b). Finally, we observe

∑r(q)2 = Ω(N4-b),

where the sum on the left is taken over Q and represents the set of quadruplets with the good property.

It would be convenient if all the numbers r(q) were about the same. They may not be but we can sort them into boxes in which they are. Precisely, we let

Qj={q in Q: 2j ≤ r(q) < 2j+1}

Clearly Qjis empty unless 2j ≤ N and unless 2jis Ω(N2-b). In particular, there are at most O(log N) nonempty sets Qj and since log N grows slower than any power of N we see that there is a j so that

∑r(q)2 = Ω(N(4-b)-)

where the sum is taken over Qj. Thus there are Ω(2-2j N(4-b)-) quotients in Qj. We number these from smallest to largest,

q1<q2<...<qM,

where M is Ω(2-2jN(4-b)-). We consider the pairs (s,u) of elements of A which have a particular quotient ql. They lie on the line y=ql x. (Notice we just switched to two dimensions!) Denote this set of points all on a line Ll. Now consider the consecutive sumsets Ll+Ll+1. They are disjoint because they lie between the consecutive lines y=qlx and y=ql+1x. Moreover no two pairs yield the same sum. (It's like summing points on the x-axis with points on the y-axis.) Thus

|Ll+Ll+1| ≈22j.

Now we take the union of the disjoint sumsets running over l and we get that the number of sums is Ω(N(4-b)-). But each sum is a pair of elements in A+A. There are only N2apairs of elements in A+A. Thus one of a or b is at least (4/3)-. Q.E.D.

That is the whole of Solymosi's argument. It is pretty simple as these things go. It does not solve the problem. I hope to write more hubs and perhaps discuss in the comments how one might hope to do better. Let me begin by giving a hint. What happens to the sums of points on non-consecutive lines?

Comments

Aya Katz profile image

Aya Katz Level 4 Commenter 3 years ago

Nets, I am having trouble reading the entire hub in one sitting, so I hope that you will forgive me if I ask a small question after reading only a part of the hub.

You write:

"A+A={2,3,...,2n}

and has only 2n-1 elements, while AA is a rather large subset of {1, ... , n2}."

Does this imply that both A+A and AA contain only some of the members of the set A, but they also contain numbers that are not included in A?

 

nhkatz profile image

nhkatz Hub Author 3 years ago

In this example, A+A does not contain 1 because you cannot achieve 1 as a set of two positive integers. AA does in fact contain all of A because 1 is an element of A and so the product of a with 1 is in AA for any a in A.

In general, AA and A+A need not contain A.

Aya Katz profile image

Aya Katz Level 4 Commenter 3 years ago

Nets. thanks for answering my question. I have now read the next section of the hub. My question about this section isn't really mathematical. You say that the conjecture by Erdös and Szemeredi is considered important. Why is it important? What is its importance? Does "important" have a special meaning when used in mathematics?

I mean, to the lay person, something that is just a conjecture seems unimportant in relation to something that has been proved.

nhkatz profile image

nhkatz Hub Author 3 years ago

Aya,

Think of a conjecture as a question. Is the conjecture true or is it false? Surely, the lay person agrees that a question could be important. For instance, we might think that it is important to know whether non-human apes can achieve literacy before the question has been resolved and not only in hindsight.

Why should one conjecture be more important than another in mathematics? This is a subtle issue often overtly political. But generally a conjecture becomes important if it seems fundamental, if a major breakthrough on it would be likely to impact other problems of wide current interest, and if it is old and has resisted solution by a number of well-admired mathematicians in the past.

The Erdos-Szemeredi conjecture has these things going for it. As a practical matter, it is important in following meaningful (and political) sense. If you could provide a correct solution, you would receive a lot of attention from the mathematical community regardless of your academic pedigree.

Nets

Aya Katz profile image

Aya Katz Level 4 Commenter 3 years ago

Is there any literature about how Erdös and Szemeredi arrived at their conjecture?

nhkatz profile image

nhkatz Hub Author 3 years ago

In the preprint of Solymosi to which I linked there is a reference to the 1983 paper of Erdos and Szemeredi in which the question was posed and you might find what you want there. In fact, it is the most natural conjecture one can make when one realizes that one has no examples significantly more interesting than the ones above.

I could tell you more about how certain related sum product problems which are quite fashionable now arose, but you might not be quite as interested.

Nets

Aya Katz profile image

Aya Katz Level 4 Commenter 3 years ago

Nets, here is a really nit-picky question, but it's one of the things that sometimes make it impossible for me to fully concentrate on the flow of a mathematical argument. The question is this: in your hub are lower case "n" and uppercase "N" the same, or is there some relationship between the two?

You write; "For any given value of N, the answer to the above question is a number. Thus the full answer is a function of N, call it f(N). We don't really expect to find an exact formula for f(N). (Call us pessimists.) It would be enough to understand roughly how fast f(N) grows as N grows."

As I'm reading this I think: "N. Hmm do I know what N is? I think it stands for number, but just to be sure let's go back to the previous section and check."

So then I re-read the previoius section, and I notice a lot of lower case n's, as in: "A={1, ... , n}".

Lower case n seems to be the last number of the set of natural numbers, and I take it that it can be anything, as long as it was the last in a series of natural numbers. Upper case N doesn't show up until the next to last line of that section:

"The Sum Product problem:  Given a set A of real numbers with |A|=N"

Is there any logical relationship between upper case N and lower case n, or do they describe two completely unrelated numbers, like two people whose name just happens to be Ned?

Or did you use the n/N designation because you believe there is a relationship, but you don't know what it is?

 

nhkatz profile image

nhkatz Hub Author 3 years ago

There's always the danger that my hub will contain a misprint. But in this case, the N in question is the one that appears in the statement The Sum Product Problem. You should now ignore the examples. They are now over.

john rey tuso 22 months ago

what the answer

Submit a Comment
Members and Guests

Sign in or sign up and post using a hubpages account.



    • No HTML is allowed in comments, but URLs will be hyperlinked
    • Comments are not for promoting your Hubs or other sites

    Please wait working