Dimensione VC dei polinomi rispetto alle semenze tropicali?

Come in questa domanda, sono interessato al problema $\mathbf{BPP}$ vs. $\mathbf{P}$ / per i circuiti tropicali e . Questa domanda si riduce a mostrare i limiti superiori per la dimensione VC dei polinomi rispetto alle semirazioni tropicali (vedere Teorema 2 di seguito). $\mathrm{poly}$ $(\max,+)$ $(\min,+)$

Sia un semiring. Un modello zero di una sequenza di polinomi in è un sottoinsieme per il quale esiste e tale che per tutti $R$ $(f_1,\ldots,f_m)$ $m$ $R[x_1,\ldots,x_n]$ $S\subseteq \{1,\ldots,m\}$ $x\in R^n$ $y\in R$ $i=1,\ldots,m$ , sse . Cioè, i grafici di esattamente quei polinomi con devono colpire il punto . ("Zero-pattern" perché la condizione può essere sostituita da ) Let $f_i(x)= y$ $i\in S$ $f_i$ $i\in S$ $(x,y)\in R^{n+1}$ $f_i(x)=y$ $f_i(x)-y=0$ = il numero massimo possibile di schemi zero di una sequenza di polinomi di grado al massimo . Quindi, . La dimensionediVapnik-Chervonenkisdeipolinomidi grado è . $Z(m)$ $m$ $d$ $0\leq Z(m)\leq 2^m$ $d$ $VC(n,d) := \max\{m\colon Z(m)=2^m\}$

Nota: di solito, la dimensione VC è definita per una famiglia di insiemi come la cardinalità più grande di un insieme tale che . Per adattarci a questo frame, possiamo associare ad ogni coppia l'insieme di tutti i polinomi di grado per cui ${\cal F}$ $|S|$ $S$ $\{F\cap S\colon F\in{\cal F}\}=2^S$ $(x,y)\in R^{n+1}$ $F_{x,y}$ $f$ $\leq d$ tiene. Quindi la dimensione VC della famiglia di tutti questi insiemi è esattamente . $f(x)=y$ ${\cal F}$ $F_{x,y}$ $VC(n,d)$

Un limite superiore banale su è (abbiamo bisogno di almeno vettori distinti per avere tutti i modelli di possibili), ma è inutile in semirimorchi infiniti. Per avere buoni limiti superiori sulla dimensione VC, abbiamo bisogno di buoni limiti superiori su . Sui campi sono noti tali limiti. $m=VC(n,d)$ $m\leq n\log |R|$ $2^m$ $x\in R^n$ $2^m$ $Z(m)$

Teorema 1: Su ogni campo , abbiamo $R$ $Z(m)\leq \binom{md+n}{n}$ .

Simili limiti superiori erano stati precedentemente dimostrati da Milnor , Heintz e Warren ; le loro prove usano tecniche pesanti dalla vera geometria algebrica. Al contrario, una dimostrazione di mezza pagina del Teorema 1 di Ronyai, Babai e Ganapathy (che diamo di seguito) è una semplice applicazione dell'algebra lineare.

Cercando piccole 'soddisfacenti $m$ $\binom{md+n}{n} < 2^m$ , we obtain that $VC(n,d)=O(n\log d)$ holds over any field. In view of the $\mathbf{BPP}$ vs. $\mathbf{P}$ / $\mathrm{poly}$ , important here is that the dimension is only logarithmic in the degree $d$ . This is important because circuits of polynomial size can compute polynomials of exponential degree, and because a result of Haussler in PAC learning (Corollary 2 on page 114 of questo documento ) fornisce quanto segue (laddove ipotizziamo che i circuiti deterministici possano utilizzare il voto di maggioranza per produrre i loro valori).

Teorema 2: vale per i circuiti su qualsiasi semiringa , dove è solo polinomiale in e . $\mathbf{BPP}\subseteq \mathbf{P}/\mathrm{poly}$ $R$ $VC(n,d)$ $n$ $\log d$

Vedi qui su come il risultato di Haussler implica il Teorema 2.

In particular, by Theorem 1, $\mathbf{BPP}\subseteq \mathbf{P}/\mathrm{poly}$ holds over any field. (Interesting is here only the case of infinite fields: for finite ones, much simpler arguments work: Chernoff bound then does the work.) But what about (infinite) semirings that are not fields, or even not rings? Motivated by dynamic programming, I am mainly interested in tropical $(\max,+)$ and $(\min,+)$ semirings, but other "non-field" (infinite) semirings are interesting as well. Note that, over the $(\max,+)$ semiring, a polynomial $f(x)=\sum_{a\in A} c_a\prod_{i=1}^n x_i^{a_i}$ with $A\subseteq\mathbb{N}$ and $c_a\in \mathbb{R}$ , turns into the maximization problem $f(x)=\max_{a\in A}\ \{c_a+a_1x_1+a_2x_2+\cdots+a_nx_n\}$ ; the degree of $f$ is (as customary) the maximum of $a_1+\cdots+a_n$ over all $a\in A$ .

Question : Is the VC dimension of degree $\leq d$ polynomials over tropical semirings polynomial in $n\log d$ ?

I admit, this can be a rather hard question to expect a quick answer: tropical algebra is rather "crazy". But perhaps somebody has some ideas on why (if any) tropical polynomials could produce more zero-patterns than real polynomials? Or why they "shouldn't"? Or some related references.

Or, perhaps, the proof of Babai, Ronyai, and Ganapathy (below) can be somehow "twisted" to work over tropical semirings? Or over any other infinite semirings (which are not fields)?

Proof of Theorem 1: Assume that a sequence $(f_1,\ldots,f_m)$ has $p$ different zero-patterns, and let $v_1,\ldots,v_p\in R^n$ be witnesses to these zero-patterns. Let $S_i=\{k\colon f_k(v_i)\neq 0\}$ be a zero-pattern witnessed by the $i$ -th vector $v_i$ , and consider the polynomials $g_i:=\prod_{k\in S_i}f_k$ . We claim that these polynomials are linearly independent over our field. This claim completes the proof of the theorem since each $g_i$ has degree at most $D:=md$ , and the dimension of the space of polynomials of degree at most $D$ is $\binom{n+D}{D}$ . To prove the claim, it is enough to note that $g_i(v_j)\neq 0$ if and only if $S_i\subseteq S_j$ . Suppose contrariwise that a nontrivial linear relation $\lambda_1 g_i(x)+\cdots+\lambda_p g_p(x)=0$ exists. Let $j$ be a subscript such that $|S_j|$ is minimal among the $S_i$ with $\lambda_i\neq 0$ . Substitute $v_j$ in the relation. While $\lambda_jg_j(v_j)\neq 0$ , we have $\lambda_ig_i(v_j)=0$ for all $i\neq j$ , a contradiction. $\Box$

— Stasys
fonte

I've realized that the answer to my question is - yes: the VC dimension of degree $\leq d$ polynomials on $n$ variables over any tropical semiring is at most a constant times $n^2\log(n+d)$ . This can be shown using Theorem 1 above. See here for details. So, BPP $\subseteq$ P/poly holds also for tropical circuits and, hence, also for "pure" dynamic programming algorithms.

N.B. (added 25.06.2019) In the mean time, I've resolved the problem completely in this paper. In such a generality, which I haven't even dreamed at the beginning. Tropical case is here just a very, very special case. And even more curiously: by just an appropriate combination of already know (deep in any respect) results of other authors.

What remains else to do in this (BPP vs. P/poly) direction? Besides the decrease of the size of resulting deterministic circuits (an interesting question in itself).

— Stasys
fonte