The Collatz conjecture is a mathematical problem that has eluded resolution. While its rules appear elementary, they manifest intricate behaviors. This phenomenon has been the subject of extensive research by mathematicians for many years.
The present visualization is an examination of the concept through the lens of a novel algebraic framework, named "The Tuple-based Transform".
The amphora metaphor is an apt analogy: the container is compared to ancient Greek vessels that were used to store valuable items. Conversely, this mathematical container encapsulates the rules that govern the Collatz behavior.
The Collatz Amphora visualization provides an intuitive representation of our fundamental algebraic results: cycle uniqueness, parameter repetition necessity and guaranteed convergence. While previous approaches relied on statistical methods or partial bounds, our framework demonstrates these properties through direct algebraic proof.
The 3D visualization simply makes these abstract structural constraints tangible and comprehensible.
This proof of concept contains a concise synopsis of the ideas outlined in our article, which will incorporate the forthcoming corrections and enhancements in the subsequent version.
The Collatz conjecture, alternatively referred to as the \(3n+1\) problem, stipulates that for any positive integer a greater than \(0\), a will eventually reach \(1\) when applying the function \(f(n)\).
For instance, upon considering the sequence of numbers \(\textbf{n=9}\), \(\textbf{n=28}\), \(\textbf{n=37}\), \(\textbf{n=86}\) and one additional iteration on the trivial cycle, the following sequences are obtained:
The tuple-based transform is a computational technique that facilitates the conversion of Collatz sequences into structured tuples, thereby enabling a more profound mathematical analysis. This transformation establishes a complete isomorphy between the original dynamic problem and a static algebraic structure.
The following properties are fundamental to this representation:
The numbers \(\textbf{n=9}\), \(\textbf{n=28}\), \(\textbf{n=37}\) and \(\textbf{n=86}\) are represented using the simplified Tuple-based Transform as:
This algebraic framework transforms the conjecture from a dynamical systems problem into a static structural analysis, where convergence properties become algebraic invariants that can be systematically studied.
The Tuple-based Transform demonstrates the uniqueness of the trivial cycle, represented by the sequence of numbers \(\{4, 2, 1\}\), due to the fact that it is the sole instance in which consecutive multiplicity values, denoted by \(m\), can be identical.
The algebraic proof of the unique multiplicity property is as follows:
The valid consecutive values \(m_{i} = 0 = m_{i+1}\) can be reversed in the following manner:
The tuple-based transform establishes a complete proof of the Collatz conjecture through two fundamental results.
For \(m\) to remain bounded away from zero, the proportion of odd numbers must be at least \(0.3869\), but structural constraints limit this proportion to at most \(0.5\).
The aforementioned competing dynamic forces result in the arbitrary approach of zero on repeated occasions. According to the Pigeonhole Principle, repetition becomes inevitable. The following is the sequence of implications that are logically derived from the initial proposition:
In the Tuple-based Transform, the initial repetition of nonconsecutive values, denoted by \(m_r\) and depicting pseudocycles, and the subsequent repetitions on the trivial cycle are highlighted.
A remarkable discovery about the Tuple-based Transform is that, despite the infinite variety of starting values, all Collatz sequences belong to exactly 42 distinct structural classes, which are determined by their first repeated parameter value, \(m_r\).
These \(m_r\) classes represent the complete, finite set
As the Tuple-based Transform can be bidirectionally reversed, these 42 \(m_r\) values can be transformed into 42 pairs aligned with the Collatz sequences (note that the native Collatz sequences do not repeat values).
This fundamental invariance property demonstrates that all sequences with the same value of \(m_r\) exhibit equivalent distances between the first and second occurrences of that parameter.
The constant distance, \(D(m_r)\), remains unaltered irrespective of the initial value, \(n\), thereby delineating deterministic pathways within a dynamic system that appears to be chaotic.
In the interest of the preceding example, it is imperative to note that the sequence of numbers \(\textbf{n=9}\), \(\textbf{n=28}\), \(\textbf{n=37}\) and \(\textbf{n=86}\) belong to the class \(mn_r = 6\) of the Tuple-based Transformation, characterized by a static distance.
In the Collatz plane, the \(m_r=6\) class is represented by \([14,13]\):
a total of \(9\) elements (\(8\) steps) were to be saved, given the fact that the distance between repetitions is constant. As will be shown, it is possible to circumvent the necessity for further computational processes.
In addition to the structural classification provided by these 42 \(m_r\) classes, sequences manifest a natural geometric taxonomy based on the location of their maximum value in relation to parameter repetitions. This process gives rise to three exhaustive types:
The distribution of the data reveals a significant degree of asymmetry. Type A comprises an infinite set representing the vast majority of sequences, while Types B and C form finite exceptional sets of exactly \(1156\) and \(73\) sequences, respectively. The maximum initial values for these sets are \(n=29148\) for Type B and \(n=9088\) for Type C.
This geometric classification is mathematically distinct from structural classes, resulting in a two-dimensional framework where each sequence is uniquely characterized by the pair \((m_r, \text{Type})\).
The presence of precisely 42 values of \(m_r\) is indicative of a fundamental self-limiting mechanism inherent in the dynamics of the Collatz sequence. This finite bound is not arbitrary; rather, it arises from the mathematical impossibility of generating new repeated parameter values beyond a certain threshold. As the initial value, denoted by \(n\), increases, two competing dynamical processes establish a natural boundary:
Therefore, for sufficiently large \(n\), the system attains a critical threshold at which the maximum value \(M(n)\) occurs so early in the trajectory that it prevents the formation of new parameter repetitions. This phenomenon gives rise to an asymptotic separation, which in turn brings about a fundamental transformation in the taxonomic distribution. Type B sequences transition to Type A, characterized by a maximum number of moves prior to the initial repetition \(m_r\).
Concurrently, Type C sequences also adopt an Type A configuration, a transition that occurs once the maximum number of repetitions has been reached. Consequently, while the sets of Type B and C remain finite (with the \(1156\) sequences and the \(73\) sequences, respectively), Type A grows to encompass an infinite set. All sufficiently large starting values inevitably belong to Type A.
Each of the 42 structural classes is defined by a unique transformation sequence that maps the parameter value back to itself: \(\omega(m_r) = m_r\).
These sequences consist of \(T_1\) operations (\(m \mapsto 3m + 1\)) representing odd Collatz steps and \(T_2\) operations (\(m \mapsto \lfloor m/2 \rfloor\)) representing even steps. The order matters critically, since sequences with identical counts of \(T_1\) and \(T_2\) operations but different arrangements produce entirely different self-return values.
Continuing with the numbers \(\textbf{n=9}\), \(\textbf{n=28}\), \(\textbf{n=37}\) and \(\textbf{n=86}\), and using the \(p\) subsequence \([2], 1, 2, 1, 2, 1, 2, 2, [1]\):
It is evident that this scenario must inherently symmetrical in the context of Collatz sequences. Consequently, we need to reverse the values to this environment.
The notion of wormholes emerges naturally from the structural properties of the 42 classes of pseudocycles and their associated 42-dimensional manifolds. It has been established that each pseudocycle delineates a computational shortcut.
The initiation of a wormhole sequence is marked by the arrival of a sequence at an entry point. This precomputed trajectory leads directly to the trivial cycle, denoted by the sequence numbering \(\{4, 2, 1\}\). In a similar manner, the Tuple-based Transform demonstrates the existence of wormholes from the initial occurrence of any repeated parameter \(m_r\), directly to \(m = 0\), thereby delineating the algebraic pathway to convergence that is devoid of intermediate computations.
Given that these wormhole paths are precalculated and stored in both domains, it is possible to determine the remaining steps from entry point to convergence instantaneously. The aforementioned phenomenon engenders "wormhole travel" through the Collatz space. The sequences in question transition into known tunnels, which effectively guide them to final convergence. It is noteworthy that the entirety of the journey is predetermined algebraically, as opposed to being iteratively computed.
The framework represents a paradigm shift in the field of stopping time analysis by introducing a fundamental computational dichotomy. It has been determined that approximately 65% of the initial values of \(n\) are associated with the predictable cases, as these cases include pseudocycles within their respective pathways. The remaining 35% of cases represent the unpredictable instances in which parameter repetition occurs exclusively on the trivial cycle.
The computational savings exhibited significant variation: sequences that employed complete trajectory computation exhibited negligible savings, given the necessity of computing the entire trajectory. Conversely, sequences that initiated with a value equal to an entry point value attained substantial savings of 100%, a result of the immediate prediction capability. The precise reduction in steps depends on the rate at which the wormhole entry point is identified in the sequence trajectory.
Previous approaches to Collatz can be categorized into distinct paradigms, each of which captures only a partial aspect of the problem. Tao's probabilistic model demonstrates that nearly all orbits attain almost bounded values through a sophisticated statistical analysis. However, the model is unable to identify specific exceptional cases. The Krasikov-Lagarias difference inequalities are a set of mathematical inequalities that provide analytical limits, thereby constraining trajectory behavior and establishing growth bounds and density estimates.
Michel's geometric approach is indicative of patterns in residue classes modulo powers of 2, thereby providing insights into the local trajectory structure. The Simons-de Weger cycle analysis is a method of analyzing systems that eliminates potential non-trivial cycles through the implementation of length and value constraints. This process serves to reduce the overall space of possible counterexamples.
The Amphora offers a comprehensive structural visualization of the partial information captured by each approach, thereby unifying these perspectives. In instances where Tao exhibit statistical convergence, the Amphora unveils the finite wormhole network that ensures it. In instances where Krasikov-Lagarias demonstrate the existence of bounds, the Amphora elucidates the topological constraints that give rise to these bounds. In instances where Michel focuses on the identification of local patterns, the Amphora highlights the global structure that underlies these local patterns, thereby demonstrating a comprehensive and interconnected framework.
Rather than competing with these approaches, the Amphora complements them by transforming abstract mathematical constraints into intuitive structural understanding, making the fundamental convergence mechanism visible and comprehensible.
In consideration of the significant interest that \(n=27\) has elicited, we append a few values from its class \(m_r=60\) for the purpose of evaluation.
Should further examples be required for processing, they can be found here. The data set includes all values of \(n\) less than \(2^20\), meticulously classified into designated \(m_r\) classes and A, B and C taxonomies.
The Collatz Amphora is an interactive 3D visualization of the famous Collatz conjecture, presenting it through the lens of the Tuple-based Transform.
Built with Three.js for 3D graphics, featuring:
For additional resources regarding the Collatz conjecture and the Tuple-based Transform, please refer to this repository.
Thanks to Marta for her unconditional love of Classical Antiquity and for her brilliant idea naming "The Collatz Amphora".
Javier Hernandez
For any question or suggestion, please drop me an email at 271314@pm.me
1.0 (August 2025)
CC-BY-NC-SA 4 License - see LICENSE file for details.