Background
The hypothesis here is that intelligent systems use predictive modeling of agent behavior in a "virtual" environment, and the resulting actions are then sent to the effectors to test the behavior. Models increase in granularity and predictive ability the longer they accumulate observations and the more simulations they run. The internal modeling and simulation system, however, appears to be one of the most interesting aspects here, in that these stochastic simulation generators are constantly running and providing policy outputs.
This approach claims that some minimum set of elements [sensor, modeler, effector] of an intelligent system [human/octopus/tree/etc...] is required to generate the functions or capabilities observed that would be considered "intelligent", and rejects perspectives that suggest a decomposition [brain in a vat] could be considered intelligent. Further, this approach does not intend to model biological systems; rather, it intends to build systems with equivalent objective performance, irrespective of the means by which that performance is achieved.
This system, if effective, will not answer questions of what "consciousness", "soul", or "ethics" are, but it would likely inform such concepts by demonstrating material origins.
The biological inspiration here is a thought experiment: if you sliced the spinal cord, the optic nerve, or any nerve within the PNS or CNS in half and just measured the voltage/information coming out, it would be a continuous stream with a largely uniform process throughout, though certainly with significant variation in density/amount etc... This leads to my general assumption that the system should be uniform in mathematical composition, not a hybrid approach (despite SOTA results of hybrid systems) such as the MCTS + DL combination in AlphaGo.
The goal of such a system would be to be predictive about the future state of any input streams, within the set of data stream constraints. E.g. the predictions would only be relevant for the same streams that are coming into the system. The prediction should be generalizable in the sense that modeling and prediction can take as input anything that can be serialized. It is unknown whether the prediction of a stream is transferable to different inputs after training, however there is a bootstrapping function that could likely take advantage of previous stream characteristics. In theory, the more streams that are input, the better any individual stream prediction would be, but this has not yet been verified.
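As a rough illustration of "anything that can be serialized" becoming a uniform stream element, here is a minimal Python sketch. The `Event` wrapper, its field names, and the use of JSON serialization are assumptions made for illustration only, not part of the proposed system.

```python
import json
import time
from dataclasses import dataclass

# Hypothetical minimal event wrapper: any serializable observation
# (here serialized via JSON) becomes a uniform element of an input stream.
@dataclass
class Event:
    stream_id: str      # which sensor/stream produced the observation
    timestamp: float    # when the observation arrived
    payload: str        # serialized observation, format-agnostic

def to_event(stream_id: str, observation) -> Event:
    """Coerce an arbitrary serializable observation into a stream event."""
    return Event(stream_id=stream_id,
                 timestamp=time.time(),
                 payload=json.dumps(observation))

# Usage: a light-sensor reading and a text token enter the same stream format.
events = [
    to_event("light_sensor_0", {"x": 0.41, "y": 0.22, "z": 1.3}),
    to_event("text_in", "block"),
]
```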
Attributes of an idealized stream inference system:
- Provide synthetic events of some count {n} beyond the last observed event {x0}, with likelihood estimates {ln}, for any variable in a multivariate stream (a minimal interface sketch follows this list)
- A "model" {m} is the combined synthetic event prediction likelihood weights that can be transported, translated and compared with similar input types with some variability of input (language)
- There is an integrated testing process which includes effectors
- Can be scaled to trillions of streams
- Event streams can be any voltage generating signal
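A minimal sketch of what such an interface could look like, assuming one model per stream. The class and method names (`StreamModel`, `observe`, `predict`, `export_model`) and the toy update/decay rules are hypothetical; they only mirror the attributes listed above.

```python
import random
from dataclasses import dataclass, field

@dataclass
class SyntheticEvent:
    t_offset: int       # steps beyond the last observed event {x0}
    value: float        # predicted value for one variable of the stream
    likelihood: float   # likelihood estimate {ln} in [0, 1]

@dataclass
class StreamModel:
    """Hypothetical per-stream model {m}: prediction weights that can be
    exported, transported, and compared with models of similar streams."""
    weights: dict = field(default_factory=dict)
    last_observed: float = 0.0

    def observe(self, value: float) -> None:
        # Toy update: track the last observation and a running mean.
        self.last_observed = value
        n = self.weights.get("n", 0) + 1
        mean = self.weights.get("mean", 0.0)
        self.weights["mean"] = mean + (value - mean) / n
        self.weights["n"] = n

    def predict(self, n: int) -> list[SyntheticEvent]:
        # Generate {n} synthetic events beyond {x0}; likelihood decays with
        # the prediction horizon (one of the hypotheses listed below).
        out = []
        for k in range(1, n + 1):
            drift = (self.weights.get("mean", 0.0) - self.last_observed) * 0.1
            out.append(SyntheticEvent(
                t_offset=k,
                value=self.last_observed + k * drift + random.gauss(0, 0.01 * k),
                likelihood=max(0.0, 1.0 - 0.05 * k)))
        return out

    def export_model(self) -> dict:
        # Transportable representation of the model weights.
        return dict(self.weights)
```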
Example of a streaming prediction:
Streaming input > Model Prediction
Sensors, primarily light sensors, stream a 3D representation of this set of falling blocks to the rest of the system. The agent may or may not be paying full attention to this, but if the stream is coming into the receptor, attention will likely be paid to it over other static activities in the environment. Depending on the age of the agent, the model of the system will predict when the blocks will hit the table/ground and also predict (anticipate) that there will be a sound, and roughly what that sound will sound like.
Despite not being able to communicate a precise estimate/prediction of when the blocks would hit the table**, an agent could be expected to give a rough estimate of where the blocks may fall, proven by interceding with some effector that is fast enough to "catch" a falling block by using the future physical state estimate. The agent has some concept of what the next state will be.
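To make "future physical state estimate" concrete, here is a toy stand-in: fitting observed block heights and extrapolating to the time of impact. The proposal above is that a learned stream model would produce this estimate implicitly; the explicit quadratic fit and the sample data below are purely illustrative assumptions.

```python
import numpy as np

# Invented observations of a falling block's height over a short window.
t_obs = np.array([0.00, 0.05, 0.10, 0.15])      # seconds
z_obs = np.array([1.00, 0.988, 0.951, 0.890])   # block height above table (m)
table_height = 0.0

# Fit z(t) = a*t^2 + b*t + c to the observed stream.
a, b, c = np.polyfit(t_obs, z_obs, deg=2)

# Extrapolate: earliest future time where the fitted curve crosses the table,
# i.e. the agent's anticipated moment of impact (and of the accompanying sound).
roots = np.roots([a, b, c - table_height])
t_hit = min(r.real for r in roots if r.real > t_obs[-1] and abs(r.imag) < 1e-9)
print(f"predicted impact at t = {t_hit:.2f} s")
```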
Hypotheses about emergent attributes
- Predictions that are further in the future {tn...} have longer processing strings but will have lower likelihoods
- Likelihood estimates will trend inversely with time {t} (see the sketch after this list)
- Maximum likelihood estimates* {l=1} are largely ignored in larger time value predictions
- Event strings have an upper bound length before they cannot be processed
- Most inputs are dropped out for larger time value predictions
- Sensor dropoff is something like logarithmic
- Some combination of inputs activate certain "stored" or "off the shelf" like responses of effectors
- There is a limit to number of events that can be processed and predictions made
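These hypotheses are stated qualitatively; the sketch below only makes some assumed functional forms explicit (an exponential-like likelihood decay, a roughly logarithmic input dropoff, and ignoring likelihood = 1 inputs at longer horizons). The specific forms and constants are illustrative assumptions, not measurements or claims.

```python
import math

def predicted_likelihood(t: float, l0: float = 0.95, decay: float = 0.3) -> float:
    """Hypothesis: likelihood estimates trend inversely with prediction time t."""
    return l0 * math.exp(-decay * t)

def active_inputs(n_inputs: int, t: float) -> int:
    """Hypothesis: most inputs are dropped for larger t; dropoff roughly logarithmic."""
    return max(1, int(n_inputs / (1.0 + math.log1p(t))))

def relevant_inputs(inputs: dict[str, float], t: float) -> dict[str, float]:
    """Hypothesis: inputs with maximum likelihood (l = 1, e.g. 'gravity')
    are largely ignored for larger time-value predictions."""
    if t > 1.0:
        return {k: l for k, l in inputs.items() if l < 1.0}
    return dict(inputs)

if __name__ == "__main__":
    for t in (0.1, 1.0, 5.0, 20.0):
        print(t, round(predicted_likelihood(t), 3), active_inputs(1000, t))
```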
MISC [*]
- The Lateral Geniculate Body and the Medulla likely serve as the first major processing and routing systems that interpret the raw sensor data from the PNS. My assumption is that the peripheral nervous system compresses, smooths or pre-processes peripheral sensor/nerve data before it reaches the brain. Conceptual distinctions between the brain, spinal column and peripheral nervous system are, in my view, artificial in the context of defining the necessary constituent parts for the holistic functioning of an intelligent agent. Medically, these distinctions are likely extremely instructive for diagnostics, troubleshooting and medicine.
- Combinations of sensor input strings that comprise a model with a likelihood = 1, such as those which would be considered e.g. "gravity", would be largely ignored.
- **Increases in prediction precision over time and at scale should be considered "progress" with respect to engineering practices. However, there should be no expectation that human precision for predictions, for example in anything not using a tool, should have gone up; in some cases that tool may be mathematical, so that a learned person would make a more accurate prediction than someone who has not studied the subject.
- IEEE TRANSACTIONS ON ROBOTICS: TossingBot: Learning to Throw Arbitrary Objects with Residual Physics. This does not actually learn physics as part of a latent representation (e.g. a policy gradient); it inherits physics from a physics engine (nothing wrong with that), however it is not inductively bootstrapping a model of action.
- Event Forecasting with Pattern Markov Chains paper, and Related Github.
- Complex Event Recognition Group
- SPLUNK predictive analytics
- WSO2 Forecasting
- Making sense of sensory input
v0.1 - Updated 1/20/2021
2024 revisit: No further revisions coming - this direction was integrated into my operational learning architecture, which is not public
Copyright (c) 2020 Andrew Kemendo