<h1 id="preprint">Preprint of Optical Illusions Paper</h1>
<p><em>2018-10-01</em></p>
<p>With the help of Dr. Yampolskiy, I put the information in the series on optical illusions together
into an academic paper, which can be found here: <a href="https://arxiv.org/abs/1810.00415">https://arxiv.org/abs/1810.00415</a></p>
<p>The main goal of creating the dataset is more or less complete, but as it stands it is not super
useful and could use some reorganization.</p>
<p>I sent the paper to Luke Miles to see what he thought (after submitting to arXiv, unfortunately)
and he gave the following feedback:</p>
<p>Unrequested feedback on Max’s paper:</p>
<ul>
<li>That was actually a good read</li>
<li>The 1 footnote on Max’s name is never resolved</li>
<li>First couple sentences of abstract are not relevant</li>
<li>Backwards quotes?</li>
<li>Italic author names?</li>
<li>Is confusion matrix test or train?</li>
<li>Column labels on confusion matrix?</li>
<li>For consistency, section 4.2 should be called “Generative Adversarial Network Results”</li>
<li>In section 5, clarify that the vision system of interest is the eyes of
<em>predators</em>. Even if butterflies couldn’t see, they would evolve fake eyes
to fool bluejays. Feedback is via the killing of butterflies (by owls) who
wear weak illusions.</li>
<li>Good choice to host the images on floydhub; I had like 8MB/S download speed.
Maybe clarify that you can download them for free without an account, because
floydhub may try to get them to sign up first (like on dropbox)</li>
</ul>
<p>I didn’t realize LaTeX required specifying quote direction, so all of my quotes are backwards! How
embarrassing. I won’t re-upload for cosmetic changes because that’s against arXiv’s archiving
philosophy, but when I do make another version I will incorporate these changes.</p>
<h1 id="download-time">Data Download and Initial Neural Network Training Results</h1>
<p><em>2018-04-10</em></p>
<p>I downloaded and cleaned all of the files from <a href="https://www.moillusions.com/4-dots-illusion/">Mighty Optical Illusions</a> and <a href="http://viperlib.york.ac.uk/">ViperLib</a> into JPEG (.jpg) images. They are available for download from <a href="https://www.floydhub.com/robertmax/datasets/illusions-jpg">https://www.floydhub.com/robertmax/datasets/illusions-jpg</a> and the source files and build process can be found at <a href="https://github.com/robertmaxwilliams/optical-illusion-dataset">https://github.com/robertmaxwilliams/optical-illusion-dataset</a>.</p>
<p>A greatly reduced version of around 500 images can be found in <a href="https://www.floydhub.com/robertmax/datasets/illusions-filtered">this floydhub dataset</a>. Training a GAN on these might yield better results than the last attempt with the full dataset (see below).</p>
<h3 id="classifier">Classifier</h3>
<p>I trained the “<a href="https://www.tensorflow.org/tutorials/image_retraining">bottleneck</a>” of an image classification model, taken from the TensorFlow models repo, on the moillusions images. I used a smaller subset of the categories: those that had at least 20 images and seemed relevant to the goals of the project.</p>
<p><img src="/images/confusion-illusion.png" alt="better than random" /></p>
<p>It performed much better than random, but not very well. I doubt it learned anything meaningful about the data, only texture and context clues that roughly correlate with the assigned classes. Also, the data is multi-label, but I treated it as single-label by including each image under every one of its given classes. This means that for a multi-label image, the model could at best guess one of its several correct classes. I didn’t account for this in the results, which I really should have.</p>
<p>I need to repeat this study with a proper multi-label model. I want to build it from scratch in Keras or TensorFlow, but will need more computing power to train the full image convolution layers instead of just the dense bottleneck layers, which I trained on my laptop.</p>
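<p>As a concrete starting point, here is a minimal multi-label sketch in Keras. To be clear, this is an illustration rather than the code I ran: the MobileNetV2 base, the 224×224 input size, and the placeholder <code>NUM_CLASSES</code> are all assumptions. The load-bearing choices are the sigmoid outputs and binary crossentropy loss, which let a single image carry several labels at once.</p>

<div class="language-python highlighter-rouge"><div class="highlight"><pre class="highlight"><code>import tensorflow as tf

NUM_CLASSES = 10  # placeholder for the real number of illusion categories

# Frozen convolutional base, standing in for the retrained "bottleneck"
base = tf.keras.applications.MobileNetV2(
    input_shape=(224, 224, 3), include_top=False, pooling="avg")
base.trainable = False  # only the new dense head gets trained

model = tf.keras.Sequential([
    base,
    # sigmoid instead of softmax: each class is an independent yes/no
    tf.keras.layers.Dense(NUM_CLASSES, activation="sigmoid"),
])
model.compile(optimizer="adam",
              loss="binary_crossentropy",  # per-class binary loss for multi-label data
              metrics=["binary_accuracy"])
</code></pre></div></div>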
<h3 id="gan-failure">GAN Failure</h3>
<p>This is the second time I’ve attempted to train a GAN. The first was on Kickstarter images, and led to some strange shapes and textures, but nothing sellable. This time I used the full dataset linked above, and got similar results. It learns… something; it’s hard to say what exactly. One thing I realized in hindsight is that later in training, every image in the batch looks exactly the same. This lack of variety within a batch, the classic sign of mode collapse, indicates some sort of mistake on my part in setting the model parameters, not just a problem with the data.</p>
<p><img src="/images/000001.png" alt="Not good" />
<img src="/images/000263.png" alt="Better" />
<img src="/images/000365.png" alt="Actually, not good at all" />
<strong>training progression of the hyperGAN model on the viperlib images</strong></p>
<p>I only attempted one full training run, and let it go overnight until I used up all of my floydhub credits. Dr. Yampolskiy suggested that I could use the university’s computing resources, which would make it practical to experiment more with these computationally expensive models. With more experimentation and a narrower dataset, I might actually get a model to generate images that create an illusion, instead of the illusion-of-being-colorful-blobs that GAN models are so eager to produce.</p>
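<p>A cheap way to catch this earlier next time is to measure batch diversity directly. The snippet below is my own diagnostic sketch, not part of the original training run: the mean pairwise distance across a generated batch should stay well above zero, and a collapsed batch drives it to roughly zero.</p>

<div class="language-python highlighter-rouge"><div class="highlight"><pre class="highlight"><code>import numpy as np

def batch_diversity(images):
    """Mean pairwise L2 distance between images in a batch of shape (N, H, W, C)."""
    flat = images.reshape(len(images), -1)
    dists = [np.linalg.norm(a - b)
             for i, a in enumerate(flat) for b in flat[i + 1:]]
    return float(np.mean(dists))

# A collapsed batch (eight copies of one image) scores ~0; a varied batch does not.
collapsed = np.tile(np.random.rand(1, 64, 64, 3), (8, 1, 1, 1))
print(batch_diversity(collapsed))                      # ~0.0
print(batch_diversity(np.random.rand(8, 64, 64, 3)))   # clearly positive
</code></pre></div></div>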
<h3 id="whats-next">What’s Next</h3>
<p>I’m working over the summer on a machine learning project (details soon) and should be able to apply my new skills to this project. So far I’ve spent most of my time working with the data, and very little tweaking the actual models. Now that I have a nice clean dataset and two baseline projects, I can improve from here. The data work isn’t totally done; the disparity between viperlib and moillusions is a difficult problem that needs to be resolved thoughtfully. A full human labeling of the dataset would be nice, but I would need to recruit help to make high-quality labeling feasible, and build software to streamline the process.</p>
<h1 id="long-term-universe-outcomes">Long Term Universe Outcomes</h1>
<p><em>2018-02-14</em></p>
<p>The Fermi Paradox is a huge problem for humanity. The simple question, “Where are they?” has no easy answer. Are there really no aliens out there colonizing the universe? Maybe life is unlikely, or maybe no life ever gets off its home planet. Or perhaps alien life is crowding the galaxy but we are too small-minded to see it, or are intentionally blocked off by a higher order. Assuming the observable universe is as cold and dead as it looks, humans have a few options going forward:</p>
<p><strong>Team Human</strong> : Humans make it to control the universe while remaining recognizable and fill the universe with corn and war and video games.</p>
<p><strong>Everyone Dies</strong> : The universe is dead again, but Earth is still likely to breed another smart monkey in a few billion years, if no one else gets to it first. Go back 4 spaces.</p>
<p><strong>Nanotech</strong> : Tiny robots eat everything! They might colonize the universe and even evolve into interesting and complex forms of life 2.0, but are completely alien so we’re not happy about that. Do not pass go, do not collect 200 dollars.</p>
<p><strong>Paperclip maximizer</strong> : AI pursuing a goal eats the universe and maximizes its reward. The fallout might have interesting dynamics that harbor life 2.0 of some sort, but the long term outcome is that the universe is converted to things of little human value.</p>
<p><strong>Human maximizer</strong> : An AI fills the universe with <a href="https://www.goodreads.com/quotes/1413237-consider-an-ai-that-has-hedonism-as-its-final-goal">hedonium</a>, not much better than paperclips.</p>
<p><strong>VR</strong> : We live in virtual reality. Humanity either stays confined to the solar system or fills the universe, effectively hedonium.</p>
<p><strong>Humans 2.0</strong> : We augment ourselves with machines, maybe even become a hivemind. We remain human and fill the universe with beauty and science while remaining in control.</p>
<p><strong>Invent God</strong> : Friendly AI helps us make the universe into our happy but challenging garden and everyone lives happily ever after as their ideal self, but the universe is controlled by a non-human entity.</p>
<p>This list is incomplete, and I am no authority in determining humanity’s long-term future. However, I do think the stakes are very high, and the long-term outcome for the observable universe depends on our actions as a species. Notice that only a few of these outcomes result in humans leaving Earth, and it is up to us to prove ourselves worthy of the prize that is the universe for all of time. Tech arms races point us towards the less attractive items on this list, so let’s try to keep things under control with this “AI arms race” we seem to be on the verge of.</p>
<p>Thanks to <a href="http://lukemiles.org/">Luke Miles</a> for the discussion and the idea to make this list into a blog post.</p>
<h1 id="quine-fun">Fun With (Fun With Quines in Python) Quines In Python</h1>
<p><em>2018-02-09</em></p>
<p>I’ve seen quines before, as well as quines that aren’t really quines, like this dumb bash trick:</p>
<div class="language-bash highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="nb">echo</span> <span class="nv">$BASH_COMMAND</span>
</code></pre></div></div>
<p>However, I couldn’t really get my head around how to write one. I decided to give it a shot using python and see what I could do. I struggled for a bit and went down several tunnels of infinite regression of quotes and escape characters.</p>
<p>I gave in and searched for “python quines” and peeked at the first page for a hint. I noticed that all of the quines used an intermediate variable, so I went back to work with that one piece of information. A few more attempts at infinitely nested quotations later, I came up with this monster:</p>
<div class="language-python highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="n">q</span> <span class="o">=</span> <span class="s">"</span><span class="se">\"</span><span class="s">"</span>
<span class="n">n</span> <span class="o">=</span> <span class="s">"</span><span class="se">\n</span><span class="s">"</span>
<span class="n">s</span> <span class="o">=</span> <span class="s">"</span><span class="se">\\</span><span class="s">"</span>
<span class="n">m1</span> <span class="o">=</span> <span class="s">'q = '</span><span class="o">+</span><span class="n">q</span><span class="o">+</span><span class="n">s</span><span class="o">+</span><span class="n">q</span><span class="o">+</span><span class="n">q</span><span class="o">+</span><span class="n">n</span><span class="o">+</span><span class="s">'n = '</span><span class="o">+</span><span class="n">q</span><span class="o">+</span><span class="n">s</span><span class="o">+</span><span class="s">'n'</span><span class="o">+</span><span class="n">q</span><span class="o">+</span><span class="n">n</span><span class="o">+</span><span class="s">'s = '</span><span class="o">+</span><span class="n">q</span><span class="o">+</span><span class="n">s</span><span class="o">+</span><span class="n">s</span><span class="o">+</span><span class="n">q</span>
<span class="n">m2</span> <span class="o">=</span> <span class="s">"'q = '+q+s+q+q+n+'n = '+q+s+'n'+q+n+'s = '+q+s+s+q"</span>
<span class="n">foo</span> <span class="o">=</span> <span class="s">"foo = ... print(m1 + n + n + 'm1 = ' + m2 + n + 'm2 = ' + q + m2 + q + n + foo[:6] + q + foo + q + n + foo[10:])"</span>
<span class="k">print</span><span class="p">(</span><span class="n">m1</span> <span class="o">+</span> <span class="n">n</span> <span class="o">+</span> <span class="n">n</span> <span class="o">+</span> <span class="s">'m1 = '</span> <span class="o">+</span> <span class="n">m2</span> <span class="o">+</span> <span class="n">n</span> <span class="o">+</span> <span class="s">'m2 = '</span> <span class="o">+</span> <span class="n">q</span> <span class="o">+</span> <span class="n">m2</span> <span class="o">+</span> <span class="n">q</span> <span class="o">+</span> <span class="n">n</span> <span class="o">+</span> <span class="n">foo</span><span class="p">[:</span><span class="mi">6</span><span class="p">]</span> <span class="o">+</span> <span class="n">q</span> <span class="o">+</span> <span class="n">foo</span> <span class="o">+</span> <span class="n">q</span> <span class="o">+</span> <span class="n">n</span> <span class="o">+</span> <span class="n">foo</span><span class="p">[</span><span class="mi">10</span><span class="p">:])</span>
</code></pre></div></div>
<p><em>Credit to <a href="https://github.com/qpwo">Luke Miles</a> for help spotting a 10 that should have been an 11, and for motivation.</em></p>
<p>Which I quickly brought down to two lines when I realized what chr() does in python:</p>
<div class="language-python highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="n">foo</span> <span class="o">=</span> <span class="s">"foo = ... print(foo[:6] + chr(34) + foo + chr(34) + chr(10) + foo[10:])"</span>
<span class="k">print</span><span class="p">(</span><span class="n">foo</span><span class="p">[:</span><span class="mi">6</span><span class="p">]</span> <span class="o">+</span> <span class="nb">chr</span><span class="p">(</span><span class="mi">34</span><span class="p">)</span> <span class="o">+</span> <span class="n">foo</span> <span class="o">+</span> <span class="nb">chr</span><span class="p">(</span><span class="mi">34</span><span class="p">)</span> <span class="o">+</span> <span class="nb">chr</span><span class="p">(</span><span class="mi">10</span><span class="p">)</span> <span class="o">+</span> <span class="n">foo</span><span class="p">[</span><span class="mi">10</span><span class="p">:])</span>
</code></pre></div></div>
<p>And the need to run it easily from plaintext motivated a pseudo-oneliner that only uses single quotes:</p>
<div class="language-python highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="n">foo</span> <span class="o">=</span> <span class="s">'foo = ... print(foo[:6] + chr(39) + foo + chr(39) + chr(59) + foo[10:])'</span><span class="p">;</span><span class="k">print</span><span class="p">(</span><span class="n">foo</span><span class="p">[:</span><span class="mi">6</span><span class="p">]</span> <span class="o">+</span> <span class="nb">chr</span><span class="p">(</span><span class="mi">39</span><span class="p">)</span> <span class="o">+</span> <span class="n">foo</span> <span class="o">+</span> <span class="nb">chr</span><span class="p">(</span><span class="mi">39</span><span class="p">)</span> <span class="o">+</span> <span class="nb">chr</span><span class="p">(</span><span class="mi">59</span><span class="p">)</span> <span class="o">+</span> <span class="n">foo</span><span class="p">[</span><span class="mi">10</span><span class="p">:])</span>
</code></pre></div></div>
<p>Which leads finally to a single line that can be run straight from the command line as many times as you want:</p>
<div class="language-python highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="n">python</span> <span class="o">-</span><span class="n">c</span> <span class="s">"foo = 'python -c foo = ... print(foo[:10] + chr(34) + foo[10:16] + chr(39) + foo + chr(39) + chr(59) + foo[20:] + chr(34))';print(foo[:10] + chr(34) + foo[10:16] + chr(39) + foo + chr(39) + chr(59) + foo[20:] + chr(34))"</span>
</code></pre></div></div>
<p>So what is going on, and why are these possible? Are they possible in every programming language that is Turing complete and has text output? The answer is yes, and I have no idea why. According to Wikipedia it has something to do with a Mr. Kleene and his recursion theorem, where a quine would be considered a fixed point of the function that is the interpreter. I’ll get back with you when I understand how it can be proved that fixed points always exist.</p>
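<p>As an aside, the shortest well-known Python quine (not one of mine) uses <code>%r</code> formatting to do the intermediate-variable trick in a single line:</p>

<div class="language-python highlighter-rouge"><div class="highlight"><pre class="highlight"><code># the canonical short Python quine
s = 's = %r; print(s %% s)'; print(s % s)
</code></pre></div></div>

<p>Here <code>%r</code> substitutes the repr of the string, quotes included, and <code>%%</code> collapses to a literal <code>%</code>, so the printed line is an exact copy of the source.</p>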
<p>Until then, here is a quine that also includes my favorite python error message:</p>
<div class="language-python highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="kn">from</span> <span class="nn">sys</span> <span class="kn">import</span> <span class="n">setrecursionlimit</span><span class="p">,</span> <span class="n">stdout</span>
<span class="k">def</span> <span class="nf">foo</span><span class="p">(</span><span class="n">x</span><span class="p">):</span>
<span class="n">setrecursionlimit</span><span class="p">(</span><span class="n">x</span><span class="p">)</span>
<span class="n">foo</span><span class="p">(</span><span class="n">x</span><span class="o">+</span><span class="mi">1000</span><span class="p">)</span>
<span class="n">bar</span> <span class="o">=</span> <span class="s">"""from sys import setrecursionlimit, stdout
def foo(x):
setrecursionlimit(x)
foo(x+1000)
bar = ...
print(bar[:97]+chr(34)*3+bar+chr(34)*3+chr(10)*2+bar[101:]+chr(10)+chr(35), end=str())
stdout.flush()
foo(42)"""</span>
<span class="k">print</span><span class="p">(</span><span class="n">bar</span><span class="p">[:</span><span class="mi">97</span><span class="p">]</span><span class="o">+</span><span class="nb">chr</span><span class="p">(</span><span class="mi">34</span><span class="p">)</span><span class="o">*</span><span class="mi">3</span><span class="o">+</span><span class="n">bar</span><span class="o">+</span><span class="nb">chr</span><span class="p">(</span><span class="mi">34</span><span class="p">)</span><span class="o">*</span><span class="mi">3</span><span class="o">+</span><span class="nb">chr</span><span class="p">(</span><span class="mi">10</span><span class="p">)</span><span class="o">*</span><span class="mi">2</span><span class="o">+</span><span class="n">bar</span><span class="p">[</span><span class="mi">101</span><span class="p">:]</span><span class="o">+</span><span class="nb">chr</span><span class="p">(</span><span class="mi">10</span><span class="p">)</span><span class="o">+</span><span class="nb">chr</span><span class="p">(</span><span class="mi">35</span><span class="p">),</span> <span class="n">end</span><span class="o">=</span><span class="nb">str</span><span class="p">())</span>
<span class="n">stdout</span><span class="o">.</span><span class="n">flush</span><span class="p">()</span>
<span class="n">foo</span><span class="p">(</span><span class="mi">42</span><span class="p">)</span>
<span class="c">#Segmentation fault (core dumped)</span>
</code></pre></div></div>
<p>The program prints to the screen easily, but capturing the output in a file is more difficult. Using bash redirects and some other <a href="https://stackoverflow.com/questions/23954607/redirection-of-a-out-is-not-capturing-segmentation-fault">wizard</a> <a href="https://askubuntu.com/questions/420981/how-do-i-save-terminal-output-to-a-file">stuff</a> I made this command to save the output to a file:</p>
<div class="language-bash highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="o">{</span> python3 quinefault.py <span class="o">></span> quinefault2.py<span class="p">;</span> <span class="o">}</span> 2>> quinefault2.py
</code></pre></div></div>
<h2 id="i-dont-understand-fixed-points">I Don’t Understand Fixed Points</h2>
<p>Do hash algorithms have fixed points? Encryption? What about f(x) = x+1? It seems like many computable functions don’t have an x where f(x) = x. Maybe I am misunderstanding what is meant by a computable function.</p>
<h2 id="taking-the-whole-thing-too-far">Taking the Whole Thing Too Far</h2>
<p>Using a compiler/interpreter/etc. as a function and finding a fixed point in it is neat and all, but what about neural networks? Can a neural net output its own weights? Most neural nets used today are capable of approximating universal computation, so shouldn’t they be able to take in a zero vector and output the values of their weights? Using a plain feedforward network would be impossible because the output layer would make the net too large and the whole thing can’t fit inside itself. A recurrent network or some sort of external attention mechanism would be needed. I think such a system would count as a quine. Even though it is supported by a great deal of external equipment, that shouldn’t disqualify it. A python quine doesn’t need to output python’s source, and a neural quine doesn’t need to output its own textual source code. Not saying that it couldn’t…</p>
<h1 id="data-collection">Data Collection Results</h1>
<p><em>2018-02-04</em></p>
<p>None of the website owners replied to my emails, so I collected the images myself. All of the content on these sites was collected from other sources, so I don’t think there is any issue with copyright. I have collected all of the image URLs and some metadata for every illusion image on both websites, and some percentage of non-illusion images.</p>
<h2 id="mighty-optical-illusions"><a href="https://www.moillusions.com/4-dots-illusion/">Mighty Optical Illusions</a></h2>
<p>To obtain images, I started at <a href="https://www.moillusions.com/one-two-face-illusion/">https://www.moillusions.com/one-two-face-illusion/</a>. I used Python’s Beautiful Soup library to copy all image links and then follow the “previous” button to the next page, repeating until I was out of pages. This leads to a grand total of <strong>6436 images</strong>. It’s hard to know how many are duds, but that is a good start.</p>
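<p>A minimal sketch of that crawl looks something like the following. The <code>rel="prev"</code> selector for the previous-post link is a guess at the site’s markup rather than something taken from my actual script; the shape of the loop is the point.</p>

<div class="language-python highlighter-rouge"><div class="highlight"><pre class="highlight"><code>import requests
from bs4 import BeautifulSoup

url = "https://www.moillusions.com/one-two-face-illusion/"
image_urls = []
while url:
    soup = BeautifulSoup(requests.get(url).text, "html.parser")
    # grab every image link on the current post
    image_urls += [img["src"] for img in soup.find_all("img") if img.get("src")]
    # follow the "previous post" link, if there is one (selector is a guess)
    prev = soup.find("a", attrs={"rel": "prev"})
    url = prev["href"] if prev else None

print(len(image_urls), "image URLs collected")
</code></pre></div></div>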
<h2 id="viperlib"><a href="http://viperlib.york.ac.uk/">ViperLib</a></h2>
<p>Here I changed the page number in the URL and scraped all of the relevant areas on the site. I was able to get <strong>1454</strong> images. The descriptions might end up being useful for weeding out some of the illusions that require video.</p>
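<p>The paging amounts to a URL template; the template below is a made-up placeholder, since I am not reproducing the exact ViperLib URL scheme here.</p>

<div class="language-python highlighter-rouge"><div class="highlight"><pre class="highlight"><code># hypothetical URL template: only the page-number substitution matters
page_urls = ["http://viperlib.york.ac.uk/browse?page={}".format(page)
             for page in range(1, 100)]
# each page would then be fetched and parsed as in the loop above
</code></pre></div></div>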
<h2 id="reddit">Reddit</h2>
<p>On closer inspection, there are only a hundred or so good illusions here. I do not plan to download them.</p>
<p>I think between these two sites I have about as many illusion images as I’ll get. I have asked <a href="http://illusionoftheyear.com/">http://illusionoftheyear.com/</a> whether they would share their images for the good of science, but I have not heard back from them. The JSON files and scraping code will be available here: <a href="https://github.com/robertmaxwilliams/optical-illusion-dataset">https://github.com/robertmaxwilliams/optical-illusion-dataset</a>. The next step is to combine the JSON files, actually download all of the images, and find a place to host them.</p>
<h2 id="next-steps">Next Steps</h2>
<p>I was able to obtain <strong>7890</strong> images from my top two websites, and maybe a few thousand more if IllusionsOfTheYear wants to help.
I plan to learn to use a Variational Autoencoder and apply it to this data. It should be able to come up with a good latent space for these images, and then I can cluster them and see what sorts of clusters appear. At that point I can try training a GAN on specific classes; perhaps certain kinds of illusions are easier for a GAN to reproduce than others. I expect that texture-based geometric illusions will be obtainable, but anything with large-scale structure will not be possible without a major revolution in AI or a few orders of magnitude more images.</p>
<h1 id="illusion-dataset">Optical Illusion Datasets</h1>
<p><em>2018-01-15</em></p>
<p>I have been digging around for optical illusion datasets. They don’t seem to exist, so I will be laying the grounds for the creation of one and starting to collect data. I don’t know all that much about copyright, so if anything I describe sounds legally dubious I would really like to know.</p>
<p>Things that need to be done:</p>
<ul>
<li>Find enough data to make a good dataset</li>
<li>Clean it up: convert everything to the same format (png or jpeg?) and naming convention, and organize metadata somehow</li>
<li>Create different resolution versions and cropped versions for easier usage</li>
<li>Make it into an easy-to-use archive and host it on a UofL server or GitHub or something</li>
</ul>
<h2 id="find-enough-data-for-a-good-dataset">Find Enough Data for a Good Dataset</h2>
<h3 id="httpwwwmichaelbachdeot"><a href="http://www.michaelbach.de/ot/">http://www.michaelbach.de/ot/</a></h3>
<p>★★★★☆</p>
<p>This website is really amazing, but many of his illusions require video to work. Current ML research on moving pictures is limited, so including videos would likely be a waste of time at the current state of the art. If we exclude videos, this website only has a few dozen images, possibly worth collecting if we need them.</p>
<h3 id="httpswwwmoillusionscom"><a href="https://www.moillusions.com/">https://www.moillusions.com/</a></h3>
<p>★★★★★</p>
<p>The bottom bar shows 414 pages, and each page has 8 posts, each of which appears to be a short article with a high-quality illusion image. Not only does this mean we have 3312 images, we also have textual metadata and categories! This is really something. Probably not enough to train a GAN, but maybe some revolution will bring <a href="https://arxiv.org/pdf/1710.09829.pdf">capsule nets</a> to general image generation. Besides, not having enough data is no reason not to try anyway and see what sorts of horrible blobs it generates.
If this is the be-all end-all source of data, it should be enough to train a classifier, and then we can inspect it to see what it believes makes illusions what they are, and perhaps find some new insights into human vision.</p>
<h3 id="httpwww-bcsmitedugazpublicationsgazzandirgazzanhtm"><a href="http://www-bcs.mit.edu/gaz/publications/gazzan.dir/gazzan.htm">http://www-bcs.mit.edu/gaz/publications/gazzan.dir/gazzan.htm</a></h3>
<p>★★★☆☆</p>
<p>This article explains various illusions where the same color looks different in different contexts.</p>
<h3 id="httpwwwcfarumdeduferopticalsmoothing4html"><a href="http://www.cfar.umd.edu/~fer/optical/smoothing4.html">http://www.cfar.umd.edu/~fer/optical/smoothing4.html</a></h3>
<p>★★☆☆☆</p>
<p>This page explains the wavy grid illusion. Not directly useful but certainly interesting.</p>
<h3 id="httpwwwcvrlorg"><a href="http://www.cvrl.org/">http://www.cvrl.org/</a></h3>
<p>★☆☆☆☆</p>
<p>Here is some sort of scientific database, and a few low-power illusion images.</p>
<h3 id="httpwwwhandprintcomlscvscolorhtml"><a href="http://www.handprint.com/LS/CVS/color.html">http://www.handprint.com/LS/CVS/color.html</a></h3>
<p>★☆☆☆☆</p>
<p>Not super relevant but a degree’s worth of information about color and human perception. Maybe the authors would be of assistance.</p>
<h3 id="httpillusionoftheyearcom"><a href="http://illusionoftheyear.com/">http://illusionoftheyear.com/</a></h3>
<p>★★★★★</p>
<p>This contest collects and shares a large number of images: 20 this year, and they’ve been at it for 12 years. 240 images is pretty significant, but that’s just the top rated. They almost certainly have a huge stash of submissions; now just to get in contact with them and see if they are willing to share.</p>
<h3 id="httpwwwcogsciucieduddhoffillusionshtml"><a href="http://www.cogsci.uci.edu/~ddhoff/illusions.html">http://www.cogsci.uci.edu/~ddhoff/illusions.html</a></h3>
<p>★★★☆☆</p>
<p>This is 30ish images, at least half of which are static.</p>
<h3 id="httpmathworldwolframcomtopicsillusionshtml"><a href="http://mathworld.wolfram.com/topics/Illusions.html">http://mathworld.wolfram.com/topics/Illusions.html</a></h3>
<p>★★★☆☆</p>
<p>These are probably worth downloading.</p>
<h3 id="httpwwwsandlotsciencecom"><a href="http://www.sandlotscience.com/">http://www.sandlotscience.com/</a></h3>
<p>★★☆☆☆</p>
<p>These folks are selling a book that claims to have “200 optical illusions.” I doubt they would be willing to just hand them all over, but it’s worth a shot. It’s not like having their images in a zip file out there for deep learning is going to hurt their sales.</p>
<h3 id="httpviperlibyorkacuk"><a href="http://viperlib.york.ac.uk/">http://viperlib.york.ac.uk/</a></h3>
<p>★★★★★</p>
<p>Oh shoot, they have 1,860 images on file. I’ll have to look through and see what is usable for this research, but that is more than all of the other sources combined.</p>
<h3 id="httpswwwredditcomropticalillusionstopsorttoptall"><a href="https://www.reddit.com/r/opticalillusions/top/?sort=top&t=all">https://www.reddit.com/r/opticalillusions/top/?sort=top&t=all</a></h3>
<p>★★★★☆</p>
<p>I’ve scraped reddit before for my meme-deepfryer based on cyclegan, so this should be a pretty easy way to collect many high-quality images. It goes on for at least a few hundred posts, so it is worth it. Quality is lower and there are many photographic images, which sets it apart from the other sources.</p>
<p>It seems that we will be able to get at most 1,000 to 2,000 images, which is alright. I don’t think GANs will be able to do anything useful, but if I knew for sure I wouldn’t be doing this. This dataset should be extremely valuable and development will continue full throttle.</p>
<h2 id="notes-on-image-content">Notes on Image Content</h2>
<p>Many images have logos or text. This might confuse the GAN, like how the cats from the GAN paper had illegible but English-looking white Impact-font text over them, due to the large amount of Impact-font English text in the dataset. Thankfully, the images don’t seem to have watermarks, which would not only confuse the network but would also make the illusions less powerful. Most images are non-photographic, and maybe 10% of the reddit images are photographs. A great deal of sorting by hand could categorize them, but I think using a deep learning technique would be sufficient. But that is for the next post, about <strong>what to do with all this data</strong>.</p>
<h1 id="adversarial-examples">Adversarial Examples for Human Vision</h1>
<p><em>2018-01-09</em></p>
<p>Adversarial examples are very revealing about a neural net’s inner workings and weaknesses. <a href="https://blog.openai.com/adversarial-example-research/">This wonderful post</a> by OpenAI discusses the security implications of adversarial examples, and <a href="https://arxiv.org/abs/1712.09665">this arXiv paper</a> demonstrates extremely robust “adversarial patches” that work even on networks that were not used during their design. With adversarial example generation reaching this level of sophistication, the question arises of how immune the human vision system is to similar attacks, and what we can learn from attempting to generate adversarial examples for human vision.</p>
<h2 id="optical-illusions-as-adversarial-examples">Optical Illusions as Adversarial Examples</h2>
<p>Optical illusions are patterns that, when observed by the human eye, create false impressions of non-existent stimuli. Examples of especially powerful illusions are <a href="http://www.michaelbach.de/ot/ang-SkyeGrating/index.html">Skye’s Oblique Grating</a>, which makes parallel straight lines appear tilted, and the <a href="http://www.michaelbach.de/ot/lum-scGrid/index.html">Scintillating Grid</a>, which causes black dots to appear anywhere you are not looking directly. This can be seen as a phenomenon related to the misclassifications neural networks make when observing adversarial examples. These patterns are painstakingly created by human artists, and developing a new kind of pattern (as opposed to a new instance of a known pattern) requires incredible skill and luck, especially given the large number of existing patterns.</p>
<p><a href="https://en.wikipedia.org/wiki/Disruptive_coloration">Disruptive coloration</a> is another kind of optical illusion, but created by nature through evolution. Illusions of this type are more organic and generalize to nearly anything with a vision system, perhaps even to machine learning based systems. They are created through evolution with incredible amounts of trial and error on extremely complex environments and agents, on a scale not reproducible in simulation.</p>
<h2 id="generative-model-for-human-adversarial-examples">Generative Model for Human Adversarial Examples</h2>
<p><a href="http://research.nvidia.com/publication/2017-10_Progressive-Growing-of">Recent work</a> on generative adversarial networks (GANs) has shown that high-resolution images of faces can be created using a large dataset of 30,000 images. Images of this size and quality are not available for optical illusions; naively applying those methods would likely yield a model that is extremely overfit or generates nothing of value. Any attempt at pre-training on general images would also be fruitless, as optical illusions are usually non-photographic and exist outside the space of common visual stimuli. The number of static optical illusion images is likely in the low thousands, and the number of unique kinds of illusions is certainly very low, perhaps even less than one hundred. Creating a model capable of learning from such a small and limited dataset would represent a huge leap in generative models and in our understanding of human vision.</p>
<h2 id="human-in-the-loop">Human in the Loop</h2>
<p>Both artistic designers of illusion images and the glacial process of evolution have access to active vision systems to verify their work against. An illusion artist can make an attempt at creating an illusion, observe its effect on their eyes, and add or remove elements to try to create a more powerful illusion. In an evolutionary process, every agent has a physical appearance and a vision system, allowing for patterns to be verified in their environment constantly. A GAN trained on existing illusions would have none of these advantages, and would be just as likely to create non-illusions as it is to create a novel class of images.</p>
<p>To improve the model beyond the existing data, its outputs can be classified by hand and fed back into the network. I am not sure if this form of dataset expansion has been used with generative models before.</p>
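<p>Sketched as code, the loop might look like this. Every name here is a hypothetical stand-in, since none of this is built yet; the point is just the shape of the feedback cycle.</p>

<div class="language-python highlighter-rouge"><div class="highlight"><pre class="highlight"><code>import random

def sample_from_gan(n):
    """Stand-in for the trained generator; pretend these lists are images."""
    return [[random.random() for _ in range(64 * 64)] for _ in range(n)]

def human_says_illusion(image):
    """Stand-in for the hand-classification step described above."""
    return input("Does this look like an illusion? (y/n): ").strip() == "y"

dataset = []  # would start as the scraped illusion images
for _ in range(5):  # a few rounds of human-in-the-loop expansion
    candidates = sample_from_gan(8)
    dataset += [img for img in candidates if human_says_illusion(img)]
    # ...retrain the GAN on the expanded dataset here...
</code></pre></div></div>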
<h2 id="dataset">Dataset</h2>
<p>In coming blog posts, I will search out sources of illusion images and give further consideration to how to approach this problem.</p>