Wednesday, December 17, 2014

Winner of U of Utah Ray Tracing class image contest

Yesterday I attended Cem Yuksel's end-of-semester image contest from his ray tracing class.  The images were all very impressive (it's clearly a good class... see the link above), and I found the winner (by Laura Lediaev) so impressive that I asked her if I could post it here.  Here's her description:


This is a fun scene with candy. There are two main components to this scene - the glass teapot, and the candies. I spent over 30 hours creating this teapot practically from scratch. I started with the Bezier patch description, which I used to create a mesh, and went to work duplicating surfaces, shrinking them to create the inner surfaces, doing some boolean work for cutting out holes, then fusing together all the seams vertex by vertex. The candies started out as a single candy prototype which I sculpted starting from a cube. I then created a huge array of candy copies and used a dynamics simulation to drop the candies into one teapot, and onto the ground in front of the other teapot. The backdrop is just a ground with a single wall (a.k.a. an infinite plane). I have two area lights, and an environment image which is creating the beige color of the ground and some interesting reflections. Can you spot the reflection of a tree in the left teapot handle? The challenge with rendering this scene is all the fully specular paths, which are rays that connect the camera to a light while only hitting specular surfaces such as glass or mirrors. The only way to do this using the rendering methods that we learned in the class is brute force path tracing which takes an extraordinary amount of time. The image has roughly 30,000 samples per pixel.





Tuesday, December 16, 2014

Cool website using hex color

A 24-bit RGB triple such as red (255,0,0) is often represented as a hex string because 16^2 = 256, so you only need two hex digits per channel, six digits total.  Recall that the hex digits are (0, 1, 2, 3, 4, 5, 6, 7, 8, 9, A, B, C, D, E, F).  So the color (255, 0, 0) would be (FF,00,00), or as a string FF0000.  Usually people put a "#" at the front by convention to tag it as hex: #FF0000.  Alain Chesnais made me aware of a clever site that uses the fact that times are also 6 digits when seconds are included.  For example, 101236 is 36 seconds after 10:12.  If one interprets that as a hex color, it is valid (note that no pair of digits ever exceeds 59, which read as hex is only 89 out of 255, so the colors are all somewhat dark).  There is a website that makes this more concrete so you can start internalizing hex codes.  The dark ones anyway!  Here's a screenshot.
As it gets closer to the minute rollover you'll get a dark blue.
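If you want to play with the idea yourself, here's a minimal sketch (mine, not from the site) that prints the current local time as one of these hex colors:

#include <cstdio>
#include <ctime>

// Print the current local time HHMMSS as a hex color string,
// e.g. 10:12:36 becomes "#101236".
int main() {
    std::time_t t = std::time(nullptr);
    std::tm local = *std::localtime(&t);
    std::printf("#%02d%02d%02d\n", local.tm_hour, local.tm_min, local.tm_sec);
    return 0;
}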

Monday, December 8, 2014

Empirical confirmation of diffuse ray hack

Benjamin Keinert sent a nice note regarding an earlier post on how to quickly get "lambertianish" rays.  He empirically confirmed it is exactly Lambertian.  Cool, and thanks Benjamin; I always find such demonstrations more convincing than proofs, which can have subtle errors (fine, yes, I am an engineer).  I include his full email with his permission.

I think I found a simple informal "engineer's style proof" that it is indeed lambertian (assuming the sample rejection method yields a uniform distribution on a sphere, which it should).
Instead of using a rejection method I picked the uniform distribution on a sphere using spherical fibonacci point sets [2] and constructed a cosine hemisphere sampling variant of it.
Without the loss of generality it should be sufficient to show that the mapping is lambertian for a single normal  (0,0,1) - given uniformly distributed points on a sphere and rotational invariance.

Sorry, rapid prototyping code, oldschool OpenGL, PHI = (sqrt(5.0)*0.5 + 0.5):

// PDF: 1/(4*PI)
float3 uniformSampleSphereSF(float i, float n) {
    float phi = 2*PI*(i/PHI);
    float cosTheta = 1 - (2*i+1)/n;
    float sinTheta = sqrt(1 - cosTheta*cosTheta);
    return float3(cos(phi)*sinTheta, sin(phi)*sinTheta, cosTheta);
}

// PDF: cos(theta)/PI
float3 cosineSampleHemisphereSF(float i, float n) {
    float phi = 2*PI*(i/PHI);
    float cosTheta = sqrt(1 - (i+0.5)/n);
    float sinTheta = sqrt(1 - cosTheta*cosTheta);
    return float3(cos(phi)*sinTheta, sin(phi)*sinTheta, cosTheta);
}

[...]
void test() {
    [...]
    // Enable additive blending etc.
    [...]
    uint n = 1024;
    glBegin(GL_POINTS);
    for (uint i = 0; i < n; ++i) {
        glColor4f(0,1,0,1); // Green
        float3 p = normalize(uniformSampleSphereSF(i, n) + float3(0,0,1));
        glVertex3fv(&p[0]);

        glColor4f(1,0,0,1); // Red
        float3 q = cosineSampleHemisphereSF(i, n);
        glVertex3fv(&q[0]);
        // Additive blending => Yellow == "good"
    }
    glEnd();
}

This little function results in the attached image (orthogonal projection of cosine distributed points on a hemisphere -> uniformly distributed points on a circle).
With some more effort one can show that normalize(uniformSampleSphereSF(i, n) + float3(0,0,1)) = cosineSampleHemisphereSF(i, n) - instead of using additive blending.


[1] http://psgraphics.blogspot.de/2014/09/random-diffuse-rays.html
[2] Spherical Fibonacci Point Sets for Illumination Integrals, Marques et al.

Complex Christmas Tree Lighting

Here is an example of why most renderings use approximations for complex lighting configurations; as lighting gets more complex, it also gets harder to figure out if it is right!


Tuesday, December 2, 2014

More on lazy ray tracing tricks

I got some questions on yesterday's post.  First, you will need a bunch of samples per pixel (like hundreds).   You can reduce that by jittering, but again, you bought that fancy computer-- watch movies while your sampling progresses.

Second, don't branch.  Once you go to random, you may as well "path" trace so there is no ray tree.  Instead of

color(ray) = R*color(reflected_ray) + (1-R)*color(refracted_ray)

do this:
    if (drand48() < R)
          return color(reflected_ray)
    else
          return  color(refracted_ray)

If you want motion blur, you can add a time to the ray, ray.t = t0 + drand48()*(t1-t0), and add some moving primitives or a moving camera.  For a translating sphere it would be center = p0 + time*(p1-p0), with the time normalized to [0,1].
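A minimal sketch of the motion blur part, assuming a vec3 class with the usual operators, a Ray type that stores a time, and drand48() as above (all names mine):

// Give each viewing ray a random time in [t0, t1).
double rayTime = t0 + drand48() * (t1 - t0);
Ray r(origin, direction, rayTime);

// For a sphere translating from p0 (at time t0) to p1 (at time t1),
// evaluate its center at the ray's time before the usual hit test.
double s = (r.time - t0) / (t1 - t0);   // normalized to [0,1]
vec3 center = p0 + s * (p1 - p0);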

Monday, December 1, 2014

Lazy ray tracing tricks

If you are doing some ray tracing assignments, I have some lazy tricks for making your pictures look better with very little code.  Note these may be slow, but that's what overnight runs are for.

Hollow glass primitive.  This is to get thin glass balls like Turner Whitted's famous image.

Make a sphere with radius R centered at C.  Make another at C with radius -0.9R (or -0.95R or whatever).  The sphere intersection code depends only on R^2, so the negative radius won't matter.  The surface normal is:

N = (hitPoint - C) / radius

The negative radius will make an inward facing normal so the refraction code works out.
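In code the whole trick is that the normal routine divides by the signed radius (a sketch, assuming a vec3 class; names mine):

// Works for both the outer sphere (radius R) and the inner sphere (radius
// -0.95*R) centered at C: the hit test only ever uses radius*radius, and
// dividing by the signed radius flips the normal inward on the inner sphere,
// which is what the refraction code wants.
vec3 sphereNormal(const vec3& hitPoint, const vec3& C, double radius) {
    return (hitPoint - C) / radius;   // unit length since |hitPoint - C| == |radius|
}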


Soft shadows from point sources.

If you have a point source at position P, instead of sending a shadow ray toward P, send one to a random point within radius R of P.  To do this, just use a rejection method: pick random points in the cube [-1,1]^3 until one lands inside the unit sphere, then scale by R:

do {
   x = 2*drand48() - 1;
   y = 2*drand48() - 1;
   z = 2*drand48() - 1;
} while (x*x + y*y + z*z > 1);  // now (x,y,z) is random in the unit sphere
newRandomLightPoint = P + R*vec3(x,y,z);


Fuzzy (glossy) reflections

You have a hitpoint P and a reflected ray direction V.  The "end" of the ray is Q = P + V.  Now find a random point Q' within a sphere centered at Q (see soft shadows above).  The new reflection direction is V' = Q' - P.  Make it a unit vector if your code requires it.
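A sketch, reusing the rejection loop from the soft-shadow trick as a hypothetical randomInUnitSphere() helper (unitVector() is assumed too):

// Jitter the "end" of the mirror ray, Q = P + V, inside a sphere of radius
// fuzz, then re-aim the ray from the hitpoint P at the jittered point Q'.
vec3 fuzzyReflect(const vec3& P, const vec3& V, double fuzz) {
    vec3 Qprime = P + V + fuzz * randomInUnitSphere();
    return unitVector(Qprime - P);   // only normalize if your code needs unit directions
}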

Depth of field (defocus blur)

Randomly perturb the eye point within a sphere centered  at the eye point.  This will work better with some ways people implement cameras than other equally good ways.
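For a camera that shoots rays from an eye point E through a point S on the viewing plane, the laziest version is something like this (a sketch with made-up names; randomInUnitSphere() as above, and the plane containing S acts as the plane of focus):

// Jitter the eye point inside a small sphere, but keep aiming at the same
// viewing-plane point S, so geometry near that plane stays sharp and
// everything else blurs.
vec3 jitteredEye = E + apertureRadius * randomInUnitSphere();
Ray r(jitteredEye, unitVector(S - jitteredEye));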

Directionally varying ambient

Instead of a constant, use Color1*max(0, dot(N, V1)) + Color2*max(0, dot(N, V2)).
Making V1 "up" with the sky (background) color and V2 "down" with the ground color will look pretty good.
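A sketch with z as "up" and made-up colors (same assumed vec3 class as above):

// Directionally varying ambient: sky color weighted by how much the normal
// faces up, plus ground color weighted by how much it faces down.
vec3 ambient(const vec3& N) {
    vec3 V1(0.0, 0.0, 1.0);           // "up"
    vec3 V2(0.0, 0.0, -1.0);          // "down"
    vec3 skyColor(0.6, 0.7, 1.0);     // made-up values; tweak to taste
    vec3 groundColor(0.4, 0.3, 0.2);
    return skyColor * fmax(0.0, dot(N, V1)) + groundColor * fmax(0.0, dot(N, V2));
}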

Directionally varying background

If you miss everything, and are too lazy for an environment map (I am!), use something that varies.  For example (0.2, 0.2, 1.0) + (1.0-fabs(v.z))*(0.6,0.6,0.0).   Fool with that.
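In code, with v the unit ray direction and z up (same caveats as above):

// Miss "shader": mostly blue, plus a yellowish term that is strongest near
// the horizon (v.z near 0) and fades toward the poles.
vec3 background(const vec3& v) {
    return vec3(0.2, 0.2, 1.0) + (1.0 - fabs(v.z)) * vec3(0.6, 0.6, 0.0);
}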

Thursday, November 27, 2014

More on tattoos

Here is a nice little graphics project: tools for optimizing illusion tattoos like these:
Here is the article that is from.

I suggest if you and some friends do this you all agree to get a tattoo using the tool at paper submission time.  I guarantee that is a strong incentive for software quality!

Radiosity on an iPhone

I went to Brad Loos' PhD defense earlier this week (it and the work presented were excellent; congratulate Brad and remind him that "Dr. Loos" sounds like a James Bond villain).  One of the things Brad talked about was modular radiance transport (MRT).  This was one of the most important works in rendering to my mind because it was a serious attempt to "make radiosity work" and, in my opinion, was the most successful one so far.  The basic idea is to divide the world into boxes with some number of open interfaces.  With some muscular linear algebra and a clever representation they make this work very well for environments that are not exactly mazes and have clutter and details and characters.  Brad mentioned in his talk that Kenny Mitchell spearheaded an MRT tech demo game that is free on the Apple App Store.  This was done at the time of the work, and it even worked on my generation 0 iPad; here's a screen shot:

It is fluid on that old hardware, and REALLY fluid on my newer phone.  Interactive radiosity really does look good (and the underlying approximations can be seen if you pay careful attention, but I would not have guessed this wasn't just a full solution except for how fast it was).  Try it-- it's fun!

Here is a video of it running.

Also, there is a full report on all the tech behind it.

Wednesday, November 26, 2014

Secondary light source

One of the most useful assumptions in graphics is that there is "direct" lighting and "indirect" lighting.  Mirrors always mess that up.  Here is an example of a shadow cast by "indirect" light:
The shadows on the ground are from the Sun which is well above the horizon.  The dimmer shadow in the doorway is from a bright "secondary" light.

The light comes from a car:
The shadow is cast from the highlight on the red car.  Interestingly, the iPhone camera makes all these bright highlights about the same, whereas in real life one is MUCH brighter.  Clamping FTW!
For games and historical visualizations it is probably best to just pretend those secondary specular lights are not there.  However, for predictive applications they should be tracked.    Dumping the whole concept of "direct" light is not irrational in such cases.

Tuesday, November 25, 2014

Photo processing tattoos

Our app extension fixes to SimplePic went live last night (we solved the App Store search bug of Pic! by just changing the name.  Crude, but effective.  The fix to the app extension was replacing some of the Swift code with Objective-C... old technology has more warts but fewer bugs).  I tested it on Tina Ziemek's tattoo, as that is one of the things SimplePic is designed for:

Top: original image.  Bottom two: output of SimplePic
The middle picture is a good example of the tradeoff in changing colors without messing with skin tones too much.  Some of the hues change a little as part of that tradeoff.  There are most definitely a few SIGGRAPH papers to be had in this domain, and I am not going to pursue it, but if anybody does please keep me apprised of what you find (there is a correlation between tattoos and iPhones, so there's a specialized app to be had as well).  The posterization I personally use when I want to exaggerate contrast (like a photo of a sign where the colors are blah), so I probably wouldn't use it here.

PS-- Many of you know Tina; FYI, she's started a game company and has a crowd-funding campaign going.  Sign up!

Monday, November 24, 2014

Strange looking reflection

This one looks to me like a bug in real life. 

(I don't mean insect).  It is a reflection of this lamp:

Friday, November 21, 2014

And a mistake in the original ray tracing paper

More fun trivia on the original ray tracing paper.  Whitted put in attenuation with distance along the reflection rays:


This is a very understandable mistake.  First, the fact that radiance doesn't fall off as distance squared is confusing when you first hit a graphics class, and there were no graphics classes with ideal specular reflection then; Whitted was inventing it!  Second, I think the picture with the bug probably looks better than it would without it: the fading checkerboard in the specular reflection looks kind of like a brushed metal effect.  So it's a good hack!
Whitted's famous ray tracing image.  From wikipedia
There are two lessons to draw from this.  First, when bugs look good, think about how to make them into a technique.  Second, when you make a mistake in print, don't worry about it: if somebody is pointing it out decades later you probably wrote one of the best papers of all time.  And if the paper is not great, it will disappear and nobody will notice the mistake!

Thursday, November 20, 2014

Another fun tidbit from the original ray tracing paper

The paper also used bounding volume hierarchies (built by hand).    It's amazing how well this paper stands up after so long.


Cloud ray tracing in 2050

My recent reread of Turner Whitted's paper from 35 years ago made me think about what kind of ray tracing one can do 35 years from now.  (I am likely to barely see that).

First, how many rays can I send now?  Of the fast ray tracers out there I am most familiar with the OptiX one from NVIDIA running on their hardware.  Ideally you go for their VCA Titan-based setup.  My read of their speed numbers is that for "hard" ray sets (incoherent path sets) on real models, a single VCA (8 Titans) can deliver over one billion rays per second (!).

Now, how many pixels do I have?  The new computer I covet has about 15 million pixels.  So the VCA machine and OptiX ought to give me around 100 rays per pixel per second even on that crazy hi-res screen.  So at 30fps I should be able to get ray tracing with shadows and specular inter-reflections (so Whitted-style ray tracing) at one sample per pixel.  And at that pixel density I bet it looks pretty good.  (OptiX guys, I want to see that!)

How would it be for path-traced previews with diffuse inter-reflection?  100 rays per pixel per second probably translates to around 10 samples (viewing rays) per pixel per second.  That is probably pretty good for design previews, so I expect this to have impact in design now, but it's marginal which is why you might buy a server and not a laptop to design cars etc.

In 35 years, what is the power we should have available?  Extending Moore's law naively is dangerous due to all the quantum limits we hear about, but with Monte Carlo ray tracing there is no reason, for the design-preview scenario where progressive rendering is used, that you couldn't use as many computers as you can afford, provided network traffic doesn't kill you.

The overall Moore's law of performance (yes, that is not what the real Moore's Law is, so we're being informal) is that historically the performance of a single chip has doubled every 18 months or so.  The number of pixels has only gone up by about a factor of 20 in the last 35 years, and there are limits of human resolution there, but let's say it goes up another factor of 10.  If the performance Moore's law continues, due to changes in computers or, more likely, to lots of processors on the cloud for things like Monte Carlo ray tracing, then we'll see about 24 more doublings, which is a factor of about 16 million.  To check that: Whitted did about a quarter million pixels per hour, so let's call that a million rays an hour, which is about 300 rays per second.  Applying our 16 million figure to that would predict about 4.8 giga-rays per second, which for Whitted's scenes I imagine people can easily get now.  Carrying the same factor forward to 2050 and spreading it over 150 million pixels, we should have ray tracing (on the cloud?  I think yes) of 10-20 million paths per second per pixel.
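For anyone who wants to fiddle with the assumptions, the back-of-the-envelope arithmetic above fits in a few lines (the numbers are just the rough ones from the text):

#include <cmath>
#include <cstdio>

int main() {
    double doublings = 35.0 * 12.0 / 18.0;     // ~23.3; call it 24
    double speedup   = std::pow(2.0, 24.0);    // ~16 million
    double raysNow   = 1e9;                    // rays/sec on one VCA today
    double rays2050  = raysNow * speedup;      // ~1.6e16 rays/sec
    double perPixel  = rays2050 / 150e6;       // spread over ~150 million pixels
    // perPixel is ~1e8 rays/sec/pixel, i.e. 10-20 million multi-ray paths/sec/pixel.
    std::printf("%.1f doublings, %.2g x speedup, %.2g rays/s, %.2g rays/s/pixel\n",
                doublings, speedup, rays2050, perPixel);
    return 0;
}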

What does this all mean?  I think it means unstratified Monte Carlo (path tracing, Metropolis, bidirectional, whatever) will be increasingly attractive.  That is in fact true for any Monte Carlo algorithm in graphics or non-graphics markets.  Server vendors: make good tools for distributed random number generation and I bet you will increase the server market!  Researchers: increase your ray budget and see what algorithms that suggests.  Government funding agencies: move some money from hardware purchase programs to server fee grants (maybe that is happening: I am not in the academic grant ecosystem at present).

Wednesday, November 19, 2014

Path tracing preview idea

In reviewing papers from the 1980s I thought again of my first paper submission in 87 or so.  It was a bad idea and rightly got nixed.  But I think it is now a good idea.  Anyone who wants to pursue it please go for it and put me in the acknowledgements: I am working on 2D the next couple of years!

A Monte Carlo ray tracer samples a high dimensional space.  As Cook pointed out in his classic paper, it makes practical sense to divide the space into 1D and 2D subspaces and parametrize them to [0,1] for conceptual simplicity.  For example:

  1. camera lens
  2. specular reflection direction
  3. diffuse reflection direction (could be combined with #2, but in practice don't for glossy layered materials)
  4. light sampling location/direction
  5. pixel area
  6. time (motion blur)
Each of these can be parametrized to [0,1]^2, and made uniform there as well, by looking at the random number seeds that feed into them.  This yields:
  1. (u1, u2)
  2. (u3, u4)
  3. (u5, u6)
  4. (u7, u8)
  5. (u9, u10)
  6. u11
You can just stratify each of these 2D sets to lower noise, and that can help a lot.  For example, generate the first pair like so (figure from Andrew Kensler's paper at Pixar):


All sorts of fun and productive games are played to improve the multidimensional properties of the sample sets across those dimensions, with QMC taking a particular approach to that, and Cook suggesting magic squares.  The approach in that bad paper was to pretend there is no correlation at all between the pairs of dimensions.  So use:

  1. (u1, u2)
  2. (u1, u2)
  3. (u1, u2)
  4. (u1, u2)
  5. (u1, u2)
  6. u1
Now we have a 2D sampling problem and we can use Warnock-style adaptive sampling (wikipedia, with a figure from that article, below)

The samples can be regular or jittered-- just subdivide when the 4-connected or 8-connected neighbors don't match.  Start with 100 or so samples per pixel.
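A sketch of the degenerate sampling plus the subdivision test (radiance() stands in for whatever your path tracer computes from those random numbers; it and the tolerance are made up here):

#include <cmath>

// With every dimension pair fed the same 2D point, the pixel estimate becomes
// a deterministic function of (u1, u2) alone.
double L(double u1, double u2) {
    return radiance(u1, u2,   // lens
                    u1, u2,   // specular direction
                    u1, u2,   // diffuse direction
                    u1, u2,   // light sample
                    u1, u2,   // pixel area
                    u1);      // time
}

// Warnock-style test: subdivide a cell of the (u1,u2) domain when its corner
// values disagree by more than a tolerance.
bool needsSubdivision(double u1, double u2, double size, double tol) {
    double c00 = L(u1, u2),        c10 = L(u1 + size, u2);
    double c01 = L(u1, u2 + size), c11 = L(u1 + size, u2 + size);
    double lo = fmin(fmin(c00, c01), fmin(c10, c11));
    double hi = fmax(fmax(c00, c01), fmax(c10, c11));
    return hi - lo > tol;
}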

Yes, this approach has artifacts, but often now Monte Carlo is used for a noisy preview, and the designer hits "go" when the noisy version shows the setup is right.  The approach above will give artifacts instead of noise, but the key question for preview (a great area to research now) is how to get the best image for the designer as the image progresses.

Note this will work best for one bounce of diffuse only, but that is often what one wants, and replacing multi-bounce with an ambient term is just another artifact.  The question for preview is how best to trade off artifacts for time.  Doing it unbiased just makes the artifact noise.

Let me know if anybody tries it.  If you have a Monte Carlo ray tracer it's a pretty quick project.

Tuesday, November 18, 2014

Choosing research topics

A graduate student recently asked me for some advice on research and I decided to put some of the advice down here.  One of the hardest things to do in research is figure out what to work on.  I reserve other topics like how to manage collaborations for another time.

What are your career goals?

To decide what to work on, first decide what goal any given piece of work leads to.  I roughly divide career paths for researchers into these:
  1. Academic researcher
  2. Academic teacher
  3. Industrial researcher
  4. Industrial advanced development
  5. start-up company
I am still trying to figure out what I most want to do on that list, and it is surely that way for most people.  So projects that further more than one goal, or a mix of projects, are fine.


What are you good at?

This should be graded on the curve.  Obvious candidates are:
  1. math
  2. systems programming
  3. prototyping programming and toolkit use
  4. collaborating with people from other disciplines (biology, econ, archeology)

What do you like?

This is visceral.  When you have to make a piece of code faster, is it fun or a pain?  Do you like making 2D pictures?  Do you like wiring novel hardware together?


Where is the topic in the field ecosystem?

How hot is the topic, and if hot, will it be played out soon?  If lots of people are already working on it you may be too late.  But this is impossible to predict adequately.  Something nobody is working on, or worse something out of fashion, is hard to get published, but find a minor venue or tech report it and it will get rewarded, sometimes a lot, later.

How does it fit into your CV portfolio?

This is the most important point in this post.  Look at your career goals and figure out what kind of CV you need.    Like stocks and bonds in the economic world, each research project is a risk (or it wouldn't be research!) and you want a long term goal reached by a mix of projects.  Also ask yourself how much investment is needed before you know whether a particular project has legs and will result in something good.


What is my advantage on this project?

If the project plays to your strengths then that is a good advantage (see "What are you good at?" above).  But access to unique data, unique collaborators, or unique equipment is a lot easier than being smarter than other people.  Another advantage can be to just recognize something is a problem before other people do.  This is not easy, but be on the lookout for rough spots in doing your work.  In the 1980s most rendering researchers produced high dynamic range images, and none of us were too sure what to do with them, so we just scaled them.  Tumblin and Rushmeier realized before anybody else that this wasn't an annoyance: it was a research opportunity.  That work is also an example of how seminal work can be harder to publish but does get rewarded in the long run, and tech reporting it is a good idea so you get unambiguous credit for being first!

How hard is it?

Will it take three years full time (like developing a new OS or whatever) or is it a weekend project?  If it's a loser, how long before I have a test that tells me it's a loser?


What is the time scale on this project?


Is the project something that will be used in industry tomorrow, in a year, 3 years, 10 years?  There should be a mix of these in your portfolio as well, but figure out which appeals to you personally and lean in that direction.

Now let's put this into practice.  


Suppose you have a research topic you are considering:
  1. Does it help my career goals? 
  2. Am I good at it?
  3. Will it increase my skills?
  4. Will it be fun?
  5. Is the topic pre-fashion, in-fashion, or stale?
  6. What is my edge in this project?
  7. Where is it in the risk/reward spectrum?
  8. How long before I can abandon this project?
  9. Is somebody paying me for this?
  10. Is this project filling out a sparse part of my portfolio?
If you have a project firing on all of these then yes, do it.  The ones I would attend to most are #1 and #10.  For #1, look at the CVs of people that just got jobs that you want and see if you are on pace.  Things may be different in four years, but it's as good a heuristic as we can get.  #10 keeps your options more open and exposes you to less risk.

Suppose you DON'T have a project in mind and that is your problem.  This is harder, but these tools can still help.  I would start with #6.  What is your edge?  Then look for open problems.  If you like games, look for what sucks in games and see if you bring anything to the table there.  If your lab has a great VR headset, think about whether there is anything there.  If you happen to know a lot about cars, see if there is something to do with cars.  But always avoid the multi-year implementation projects with no clear payoff unless the details of the work further your career goals.  For example, anyone who has written a full-on renderer will never lack for interesting job offers in the film industry.  If you are really stuck, just embrace Moore's law and project resources out 20 years.  Yes, it will offend most reviewers, but it will be interesting.  Alternatively, go to a department that interests you, go to the smartest person there, and see if they have interesting projects they want CS help with.

 

Publishing the projects

No matter what you do, try to publish the project somewhere.  And put it online.  Tech report it early.  If it isn't cite-able, it doesn't exist.  And if you are bad at writing, start a blog or practice some other way.  Do listen to your reviewers, but not like it is gospel.  Reviewing is a noisy measurement and you should view it as such.  And if your work has long-term impact, the tech report will get read and cited.  If not, it will disappear, as it should.  The short-term hills and valleys will be annoying, but keep in mind that reviewing is hard and unpaid and thus rarely done well-- it's not personal.


But WAIT.  I want to do GREAT work.


Great work is sometimes done in grad school.  But it's high-risk high-reward, so it's not wise to do it unless you aren't too particular about career goals.  The good news in CS is that most industry jobs are implicitly on the cutting edge, so it's always a debate whether industry or academics is more "real" research (I think it's a tie, but that's opinion).  But be aware of the trade-off you are making.  I won't say it can't be done (Eric Veach immediately comes to mind), but I think it's better to develop a diversified portfolio and have at most one high-risk high-reward project as a side project.  In industrial research you can get away with such research (it's exactly why industrial research exists), and after tenure you can do it, but keep in mind your career lasts 40-50 years, so taking extreme chances in the first 5-10 is usually unwise.

Sunday, November 16, 2014

A fun tidbit from the original ray tracing paper

We're about 35 years past Whitted's classic ray tracing paper.  It's available here.  One thing that went by people at the time is that he proposed randomized glossy reflection in it.


Always be on the lookout for things current authors say are too slow!

Note that Rob Cook, always a careful scholar, credits Whitted for this idea in his classic 1984 paper.

Friday, November 14, 2014

NVIDIA enters cloud gaming market

NVIDIA has fired a big shot into the cloud gaming world.  I'll watch this with great interest.  If the technology is up to snuff this is the way I would like to play games: subscription model.  One catch is that you need to buy an NVIDIA Shield (this looks to be around $200), so I think of this as a cloud-based console, and it's not at all out of whack with console costs.  The better news is I wouldn't have to pay $50 to try a game.  I think the key factors will be:
  1. What games are available and what is the price to try them and/or play them?
  2. Is the tech good?
  3. Is my internet good enough?
Knowing NVIDIA, I bet #1 and #2 will be well designed.  And it looks like the service will be "try it free" at first, so it all comes down to #3.  It probably depends on the game and my internet.  The latencies in game consoles are surprisingly high, so on South Korean internet almost all games will probably be fine.  Portal 2 and the like will be good even with normal US internet.  If anybody tries this let me know.  I'll consider a purchase for Xmas depending on what I hear from early adopters.

Thursday, November 13, 2014

Converting Spectra to XYZ/RGB values

A renderer that uses a spectral representation must at some point convert to some image format.  One can keep spectral curves in each pixel, but most think that is overkill.  Instead people usually convert to tristimulus XYZ values.  These approximate the human response to spectra, and there are a couple of versions of the standard with different viewing conditions.  But all of their components are computed using some integrated weighting function:

Response = INTEGRAL over lambda of weightingFunction(lambda) * spectralRadiance(lambda) dlambda

The weighting function is typically some big ugly table prone to typos.  Here is the first small fraction of such a table from my old code.


I whined about this a lot to Chris Wyman, who found a very nice analytic approximation to the XYZ weighting functions and published it in this paper (the only paper I have ever been on where my principal contribution was complaining).  His simplest approximation has an RMS error of about 1% (and the error in the data itself, and in how almost all monitors are calibrated, is much worse than that, so I think the simple approximation is plenty):


However, his multilobe approximation is more accurate and makes for pretty nice code:
For any graphics application I would always use one of these approximations.  If you need scotopic luminance for some night rendering you can either type in that formula or use this approximation from one of Greg Ward's papers:
I leave in that last paragraph from Greg's paper to emphasize we are lucky to now have sRGB as the de facto RGB standard.  We do want to convert XYZ to RGB for display, and sRGB is the logical choice.
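To make the plumbing concrete, here's a sketch of where the weighting functions and the sRGB conversion slot in (xbar/ybar/zbar stand for whatever table or analytic fit you use, e.g. Chris's approximations; I'm not reproducing his coefficients here):

// Riemann-sum conversion of a spectral radiance curve to XYZ.  xbar(), ybar(),
// zbar() are assumed to be the CIE color matching functions (table lookups or
// analytic fits); L(lambda) is spectral radiance at wavelength lambda in nm.
void spectrumToXYZ(double (*L)(double), double& X, double& Y, double& Z) {
    X = Y = Z = 0.0;
    const double dLambda = 5.0;   // nm
    for (double lambda = 380.0; lambda <= 780.0; lambda += dLambda) {
        X += xbar(lambda) * L(lambda) * dLambda;
        Y += ybar(lambda) * L(lambda) * dLambda;
        Z += zbar(lambda) * L(lambda) * dLambda;
    }
}

// Standard XYZ (D65) to linear sRGB matrix; gamma-encode afterwards for display.
void xyzToLinearSRGB(double X, double Y, double Z, double& R, double& G, double& B) {
    R =  3.2406*X - 1.5372*Y - 0.4986*Z;
    G = -0.9689*X + 1.8758*Y + 0.0415*Z;
    B =  0.0557*X - 0.2040*Y + 1.0570*Z;
}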

Tuesday, November 11, 2014

Making an orthonormal basis from a unit vector

Almost all ray tracing programs have a routine that creates an orthonormal basis from a unit vector n.  I am messing with my Swift ray tracer and just looked around for the "best known method".  This routine is always annoyingly inelegant in my code so I was hoping for something better.  Some googling yielded a lovely paper by Jeppe Revall Frisvad.  He builds the basis with no normalization required:


It's not obvious, but those are orthonormal.  The catch is the case where 1 + n.z = 0, so unfortunately an if is needed:

Still, even with the branch it is very nice.  I was hoping that careful use of IEEE floating point rules would allow the branch to be avoided, but I don't see it.  But perhaps some clever person will see a way to restructure it.  The terms of the form 0*0/0 should be zero in principle, and then it all works out.  The good news is that for n.z near 1.0 the stability looks good.
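Here is the construction as code, as I remember it from the paper (check Frisvad's listing for the authoritative version; a vec3 with x, y, z members is assumed):

// Build b1, b2 so that (b1, b2, n) is an orthonormal frame, with no
// normalization, following Frisvad's construction.  The branch handles the
// singular direction n = (0, 0, -1).
void buildONB(const vec3& n, vec3& b1, vec3& b2) {
    if (n.z < -0.9999999) {               // singular case: pick any valid frame
        b1 = vec3( 0.0, -1.0, 0.0);
        b2 = vec3(-1.0,  0.0, 0.0);
        return;
    }
    double a = 1.0 / (1.0 + n.z);
    double b = -n.x * n.y * a;
    b1 = vec3(1.0 - n.x * n.x * a, b, -n.x);
    b2 = vec3(b, 1.0 - n.y * n.y * a, -n.y);
}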

A more complex quartz composer example

Here is my first "real" Quartz Composer prototype.  I duplicated the spirit of our kids' photo app 'dipityPix.  This means 4 moving sprites that randomly adjust posterization thresholds when hit with the mouse.

First I use an interaction routine for each sprite that logs mouse events.   These are only live if there is a link between the interaction routine and the sprite (which is just a connection you make with the mouse).  The sprite has one of two images depending on whether the mouse is pressed.  Again this is just making the appropriate links.

Second we need random numbers.  The Random routine just spews a stream of random numbers, and we only want one when the mouse is pressed for that ball.  Thus we "sample and hold" it when the number of mouse clicks changes (a very basic dataflow concept, and surprisingly hard for me to get used to because state is more "in your face" in C-like languages).



And that's it.  I can see why people love the environment for prototyping.

For learning these basics the best resource I found was this excellent series of youtube videos by Rob Duarte.   And here is the code for the example above.

Using Quartz Composer

I wanted to prototype some apps on a Mac and Alex Naaman suggested I give Quartz Composer a try and he helped me get started with Core Image shaders as well (thanks Alex!).   My experience so far is this is a great environment.    Facebook has embraced Quartz Composer and made an environment on top of it for mobile app prototyping called Origami.

Quartz Composer is a visual dataflow environment that is similar in spirit to AVS or SCIRun or Helix for you Mac old-timers.   What is especially fun about it is that "hello world" is so easy.  Here is a program to feed the webcam to a preview window with a Core Image shader posterizing it:


The "clear" there is layer 1 (like a Photoshop layer) and just gives a black background.  The video input is the webcam.  The Core Image filter is a GLSL variant.  A cool thing is that if you add arguments to the filter, input ports are automatically added.  

To program that filter you hit "inspect" on the filter and you can type stuff right in there.

The code for this example is here.  You can run it on a Mac with QuickTime Player, but if you want to modify it you will need to install Xcode and the Quartz Composer environment.

Monday, November 10, 2014

Caustic from a distance

I spotted the strange bright spot in a parking lot and couldn't figure out where it was coming from.  It is shown on the left below.  Squatting down, the source became clear, as shown on the right.  This is a good example of a small-measure path, the kind that can make rendering hard!

Wednesday, October 22, 2014

Interesting camera/computer resolution cross-over

Apple announced an interesting new desktop that actually has more pixels than its laptops, making Apple desktop screens interesting again for some reason other than physical screen size.  Apple has also been ahead of the curve on stopping the too-many-pixels madness in its phone cameras.

While I think 4K TVs are a bridge too far and 1080p (in terminology compatible with 4K, a 1080p screen is 1920 pixels wide, making it a "2K" screen) is plenty for movies and games, 4K is interesting for uses involving text and hi-res still images.  As one experiences in Photoshop with most images, you need to either not see all the pixels by "fitting on screen" or zoom in and see "actual pixels".  But for images from any iPhone, we just saw this crossover:

iMac Retina screen:  5120 x 2880
iPhone 4s/5/6 camera:  3264 x 2448

So you can (barely) get the whole camera image on there with "actual pixels".  I expect this computer will be very popular with designers/artists/photographers.

Another interesting tidbit: similar to trends in phones, the screens are getting skinnier.  While the camera sensor is 4:3 (like every computer screen I had in the 80s and 90s), the monitor is 16:9, just like most phones these days.


Wednesday, October 8, 2014

Rendering lunar eclipses

In my googling I came across a lovely piece of work on rendering lunar eclipses by Theodore Yapo and Barbara Cutler.  Here is my favorite part of their results (though not the prettiest-- check the rest out).
Impressive correspondence!  Their online talk is a very nice summary of the work, and they have a good figure for the moon not being Lambertian:
So the BRDF of the moon is more like a stop sign than a sheet of paper.

iPhone 6+ test on eclipse

Here is a shot from Utah (OK dark sky, but urban) with the iPhone 6+.  Its auto mode ended up at a 1/4s exposure.  I also used the DSC-QX100 from the last blog post.
iPhone 6+
Sony 1/8s exposure (auto) zoomed max


closeup Sony and iPhone 6+
Obviously neither of these cameras is designed for serious low light photography and the web will be filled with cool images from amazing cameras today.   But I am overall impressed with how well they both do.

Tuesday, October 7, 2014

Very low-light test of iPhone6+ vs DSC-QX100

Here is a test against the Sony DSC-QX100 in very low light.  It is DARK.  Like movie-theater-near-the-exit-sign dark (that's as close as we will get to units).  I include this photo from the link above so you know it is a weird animal and not a conventional camera.  I like it, but I don't know whether it is getting a lot of traction or not.
Sony DSC-QX100 (from dpreview.com)
The sensors and such are quite different:
  • iPhone 6, 1/3" CMOS (? any camera nerds know more?) sensor
  • QX100,  1.0" Exmor R BSI CMOS sensor
So we should expect a 10X bigger sensor (and one good at low light, according to the review above) to crush the iPhone in this test.  It does, so the iPhone 6 camera does not have the fairy dust component I was starting to suspect.  Here they are:
Sony QX100
iPhone 6+
In fact, I'd say the iPhone 6+ picture is subjectively more realistic.  But let's scale it up in Photoshop and see how much information is there.
iPhone6+ scaled in photoshop
 Conclusion: in some situations you just need a big low-light sensor!

Sergey Schetinin suggested in the comments that there might be good info in the EXIF files (stripped in the images above), so I just grabbed the free version of Exif Wizard (really nice little program!  I will get the pro one).  Here are some screen dumps (interesting that both are 1/4s):


Saturday, October 4, 2014

HTC One vs iPhone6+ low light

My impression from earlier tests was that indoors the HTC One and the iPhone 5s cameras were about a tie.  So I would expect the iPhone6+ to be a clear winner.  In low light (subjectively, where I would have trouble reading but not "dark") here is the HTC One:
HTC One
And the iPhone6+
iPhone 6+
 And some closeups.  The yellow label is where we focused.

Left: iPhone6+.  Right HTC One.  100x100 pixels
Left: iPhone6+.  Right HTC One.  100x100 pixels
All in all, the new phone is better, but the HTC One is more than respectable!

Tuesday, September 30, 2014

Greys in skies

I've been messing with sky photos to try to make them look better algorithmically and realized they have more greys in them than I thought (at first I assumed my saturation histograms were wrong).  Here's an example that shows a grey region between blue and orange (full photo at John Roever's flickr):
And one over the ocean (full photo at this Carmel page):
And there is grey even when there is only the slightest hint of orange (from here), although with a slight green bias:
So greys are just more common than I realized.

Wednesday, September 24, 2014

More real-world lighting that looks like a graphics bug

Here is a scene from my house.  No nightlights or floor lighting.  These pictures are not exaggerated.
That looks like there is a light!
The light actually comes from the window but still it looks like there is a light under the couch


Another view.  The sun on the couch is just very very bright