Capybara and asynchronous stuff

In this entry I will try to cover the following aspects:

  1. Running asynchronous code in a web driver
  2. Making the call synchronous
  3. Wrapping it into some common solution
  4. Advanced example - working with IndexedDB from Capybara

 What is Capybara, Poltergeist and PhantomJS?


First of all, we need to know what is PhantomJS. I would say it’s a ‘tool that acts like a browser but can be controlled from outside using simple command interface’. In more common words, it’s a web driver. It’s a full-featured WebKit (an engine of Chrome/Safari/few last versions of Opera and other browsers), but in console. You can use it for scripting, automating or testing.


Poltergeist is a Ruby wrapper for PhantomJS. Usually you write the code for PhantomJS on JavaScript, with Poltergeist you can run it on Ruby.


Well, I’m pretty sure you know what it is. It’s a test framework. And it supports different web drivers like:

RackTest - web driver that doesn’t support javascript, extremely fast, but deadly primitive; best solution for tests that don’t require any JS execution

Selenium - the most popular web driver


  1. Supports a lot of browsers (so you can run multiple test suites in different browsers)
  2. Super stable


  1. Requires X server to be installed on the machine that runs it (some cloud CI services just don’t have it)
  2. Opens a real browser that executes your test scenario, that slows your test suite.

Capybara-webkit - headless WebKit (doesn’t run a browser), but still requires X server

Poltergeist - see above, headless WebKit, doesn’t require X server, so you can run it everywhere.

 Asynchronous JavaScript code in your tests

When you write integration tests sometimes you need to run asynchronous JavaScript in context of your page and get a response back to Ruby. Here is an example from my current project: the client part of our application supports offline mode, so we store the data in WebSQL and sync it with server once connection is restored. The API of WebSQL is asynchronous, so we have a result of execution in some provided callback:

// something like
WebSQLWrapper.execute('select 1', function(result) {
  // result of execution becomes available after some time

We have integration tests with Capybara+Poltergeist+PhantomJS combo that tests the whole stack of interaction between a client-side application and a server API. And WebSQL should be flushed between tests or populated with startup data before some specific tests. We also have delayed operations on the client that runs periodically.

All these features require us to run JavaScript code manually in PhantomJS between/before tests.

 Capybara/Poltergeist/PhantomJS installation

Quite simple:

  1. PhantomJS: sudo apt-get install phantomjs or download binaries from the official site (you can even try 2.0 beta there).
  2. Capybara: gem 'capybara' and that’s it.
  3. Poltergeist: gem 'poltergeist'.

Require both of them in your spec_helper and force Capybara to use Poltergeist as a web driver:

require 'capybara/rails'
require 'capybara/poltergeist'

Capybara.register_driver :poltergeist_debug do |app|
  driver_options = {
    inspector: true,
    timeout: 5,
    js_errors: false,
    debug: false,
      logger: Kernel,
      js_errors: true,
      debug: true,
      phantomjs_logger:'log/phantomjs.log'), 'a')
  end, driver_options)

Capybara.javascript_driver = :poltergeist_debug
Capybara.default_driver = :poltergeist_debug

With this configuration Poltergeist doesn’t print any noisy output, but you can enable it by passing DEBUG_PHANTOMJS=true

 Small example

Here is a simple of the code that returns it’s response asynchronously:

setTimeout(function() {
  alert('Thanks for waiting 1 second');
}, 1000);

This code actually does nothing, but it’s a demonstration of how asynchronous stuff works.

 But… Capybara waits for my AJAX requests

Yes, when you write something like:

it 'displays a message' do
  expect(page).to have_text('Hey')

and at the moment of running this expectation your page doesn’t have this text but receives it a second later - the test will be green. Why?

Capybara has a setting called default_wait_time (was changed to default_max_wait_time, but is still acceptable) which is 2 seconds by default. Here is how Capybara uses it.

It runs the code again and again, and stops if

  1. It has a result of code execution - then it simple returns it
  2. The time has come (2 seconds) - then it raises an error

(A little remark here. Capybara saves the time on the beginning of this method and on every iteration compares this time with - this is a very nice hack to save Capybara from wrapping API calls into Timecop.freeze - good job!)

 Can we reuse it?

Yes, of course. Let’s simplify it a little bit:

module WaitHelper
  extend self

  # Calls provided +block+ every 100ms
  #   and stops when it returns false
  # @param timeout [Fixnum]
  # @yield block for execution
  # @example
  #   current_time =
  #   WaitHelper.wait_until(3) do
  # - current_time > 2
  #   end
  #   # 2 seconds later ...
  #   # => true
  #   current_time =
  #   WaitHelper.wait_until(3) do
  # - current_time > 10
  #   end
  #   # 3 seconds later (after timeout)
  #   # => false
  def wait_until(timeout, &block)
      Timeout.timeout(timeout) do
        sleep(0.1) until value =
    rescue TimeoutError

Alright, now let’s use it.

  1. Modify JS code to set result into some global variable (JS function context, you know)
  2. Run the code that gets response in callback.
  3. Run the code that polls the response from global varialbe
  4. Wrap this code with WaitHelper.wait_until(timeout) {}
code_for_execution = <<-JS
  setTimeout(function() {
    window.asyncResponse = 'some response';
  }, 1000)

code_for_polling = 'window.asyncResponse'

result = WaitHelper.wait_until(2) do
puts result
# => 'some response'

 Can we organize it as a common reusable solution?

Why not. Here is a gem called capybara-async-runner. And here is how to use it.


# Gemfile
gem 'capybara-async_runner'
# spec/spec_helper.rb
require 'capybara/async_runner'

First of all, I don’t like to mix JS and Ruby code in a single file, so we need templates (like .js.erb). You need to specify the directory with templates:

# spec/spec_helper.rb
Capybara::AsyncRunner.setup do |config|
  config.commands_directory = Rails.root.join('spec/fixtures/async_commands')

Let’s write our first command

class TestCommand < Capybara::AsyncRunner::Command
  # global command name
  self.command_name = :test_command_name

  # .js.erb file in directory specified above
  self.file_to_run = 'template'

  response :parsed_json do |data|

This class follows the Command pattern, you can invoke it in the following way:
# or

Let’s create our template for this command:

// spec/fixtures/async_commands/template.js.erb
setTimeout(function() {
  var json = JSON.stringify([1,2,3]);
  <%= parsed_json(js[:json]) %>

There are few things that I need to explain:

  1. parsed_json - this is an output point from the script that was defined in the command class. When you call it in the template, it embeds some JavaScript that stores passed data into the global variable. parsed_json(123) produces something like window.parsed_json_result = 123 (so we can can grab this response later from the second script)
  2. js - this is a proxy method from the gem that acts like a Hash. The method [] on this Hash returns passed key (and the whole method js is kind of “JavaScript memory”). <%= parsed_json(js[:json]) %> produces window.parsed_json_result = json which is exactly what we need.
  3. When you call, the gem executes the script generated from your template in the context of Capybara.current_session. Then it subscribes to all defined responses (this is actually, the second script) and returns the first one that becomes defined (window.parsed_json_result in this case).
  4. The block that we’ve specified for parsed_json is like a handler for transforming data which it returns (we invoke parsed_json method with json variable which contains raw JSON, our handler parses it)

If you are familiar with templates in Ruby, just quickly look at the code, this class is a context of rendering.

 Wrapping up

You need to follow these steps to create a command using a gem:

  1. Specify the directory with templates in the gem config
  2. Create a command class
  3. Specify its name
  4. Specify the name of template
  5. Create a template file
  6. Define a response(s)
  7. Call them in the template

 Passing data to template

You can pass any data to the template:

class TestCommand < Capybara::AsyncRunner::Command
  self.command_name = :test
  self.template = 'template'
  # if you don't pass any block
  # it will return raw value
  response :done

data = {
  name: 'Ilya'
}, data)
# => 'Ilya'

And use it template:

someLongRunningMethod(function() {
  // 'done' is a method that generates JavaScript
  // 'data' returns data that we've passed to 'run'
  // :name is a key in that Hash
  <%= done(data[:name]) %>

 Let’s write something complex

As I mentioned before, on my current project we use WebSQL, but it’s deprecated, so I’m not going to use it in examples. Instead let’s write a wrapper for IndexedDB.

First of all, we need a JavaScript wrapper, I don’t like the native IndexedDB API. First result from google = Dexie.js.

Let’s plan our scenario:

  1. Visit any page
  2. Inject Dexie.js into the page
  3. Create IndexedDB instance
  4. Write some data to the database
  5. Read them and print

 Visiting the page


 Injecting Dexie.js into the page

How to invoke:

module IndexedDB
  URL = ''
end'indexeddb:wrapper:inject', url: IndexedDB::URL)


  1. JS code detects whether the library has already been loaded
  2. If not - appends <script> tag to the <head>
  3. If yes - returns ‘success’
  4. If 3 seconds passed and we still have no library code - returns ‘error’

The command raises an exception if response is error.

All this manipulations are synchronous for Ruby. The end of running the command means that we can continue execution.

Moreover, this command is safe, we can call it multiple times and it will inject the script into the page only once.

 Create IndexedDB instance

How to invoke:'indexeddb:wrapper:initialize')


  1. JS opens the database using Dexie.js
  2. If callback was called, returns sucess
  3. If errback was called, returns error with error message
  4. Ruby command raises error if error was returned

 Write some data to the database

How to invoke:

user_data = { name: 'Some Name' }

user_id ='indexeddb:insert', store: 'users', data: user_data)
p "User ID: #{user_id}"

This step is quite simple if you understand the previous one.

 Reading data

How to invoke:

methods = [
  { method: 'where', arguments: ['id']},
  { method: 'equals', arguments: [user_id] },
  { method: 'toArray', arguments: [] }

p'indexeddb:query', store: 'users', methods: methods)

Here we pass an array of methods and their arguments to template, iterate over them and build a dexie scope (just like ActiveRecord::Relation), and return a result back to ruby command.

After wrapping it even more we can get interface like this

Full example can be found here


I would say the topic of this article is not so popular. Single page applications that work in offline mode (and because of this, use WebSQL/IndexedDB/some other async storage, probably) are still not frequent today. Usually when you build an SPA, you just write your tests using Jasmine or something like that. You mock your requests to the server API and test your client in some isolated environment. But these tests are still functional (you verify a single component - client, in this case - but not the whole application).

Someday working on a rich client-side application, remember about the idea of this post. You can control the logical flow of your client-server communication in integration tests, and this is good. Wrap your client code, build a micro-framework on top of this gem and test every piece of your code.


Now read this

Ruby Marshalling from A to Z

Marshalling is a serialization process when you convert an object to a binary string. Ruby has a standard class Marshal that does all the job for serialization and deserialization. To serialize an object, use Marshal.dump, to deserialize... Continue →