I am using regular expressions in Python to search through a page source, and find all the json information in the javascript. Specifically an example would look something like this:
var fooData = { id: 123456789, name : "foo bar", country_name: "foo", country_is_eu: null, foo_bars: null, foo_email: null, foo_rate: 1.0, foo_id: 0987654321 };
I'm fairly new to understanding all there is to know about regular expressions, and I'm not sure if what I'm doing is correct. I can get some individual lines, but I'm not completely sure of how to use re.MULTILINE. This is the code I have so right now:
prog = re.compile('[var ]?\w+ ?= ?{[^.*]+\n};', re.MULTILINE)
vars = prog.findall(text)
Why is this not working?
To be more clear, I really need it to match everything in between these brackets like this:
var fooData = { };
So, essentially I can't figure out a way to match every line except one that looks like this:
};
[^}]+
, I did not know you could do that.json
module that is useful. It sounds like you should be using that instead of regex.