This work proposes a novel data-driven robust model predictive control (DDRMPC) framework for automatic control of greenhouse temperature and CO2 concentration level. The essential concept is to combine dynamic models of greenhouse temperature and CO2 concentration level with data-driven models that identify uncertainty in weather forecast error. By leveraging a machine learning approach, support vector clustering with weighted generalized intersection kernel, data-driven uncertainty sets for ambient temperature and solar radiation are constructed from historical weather data. A training-calibration procedure that tunes the size of uncertainty sets is implemented to ensure that data-driven uncertainty sets attain appropriate performance guarantee. In order to solve the optimization problem in DDRMPC, an affine disturbance feedback policy that provides tractable approximations of optimal control is utilized. A case study of controlling temperature and CO2 concentration level in a greenhouse is carried out. The results show that the proposed DDRMPC framework can prevent the greenhouse climate from becoming harmful to plant and fruit. DDRMPC approach ends up with 20 % less total economic cost than rule-based control strategy. The proposed DDRMPC approach also gives better control performance comparing to certainty equivalent MPC and robust MPC.