Overly long pages may not be indexed

We recently received a very typical crawl-optimization case. It is summarized here for all webmasters, so that the same problem does not occur on your sites.

Site feedback:

The site's main content is generated by JS for user access, without any optimization. For crawlers, however, the site serves an optimized version, in which the images are also converted directly to Base64. After this optimization, the content was still not indexed by Baidu.

The page quality is very good, and the pages were specially optimized for the crawler, so why was the content not indexed?

Engineer analysis:

1. In the version optimized for crawlers, the binary content of the images is placed directly in the HTML, which makes the page too long: the page length reaches 164K.

2. After optimization, the main content is placed at the end of the page, while the images are placed in front.

3. When the crawler fetches the page, the overly long content is truncated. The part that was fetched does not contain the main content, so the page is judged to be empty or too short and is not indexed.

Engineer recommendations:

1. Generating the main content with JS is not recommended. If JS rendering fails, the page content is likely to be read incorrectly and the page cannot be crawled.

2. If the site is optimized for crawlers, keep the page length within 128K; do not make it too long.

3. When optimizing for crawlers, put the main content at the front of the page, so that truncation does not cause it to be captured incompletely.
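
As a rough illustration of the 128K guideline and of why the position of the main content matters, the following Python sketch measures a page's size in bytes and checks whether the main content begins before the limit. It is only a sketch under stated assumptions: the helper `check_page`, the marker `<article`, and the reading of 128K as 128 * 1024 bytes are all hypothetical choices made for this example, and the crawler's actual truncation behavior is not documented here.

```python
import base64

# The 128K guideline from the text, read here as 128 KiB (an assumption).
LIMIT_BYTES = 128 * 1024


def check_page(html: str, main_marker: str, limit: int = LIMIT_BYTES) -> dict:
    """Report page size and whether the main content starts before the limit.

    main_marker is a hypothetical string that identifies where the main
    content begins, for example an opening tag such as '<article'.
    """
    raw = html.encode("utf-8")
    pos = raw.find(main_marker.encode("utf-8"))
    return {
        "size_bytes": len(raw),
        "within_limit": len(raw) <= limit,
        "main_offset": pos,  # -1 if the marker is missing
        "main_starts_before_limit": 0 <= pos < limit,
    }


# Simulate the reported case: a large Base64 image inlined into the HTML.
image_bytes = bytes(120 * 1024)  # 120 KiB of dummy image data
encoded = base64.b64encode(image_bytes).decode("ascii")
inline_image = '<img src="data:image/png;base64,' + encoded + '">'
main_content = "<article>Main content of the page</article>"

# Same page, two layouts: image before the main content, or after it.
image_first = "<html><body>" + inline_image + main_content + "</body></html>"
content_first = "<html><body>" + main_content + inline_image + "</body></html>"

print(len(image_bytes), len(encoded))  # Base64 output is about 4/3 of the input size
print(check_page(image_first, "<article"))
print(check_page(content_first, "<article"))
```

In this simulated page, Base64 encoding alone pushes a 120 KiB image past the limit. With the image first, the main content begins beyond the 128 KiB mark. With the main content first, it begins near the top of the page, although the page as a whole is still over the limit, so shortening the page remains advisable.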

